java - 查看级联协议缓冲区的文件：Stanford CoreNLP

Question

我试图在他们声明的文档中复制使用斯坦福核心 NLP 的论文的结果：

the fully annotated sentences are provided in a file of concatenated
  protocol buffers:

delimitedSentences.proto.bz

This file should be read with the Java function
  `CoreNLPProtos.Sentence.parseDelimitedFrom(<input stream>)`,
  or in other languages taking into consideration that every protocol buffer is
  prepended with the size of the buffer, as a VarInt.
Each proto contains all the annotations for the MIML-RE featurizer, in addition to
  some useful additions (e.g., antecedent for every token).

我已经搜索了该CoreNLPProtos.Sentence.parseDelimitedFrom(<input stream>)函数的代码，但无处可寻。

我对protos不太熟悉。

我该怎么办？

score 0 · Accepted Answer

希望这些将出现在 CoreNLP 的下一个版本中——与此同时，该文件位于公共 GitHub 上：https ://github.com/stanfordnlp/CoreNLP/blob/master/src/edu/stanford/nlp/pipeline /CoreNLPProtos.java

如果您在使用数据时遇到其他问题，请告诉我！我可以在出现错误时修复它们，因此希望该过程对未来的用户来说更顺畅。

java - 查看级联协议缓冲区的文件：Stanford CoreNLP

1 回答 1

Related

Reference