0

我试图在他们声明的文档中复制使用斯坦福核心 NLP 的论文的结果:

the fully annotated sentences are provided in a file of concatenated
  protocol buffers:

delimitedSentences.proto.bz

This file should be read with the Java function
  `CoreNLPProtos.Sentence.parseDelimitedFrom(<input stream>)`,
  or in other languages taking into consideration that every protocol buffer is
  prepended with the size of the buffer, as a VarInt.
Each proto contains all the annotations for the MIML-RE featurizer, in addition to
  some useful additions (e.g., antecedent for every token).

我已经搜索了该CoreNLPProtos.Sentence.parseDelimitedFrom(<input stream>)函数的代码,但无处可寻。

我对protos不太熟悉。

我该怎么办?

4

1 回答 1

0

希望这些将出现在 CoreNLP 的下一个版本中——与此同时,该文件位于公共 GitHub 上:https ://github.com/stanfordnlp/CoreNLP/blob/master/src/edu/stanford/nlp/pipeline /CoreNLPProtos.java

如果您在使用数据时遇到其他问题,请告诉我!我可以在出现错误时修复它们,因此希望该过程对未来的用户来说更顺畅。

于 2015-03-04T19:29:04.927 回答