3

这是我的代码的一部分:

String sentence="My dog also likes eating sausage.";
LexicalizedParser lp = new LexicalizedParser("englishPCFG.ser.gz"); 
TokenizerFactory tf = PTBTokenizer.factory(false, new WordTokenFactory());
TreePrint tp = new TreePrint("penn,typedDependenciesCollapsed");

List tokens = tf.getTokenizer(new StringReader(sentence)).tokenize(); 
lp.parse(tokens); // parse the tokens
Tree t = lp.getBestParse();

如何获得主题(狗)的价值?

这是我想提取主题的依赖项:

nsubj(likes-4, dog-2)
4

1 回答 1

6

尝试这样的事情:

String sentence="My dog also likes eating sausage.";
LexicalizedParser lp = new LexicalizedParser("resources/stanford-parser-2011-06-27/grammar/englishPCFG.ser.gz");
TokenizerFactory tf = PTBTokenizer.factory(false, new WordTokenFactory());
TreePrint tp = new TreePrint("penn,typedDependenciesCollapsed");

List tokens = tf.getTokenizer(new StringReader(sentence)).tokenize();
lp.parse(tokens); // parse the tokens
Tree t = lp.getBestParse();

TreebankLanguagePack languagePack = new PennTreebankLanguagePack();
GrammaticalStructure structure = languagePack.grammaticalStructureFactory().newGrammaticalStructure(t);
Collection<TypedDependency> typedDependencies = structure.typedDependenciesCollapsed();

for(TypedDependency td : typedDependencies) {
  if(td.reln().equals(EnglishGrammaticalRelations.NOMINAL_SUBJECT)) {
    System.out.println(td);
  }
}

这将打印:

nsubj(likes-4, dog-2)
于 2011-11-05T21:09:22.293 回答