python - StanfordCoreNLP openIE 问题

Question

我面临与斯坦福 CoreNLP OpenIE 注释器相同的问题，我尝试 output = nlp.annotate(s, properties={"annotators":"tokenize,ssplit,pos,depparse,natlog,openie", "outputFormat": "json", "openie.triple.strict":"true", "openie.max_entailments_per_clause":"1","openie.splitter.disable":"true"})

但我仍然得到 4 个子句

(u'are pull from', u'Twenty percentelectric motor', u'组装线') (u'are pull from', u'percent Electric motor', u'组装线') (u'are', u '百分之二十的电动机'，你'拉'）（你是'，你'百分比电动机'，你'拉'）我做错了什么吗？如何获得精确的三倍（'从'，'百分之二十的电动机'，'装配线'）

score 1 · Accepted Answer

这实际上是预期的行为。OpenIE 系统的设计决策是生成所有在逻辑上由原始句子包含的三元组，即使它们是多余的。这个想法是这些三元组通常用于类似于 IR-ish 查找的东西，在这些情况下，不必对任何三元组是否与查询“足够相似”进行模糊匹配是很方便的。

python - StanfordCoreNLP openIE 问题

1 回答 1

Related

Reference