在 NLTKsenseval
模块中,语义的形式HARD1
为 ,HARD2
等(参见此处的源代码)。但是,似乎没有办法获得实际定义。我正在尝试实现 Lesk 算法,现在我正在尝试检查 Lesk 算法预测的感觉是否正确(使用 WordNet 的定义)。
我遇到的问题是如何将 WordNet 定义与senseval
答案 ( HARD1
, HARD2
) 统一起来。有人知道如何将 SENSEVAL 意义翻译成定义,或者在某个地方查找吗?
在 NLTKsenseval
模块中,语义的形式HARD1
为 ,HARD2
等(参见此处的源代码)。但是,似乎没有办法获得实际定义。我正在尝试实现 Lesk 算法,现在我正在尝试检查 Lesk 算法预测的感觉是否正确(使用 WordNet 的定义)。
我遇到的问题是如何将 WordNet 定义与senseval
答案 ( HARD1
, HARD2
) 统一起来。有人知道如何将 SENSEVAL 意义翻译成定义,或者在某个地方查找吗?
我最终发现这些与 WordNet 1.7 中的感觉相对应,这是相当古老的(在 Mac OS X 或 Ubuntu 11.04 上似乎不容易安装)。
我找不到 WordNet 1.7 的在线版本。
这个站点也有一些关于这三个语料库的有用信息。例如,它说的六种含义interest
取自在线朗文英语词典(大约 2001 年)。看这里
它将 HARD 的来源描述为 WordNet 1.7。
最终,我最终手动将定义映射到 WordNet 3.0 中的定义。如果你有兴趣,这里是字典。但是请注意,我不是语言学专家,而且它们并不准确
# A map of SENSEVAL senses to WordNet 3.0 senses.
# SENSEVAL-2 uses WordNet 1.7, which is no longer installable on most modern
# machines and is not the version that the NLTK comes with.
# As a consequence, we have to manually map the following
# senses to their equivalent(s).
SV_SENSE_MAP = {
"HARD1": ["difficult.a.01"], # not easy, requiring great physical or mental
"HARD2": ["hard.a.02", # dispassionate
"difficult.a.01"],
"HARD3": ["hard.a.03"], # resisting weight or pressure
"interest_1": ["interest.n.01"], # readiness to give attention
"interest_2": ["interest.n.03"], # quality of causing attention to be given to
"interest_3": ["pastime.n.01"], # activity, etc. that one gives attention to
"interest_4": ["sake.n.01"], # advantage, advancement or favor
"interest_5": ["interest.n.05"], # a share in a company or business
"interest_6": ["interest.n.04"], # money paid for the use of money
"cord": ["line.n.18"], # something (as a cord or rope) that is long and thin and flexible
"formation": ["line.n.01","line.n.03"], # a formation of people or things one beside another
"text": ["line.n.05"], # text consisting of a row of words written across a page or computer screen
"phone": ["telephone_line.n.02"], # a telephone connection
"product": ["line.n.22"], # a particular kind of product or merchandise
"division": ["line.n.29"], # a conceptual separation or distinction
"SERVE12": ["serve.v.02"], # do duty or hold offices; serve in a specific function
"SERVE10": ["serve.v.06"], # provide (usually but not necessarily food)
"SERVE2": ["serve.v.01"], # serve a purpose, role, or function
"SERVE6": ["service.v.01"] # be used by; as of a utility
}