Lucene 有一个默认的停止过滤器(http://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis/core/StopFilter.html),有谁知道列表中的单词是什么?
问问题
30108 次
1 回答
44
和中设置的默认停用词来自,如源文件中所示:StandardAnalyzer
EnglishAnalyzer
StopAnalyzer.ENGLISH_STOP_WORDS_SET
"a", "an", "and", "are", "as", "at", "be", "but", "by",
"for", "if", "in", "into", "is", "it",
"no", "not", "of", "on", "or", "such",
"that", "the", "their", "then", "there", "these",
"they", "this", "to", "was", "will", "with"
StopFilter
本身没有定义默认的停用词集。
于 2013-07-08T16:33:21.517 回答