24

Lucene 有一个默认的停止过滤器(http://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis/core/StopFilter.html),有谁知道列表中的单词是什么?

4

1 回答 1

44

和中设置的默认停用词来自,如源文件中所示:StandardAnalyzerEnglishAnalyzerStopAnalyzer.ENGLISH_STOP_WORDS_SET

"a", "an", "and", "are", "as", "at", "be", "but", "by",
"for", "if", "in", "into", "is", "it",
"no", "not", "of", "on", "or", "such",
"that", "the", "their", "then", "there", "these",
"they", "this", "to", "was", "will", "with"

StopFilter本身没有定义默认的停用词集。

于 2013-07-08T16:33:21.517 回答