根据corporatezen.com/2013/11/updating-solr-engine-coldfusion,我正在使用应该使用Solr 3.4的CF10。我添加<charFilter class="solr.HTMLStripCharFilterFactory"/>
了,<fieldType name="text">
但搜索结果中的摘要字段仍然包含 HTML。知道为什么吗?
<field name="summary" type="text" indexed="false" stored="true" required="false" />
http://localhost:8985/solr/test/admin/schema.jsp显示:
字段:摘要字段类型:TEXT
属性:标记化、存储
模式:标记化,存储
位置增量差距:100
指数分析器:org.apache.solr.analysis.TokenizerChain 详情
字符过滤器:
org.apache.solr.analysis.HTMLStripCharFilterFactory args:{luceneMatchVersion: LUCENE_24 } Tokenizer Class: org.apache.solr.analysis.WhitespaceTokenizerFactory
过滤器:
org.apache.solr.analysis.StopFilterFactory args:{words: stopwords.txt ignoreCase: true enablePositionIncrements: true luceneMatchVersion: LUCENE_24 } org.apache.solr.analysis.WordDelimiterFilterFactory args:{splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 1 luceneMatchVersion: LUCENE_24 generateWordParts: 1 catenateAll: 0 catenateNumbers: 1 } org.apache.solr.analysis.LowerCaseFilterFactory args:{luceneMatchVersion: LUCENE_24} org.apache.solr.analysis.EnglishPorterFilterFactory args:{protected: protwords.txt luceneMatchVersion: LUCENE_24 } org. apache.solr.analysis.RemoveDuplicatesTokenFilterFactory args:{luceneMatchVersion: LUCENE_24 } 查询分析器:org.apache.solr.analysis.TokenizerChain 详情
字符过滤器:
org.apache.solr.analysis.HTMLStripCharFilterFactory args:{luceneMatchVersion: LUCENE_24 } Tokenizer Class: org.apache.solr.analysis.WhitespaceTokenizerFactory
过滤器:
org.apache.solr.analysis.SynonymFilterFactory args:{synonyms: synonyms.txt expand: true ignoreCase: true luceneMatchVersion: LUCENE_24 } org.apache.solr.analysis.StopFilterFactory args:{words: stopwords.txt ignoreCase: true luceneMatchVersion: LUCENE_24 } org.apache.solr.analysis.WordDelimiterFilterFactory 参数:{splitOnCaseChange:1 generateNumberParts:1 catenateWords:0 luceneMatchVersion:LUCENE_24 generateWordParts:1 catenateAll:0 catenateNumbers:0} org.apache.solr.analysis.LowerCaseFilterFactory 参数:{luceneMatchVersion:LUCENE_24 } org.apache.solr.analysis.EnglishPorterFilterFactory args:{protected: protwords.txt luceneMatchVersion: LUCENE_24 } org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory args:{luceneMatchVersion: LUCENE_24 }