我已经将 nutch 2.3.1 与 solr 6.5 集成在一起,这样我可以将数据推送到 solr 并获得索引。现在我想删除重复的元素,为此我在 schema.xml 和 solrconfig.xml 中进行了修改
<field name="signatureField" type="string" stored="true" indexed="true" multiValued="false" />
<updateRequestProcessorChain name="dedupe">
<processor class="solr.processor.SignatureUpdateProcessorFactory">
<bool name="enabled">true</bool>
<str name="signatureField">id</str>
<bool name="overwriteDupes">false</bool>
<str name="fields">id,content,date,url</str> <!-- changing to id <str name="fields">name,features,cat</str>-->
<str name="signatureClass">solr.processor.Lookup3Signature</str>
</processor>
<processor class="solr.LogUpdateProcessorFactory" />
<processor class="solr.RunUpdateProcessorFactory" />
</updateRequestProcessorChain>
<requestHandler name="/update" class="solr.UpdateRequestHandler" >
<lst name="defaults">
<str name="update.chain">dedupe</str>
</lst>
</requestHandler>
但是在索引 bin/nutch solrindex http://localhost:8983/solr/testcore -all 错误之后!请帮我解决这个问题
提前谢谢你:)