0

We've been doing some tests with Apache UIMA. The results are amazing!

Our ideal set up would be a tight integration of UIMA with SOLR. Ideally, we like to pass all the content we index past UIMA, for additional metadata.

There are examples and documentation available how to do the the UIMA integration with the SOLR Update handler, but ideally we'd like to use the existing DataImportHandler (DIH). All our data is imported though a JDBC connection, and should be passed to UIMA before indexing.

Can anybody shed some light on how to accomplish this?

Thanks,

René

4

1 回答 1

1

数据导入处理程序确实支持 update.processor。
因此,您可能可以将 dataimport 处理程序与 uima 更新处理器联系起来 -

<requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler">
    <lst name="defaults">
        <str name="config">data-config.xml</str>
        <str name="update.processor">uima</str>
    </lst>
</requestHandler>

只是一种选择,我自己没试过。

于 2011-09-28T13:01:40.333 回答