We've been doing some tests with Apache UIMA. The results are amazing!
Our ideal set up would be a tight integration of UIMA with SOLR. Ideally, we like to pass all the content we index past UIMA, for additional metadata.
There are examples and documentation available how to do the the UIMA integration with the SOLR Update handler, but ideally we'd like to use the existing DataImportHandler (DIH). All our data is imported though a JDBC connection, and should be passed to UIMA before indexing.
Can anybody shed some light on how to accomplish this?
Thanks,
René