0

是否可以使用 Solr 在 HTML 文件中进行搜索,例如抓取站点?

4

2 回答 2

1

Solr 只是搜索索引。看一下用于抓取网络的 nutch。 http://nutch.apache.org/about.html solr 将索引 HTML 就好了。

于 2012-05-07T16:28:15.347 回答
0

Quoting http://wiki.apache.org/nutch/NutchTutorial#A4._Setup_Solr_for_search

If all has gone to plan, we are now ready to search with http://localhost:8983/solr/admin/. If you want to see the raw HTML indexed by Solr, change the content field definition in schema.xml to:

于 2012-05-08T12:21:49.150 回答