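I have the following commands, which I run together as a batch. They run Nutch and send the results to Solr. I have read that each of these commands corresponds to a Java class, and I would like to call those classes so I can run the crawl programmatically.
Which Java classes do these commands correspond to?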
bin/nutch inject crawl/crawldb urls    # urls: text file containing a list of URLs
bin/nutch generate crawl/crawldb crawl/segments
export SEGMENT=crawl/segments/`ls -tr crawl/segments|tail -1`
bin/nutch fetch $SEGMENT -noParsing
bin/nutch parse $SEGMENT
bin/nutch updatedb crawl/crawldb $SEGMENT -filter -normalize
bin/nutch invertlinks crawl/linkdb -dir crawl/segments
bin/nutch solrindex http://localhost:8080/solr/ crawl/crawldb crawl/linkdb crawl/segments/*
Thanks