问题
SOLR DIH 总结每次迭代中的查询。就像在第三次迭代中一样,产生以下输出
"entity:us-patent-grant-xslt",
[
"document#3",
[
"query",
"/var/www/data1/US07985001-20110726.XML",
"query",
"/var/www/data1/US07985001-20110726.XML",
"query",
"/var/www/data1/US07985001-20110726.XML",
"time-taken",
"0:0:0.0",
"time-taken",
"0:0:0.0",
"time-taken",
"0:0:0.0",
null,
"----------- row #1-------------",
"id",
"US7985001",
"pub_date",
"2011-07-26 00:00:00",
null,
"---------------------------------------------"
],
数据配置文件
<entity name="pickupdir"
processor="FileListEntityProcessor"
rootEntity="false"
dataSource="null"
fileName="^[\w\d-]+\.XML$"
baseDir="/var/www/data1/"
recursive="true"
onError="skip">
<entity name="us-patent-grant-xslt"
url="${pickupdir.fileAbsolutePath}"
xsl="data.xsl"
processor="XPathEntityProcessor"
useSolrAddSchema= "true"
rootEntity="true"
onError="skip">
<field column="id" />
<field column="pub_date" />
</entity>
</entity>
因此,当我在每次迭代中批量上传数据时,查询总结和性能滞后。目前我的服务器每秒处理 2 个文档。