4

我使用以下代码(准确地说是 PyLucene)调用 Lucene:

analyzer = StandardAnalyzer(Version.LUCENE_30)
queryparser = QueryParser(Version.LUCENE_30, "text", analyzer)
query = queryparser.parse(queryparser.escape(querytext))

但请考虑这是否是 的内容querytext

querytext = "THE FOOD WAS HONESTLY NOT WORTH THE PRICE. MUCH TOO PRICY WOULD NOT GO BACK AND OR RECOMMEND IT"

在这种情况下,“AND OR”会使查询解析器出错,即使我使用的是queryparser.escape. 如何避免以下错误消息?

    Java stacktrace:
org.apache.lucene.queryParser.ParseException: Cannot parse 'THE FOOD WAS HONESTLY NOT WORTH THE PRICE. MUCH TOO PRICY WOULD NOT GO BACK AND OR RECOMMEND IT': Encountered " <OR> "OR "" at line 1, column 80.
Was expecting one of:
    <NOT> ...
    "+" ...
    "-" ...
    "(" ...
    "*" ...
    <QUOTED> ...
    <TERM> ...
    <PREFIXTERM> ...
    <WILDTERM> ...
    "[" ...
    "{" ...
    <NUMBER> ...
    <TERM> ...
    "*" ...

 at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:187)
     ....
 at org.apache.lucene.queryParser.QueryParser.generateParseException(QueryParser.java:1759)
 at org.apache.lucene.queryParser.QueryParser.jj_consume_token(QueryParser.java:1641)
 at org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:1268)
 at org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:1207)
 at org.apache.lucene.queryParser.QueryParser.TopLevelQuery(QueryParser.java:1167)
 at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:182)
4

3 回答 3

1

queryparser.parse 仅转义特殊字符(如本页所示)并保持“AND OR”不变,因此它不适用于您的情况。由于您可能还使用 StandardAnalyzer 来分析您的文本,因此您的索引中的术语已经是小写的。因此,您可以将整个查询字符串更改为小写,然后再将其提供给查询解析器。小写的“and”和“or”不被视为运算符,因此“and or”不会使查询解析器出错。

于 2010-08-12T00:22:03.967 回答
1

不只是OR,是AND OR

我使用以下解决方法:

query = queryparser.parse(queryparser.escape(querytext.replace("AND OR", "AND or")))
于 2010-08-10T18:39:27.047 回答
0

我意识到我在这里参加聚会已经很晚了,但是在搜索字符串周围加上引号是一个更好的选择:

querytext = "\"THE FOOD WAS ... \""
于 2014-09-10T13:36:11.023 回答