postgresql - Postgresql全文搜索带撇号的单词

Question

我正在为 Postgresql 8.4 构建一个自定义的 ispell 字典配置，并且在正确解析带有撇号的单词时遇到了一些问题。Postgresql 包含的 ispell 字典包括一个 .affix 文件，其中包含一个“M” SFX 规则，该规则指定其单词的扩展形式。

这是一个示例，假设dictionary/SM我的字典文件中有：

SELECT to_tsvector('english_ispell', 'dictionary''s dictionaries');

预期输出：

'dictionary':1,2

实际输出：

s':2, 'dictionary':1,3

我做错了什么吗？这是 ts_debug 的输出，显示了它是如何被解析的。

SELECT * FROM ts_debug('english_ispell', 'dictionary''s dictionaries');

   alias   |   description   |    token     |         dictionaries          |   dictionary   |   lexemes    
-----------+-----------------+--------------+-------------------------------+----------------+--------------
 asciiword | Word, all ASCII | dictionary   | {english_ispell,english_stem} | english_ispell | {dictionary}
 blank     | Space symbols   | '            | {}                            |                | 
 asciiword | Word, all ASCII | s            | {english_ispell,english_stem} | english_ispell | {s}
 blank     | Space symbols   |              | {}                            |                | 
 asciiword | Word, all ASCII | dictionaries | {english_ispell,english_stem} | english_ispell | {dictionary}

如何让 Postgresql 将其解析'为单个单词的一部分，而不是将其分解为“空格符号”？

postgresql - Postgresql全文搜索带撇号的单词

0 回答 0

Related

Reference