通过如下注册标记字符创建标记器时,无法注册罗马“X”。(测试 ES 版本:ES6.7,ES5.6)
"tokenizer": {
"autocomplete": {
"type": "edge_ngram",
"min_gram": 1,
"max_gram": 14,
"token_chars": [
"Ⅹ"
]
}
}
错误日志是这样的
{"error":{"root_cause":[{"type":"remote_transport_exception","reason":"[node02][192.168.115.x:9300][indices:admin/create]"}],"type ":"illegal_argument_exception","reason":"未知 标记类型:'ⅹ',必须是 [symbol、private_use、paragraph_separator、start_punctuation、未分配、enclosure_mark、connector_punctuation、letter_number、other_number、math_symbol、lowercase_letter、space_separator、surrogate、 initial_quote_punctuation、decimal_digit_number、digit、other_punctuation、dash_punctuation、currency_symbol、non_spacing_mark、格式、modifier_letter、控制、uppercase_letter、other_symbol、end_punctuation、modifier_symbol、other_letter、line_separator、titlecase_letter、字母、标点符号,combining_spacing_mark,final_quote_punctuation,空格]"},"status":400}
如何将罗马数字标记为术语?