I have created an index on my database collection:
res = yield dbEmail["emails"].create_index([("subject", pymongo.TEXT)])
Now I am trying to search using the index I created:
dbEmail["emails"].find({$text: {$search: "dogs"}}, {score: {$meta: "textScore"}}).sort({score: {$meta: "textScore"}})
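This does not work. The Motor documentation says that the filter parameter in db.collection.find(filter) is a SON object. When I call list_indexes() on my collection, the results are also SON objects. But then
for index in (yield dbEmail["emails"].list_indexes()):
    searchResults = yield dbEmail["emails"].find(index)
also raises an error. How do we run a $text search against an index using Motor?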
Edit: in reply to Neil
My initial database collection looks like this:
emails = {"_id" : ObjectId("xxcxcdsfas"),"subject": "Joe owns a dog", "content": "Dogs are man's best friend", "likes": 60, "year": 2015,"language": "english"},
{"_id" : ObjectId("xxcxcdsfas"),"subject": "Dogs eat cats and dog eats pigeons too", "content": "Cats are not evil", "likes": 30, "year": 2015,"language": "english"},
{"_id" : ObjectId("xxcxcdsfas"),"subject": "Cats eat rats", "content": "Rats do not cook food", "likes": 55, "year": 2014, "language": "english"},
{"_id" : ObjectId("xxcxcdsfas"), "subject": "Rats eat Joe", "content": "Joe ate a rat", "likes": 75, "year": 2014, "language": "english"}
After creating the index (it is being created, I have verified that), the final result I need should look like this:
{ "_id" : ObjectId("xxcxcdsfas"), "subject" : "Dogs eat cats and dog eats pigeons too", "content" : "Cats are not evil", "likes" : 30, "year" : 2015, "language" : "english", "score" : 1 }
{ "_id" : ObjectId("xxcxcdsfas"), "subject" : "Joe owns a dog", "content" : "Dogs are man's best friend", "likes" : 60, "year" : 2015, "language" : "english", "score" : 0.6666666666666666 }
The first document scores higher because "dog" appears twice in its subject, while it appears only once in the second.
I checked that the index exists using the following code:
for item in (yield dbEmail["emails"].index_information()).items():
    print(item)
The response, listing the indexes, is as follows:
('_id_', {'v': 2, 'key': [('_id', 1)], 'ns': 'emailInfo.emails'})
('subject_text', {'v': 2, 'key': [('_fts', 'text'), ('_ftsx', 1)], 'ns': 'emailInfo.emails', 'weights': SON([('subject', 1)]), 'default_language': 'english', 'language_override': 'language', 'textIndexVersion': 3})
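For comparison, here is a minimal sketch of how the shell-style query above might be expressed in Python. It assumes that the operator names ($text, $search, $meta) have to be quoted as ordinary string keys, and that sort() takes a list of (key, direction) pairs as PyMongo cursors do. The helper names build_text_query and search_emails are hypothetical, introduced only for illustration, and the collection object is expected to be a Motor collection such as dbEmail["emails"].

```python
def build_text_query(term):
    """Return (filter, projection, sort_spec) for a $text search on `term`.

    Hypothetical helper: it only builds plain Python structures.
    """
    # In Python the MongoDB operators are ordinary string keys, so they need quotes.
    query_filter = {"$text": {"$search": term}}
    # Ask the server to include the relevance score in each returned document.
    projection = {"score": {"$meta": "textScore"}}
    # sort() is given a list of (key, direction) pairs; here the "direction"
    # is the $meta expression, which orders results by text score.
    sort_spec = [("score", {"$meta": "textScore"})]
    return query_filter, projection, sort_spec


async def search_emails(collection, term, limit=10):
    """Run the text search on a Motor collection supplied by the caller (assumed)."""
    query_filter, projection, sort_spec = build_text_query(term)
    # find() returns a cursor immediately without doing I/O, so it is not awaited.
    cursor = collection.find(query_filter, projection).sort(sort_spec)
    # to_list() is the call that actually fetches documents from the server.
    return await cursor.to_list(length=limit)
```

From a native coroutine this would be called as results = await search_emails(dbEmail["emails"], "dogs"). In Tornado-style code like that shown in this question, the equivalent would be to build the cursor the same way and then yield cursor.to_list(length=10), rather than yielding find() itself.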