javascript - MongoDB查询从集合中删除重复文档

Question

我从搜索框中获取数据，然后使用常规插入查询将数据作为文档插入 MongoDB。数据以具有唯一“_id”的以下格式存储在单词“cancer”的集合中。

{
  "_id": {
    "$oid": "553862fa49aa20a608ee2b7b"
  },
  "0": "c",
  "1": "a",
  "2": "n",
  "3": "c",
  "4": "e",
  "5": "r"
}

每个文档都有一个以与上述相同格式存储的单词。我有很多这样的文件。现在，我想从集合中删除重复的文档。我无法想出办法做到这一点。帮我。

score 2 · Accepted Answer

mongo shell 中的一个简单解决方案：`

use your_db
db.your_collection.createIndex({'1': 1, '2': 1, '3': 1, etc until you reach maximum expected letter count}, {unique: true, dropDups: true, sparse:true, name: 'dropdups'})
db.your_collection.dropIndex('dropdups')

笔记：

如果您有很多文件，预计此过程需要很长时间
请注意，这将删除适当的文档，最好先克隆您的集合并在那里尝试。

javascript - MongoDB查询从集合中删除重复文档

1 回答 1

Related

Reference