我有一组三元组让我们说
ID | Trigram | Frequency
1 | great customer service | 10
2 | customer service great | 8
3 | good customer service | 6
4 | have some parking | 5
5 | some more parking | 2
我想对所有三元组进行模糊匹配,并将相似的三元组替换为频率最高的三元组。例如,上表应该变成
ID | Trigram | Frequency
1 | great customer service | 10
2 | great customer service | 8
3 | great customer service | 6
4 | have some parking | 5
5 | have some parking | 2
我正在使用fuzzywuzzy 包来计算相似度,但不知道如何进行替换。提前致谢