
I have a set of trigrams, say:

 ID |        Trigram         | Frequency 
  1 | great customer service |        10 
  2 | customer service great |         8 
  3 | good customer service  |         6 
  4 | have some parking      |         5 
  5 | some more parking      |         2 

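I want to fuzzy-match all of the trigrams against each other and replace each group of similar trigrams with the highest-frequency trigram in that group. For example, the table above should become: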

 ID |        Trigram         | Frequency 
  1 | great customer service |        10 
  2 | great customer service |         8 
  3 | great customer service |         6
  4 | have some parking      |         5 
  5 | have some parking      |         2 

I am using the fuzzywuzzy package to compute the similarity, but I don't know how to do the replacement. Thanks in advance.
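
One possible way to do the replacement, as a rough sketch rather than a definitive answer: process the trigrams from highest to lowest frequency, keep a list of "canonical" trigrams, and map each new trigram to the most similar canonical one if the score clears a threshold. To keep the example free of third-party dependencies, the similarity below is a stand-in built on the standard library's `difflib.SequenceMatcher` with the words sorted first; `fuzz.token_sort_ratio` from fuzzywuzzy (which returns 0 to 100 instead of 0 to 1) could presumably be swapped in. The function names `token_sort_similarity` and `canonicalize` and the `0.8` threshold are assumptions made for illustration, and the threshold would need tuning on real data.

```python
from difflib import SequenceMatcher


def token_sort_similarity(a, b):
    """Word-order-insensitive similarity in [0, 1] (stand-in for a fuzzywuzzy scorer)."""
    a_sorted = " ".join(sorted(a.lower().split()))
    b_sorted = " ".join(sorted(b.lower().split()))
    return SequenceMatcher(None, a_sorted, b_sorted).ratio()


def canonicalize(rows, threshold=0.8):
    """Replace each trigram with the most frequent trigram it is similar to.

    rows: list of (id, trigram, frequency) tuples.
    threshold: assumed cutoff; pairs scoring below it are treated as different.
    """
    canonical = []  # representatives, discovered in descending frequency order
    mapping = {}    # original trigram -> representative trigram
    for _id, trigram, _freq in sorted(rows, key=lambda r: r[2], reverse=True):
        # Most similar representative seen so far, or None if there are none yet.
        best = max(
            canonical,
            key=lambda c: token_sort_similarity(trigram, c),
            default=None,
        )
        if best is not None and token_sort_similarity(trigram, best) >= threshold:
            mapping[trigram] = best
        else:
            canonical.append(trigram)
            mapping[trigram] = trigram
    # Return the rows in their original order with the trigram column replaced.
    return [(i, mapping[t], f) for i, t, f in rows]


rows = [
    (1, "great customer service", 10),
    (2, "customer service great", 8),
    (3, "good customer service", 6),
    (4, "have some parking", 5),
    (5, "some more parking", 2),
]
result = canonicalize(rows)
for row in result:
    print(row)
```

Because the rows are visited in descending frequency order, the first member of each group that is encountered is always the most frequent one, so it becomes the replacement for everything matched to it later. This greedy pass is quadratic in the number of distinct groups, which should be fine for a small table but may need a smarter approach for a large vocabulary.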
