I have a TEXT field in a MySQL table. It contains sentences.
Example:
Hello AAAA, where is your dog BBBB
Hello PPPP, where is your dog QQQQ
Hello XXXX, where is your dog YYYY
I am fine. thanks
I am fine. thanks
where are you going?
Thank you very much
Here the first 3 sentences have 5 of their 7 words in common, so they are (5/7)*100 ≈ 71% similar.
The 4th and 5th are 100% similar.
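To make the measure concrete, here is a minimal sketch of one possible reading of it: compare the two sentences word by word at the same position and divide the matches by the word count of the longer sentence. The function name wordSimilarity, the whitespace splitting, and the position-by-position comparison are all assumptions for illustration, not an established definition.

```php
<?php
// Hypothetical helper: percentage of positions at which two sentences
// hold the same word (whitespace split, longer sentence as denominator).
function wordSimilarity(string $a, string $b): float
{
    $wa = preg_split('/\s+/', trim($a), -1, PREG_SPLIT_NO_EMPTY);
    $wb = preg_split('/\s+/', trim($b), -1, PREG_SPLIT_NO_EMPTY);
    $longest = max(count($wa), count($wb));
    if ($longest === 0) {
        return 100.0; // assumption: two empty sentences count as identical
    }
    $same = 0;
    $shortest = min(count($wa), count($wb));
    for ($i = 0; $i < $shortest; $i++) {
        if ($wa[$i] === $wb[$i]) {
            $same++;
        }
    }
    return $same / $longest * 100;
}

// 5 matching words out of 7
echo round(wordSimilarity(
    'Hello AAAA, where is your dog BBBB',
    'Hello PPPP, where is your dog QQQQ'
)), "\n"; // prints 71
```

Punctuation stays attached to the words here ("AAAA," is one word), which matches the count of 7 words used above.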
My question: using PHP, I want to group them into a table like this:
sample_sentence_group                 count
Hello AAAA, where is your dog BBBB    3
I am fine. thanks                     2
where are you going?                  1
Thank you very much                   1
How can I do this? The table has more than 100K records.
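For reference, the grouping I am after could be sketched as a greedy pass: each sentence joins the first group whose sample sentence is similar enough, otherwise it starts a new group. This is only an illustration under assumptions: the function name groupSentences, the 70% threshold, and the position-by-position word comparison are hypothetical, and the rows would really come from the MySQL table rather than a hard-coded array.

```php
<?php
// Hypothetical greedy grouping: a sentence joins the first group whose sample
// shares at least $threshold percent of its words at the same positions.
function groupSentences(iterable $sentences, float $threshold = 70.0): array
{
    $groups = []; // each entry: ['sample' => string, 'words' => string[], 'count' => int]
    foreach ($sentences as $sentence) {
        $words = preg_split('/\s+/', trim($sentence), -1, PREG_SPLIT_NO_EMPTY);
        $placed = false;
        foreach ($groups as &$group) {
            $longest = max(count($words), count($group['words']));
            // array_intersect_assoc keeps only words equal at the same index
            $same = count(array_intersect_assoc($words, $group['words']));
            if ($longest > 0 && $same / $longest * 100 >= $threshold) {
                $group['count']++;
                $placed = true;
                break;
            }
        }
        unset($group); // drop the reference left by the inner loop
        if (!$placed) {
            $groups[] = ['sample' => $sentence, 'words' => $words, 'count' => 1];
        }
    }
    return $groups;
}

// Sample data from the question, standing in for rows read from the table
$rows = [
    'Hello AAAA, where is your dog BBBB',
    'Hello PPPP, where is your dog QQQQ',
    'Hello XXXX, where is your dog YYYY',
    'I am fine. thanks',
    'I am fine. thanks',
    'where are you going?',
    'Thank you very much',
];
foreach (groupSentences($rows) as $g) {
    echo $g['sample'], ' ', $g['count'], "\n";
}
```

The cost of this pass grows with the number of sentences times the number of groups, so with 100K records and many distinct groups it may be too slow.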
Thanks