python - Gunning fog index on a column of strings

Asked 2021-03-16T14:37:15.353

Viewed 53 times

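I have a column in a Pandas dataframe where each row (twits) contains several text strings, and I am trying to compute the Gunning fog index for it using .gunning_fog_index. This is what I want to do:

1. Clean the data using textacy.preprocessing.remove_punctuation()
2. Next, make a spaCy doc using textacy.make_spacy_doc()
3. Use the textacy.text_stats.TextStats() function on the spaCy doc, then use .gunning_fog_index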

This is what I have tried:

I ran into a problem with textacy.preprocessing.remove_punctuation(df["twits"], marks=None) because it does not work on the whole column at once. I get the following error:

AttributeError: 'Series' object has no attribute 'translate'
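
The error message suggests that the function calls .translate() on its argument, a method that a str has and a Series does not. The sketch below reproduces that behaviour without textacy or pandas: strip_punct is a hypothetical stand-in for the cleaner, not textacy's actual code, and a plain list stands in for the column.

```python
import string

# Hypothetical stand-in for a cleaner that calls .translate() on its input,
# as the traceback in the question suggests. Not textacy's real code.
PUNCT_TABLE = str.maketrans({ch: " " for ch in string.punctuation})


def strip_punct(text):
    """Replace each ASCII punctuation character in a single string with a space."""
    return text.translate(PUNCT_TABLE)


# A plain list stands in for the dataframe column here.
column = ["Hello, world!", "Fog index: hard?"]

# Passing the whole container fails, because only str has .translate().
try:
    strip_punct(column)
    whole_column_error = None
except AttributeError as exc:
    whole_column_error = str(exc)

# Passing one string at a time works.
cleaned = [strip_punct(text) for text in column]

print(whole_column_error)
print(cleaned)
```

The same reasoning would apply to a Series: the function has to receive each string separately rather than the column object.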

Since the textacy function does work when it looks at a single row (here the row at index 1), as in textacy.preprocessing.remove_punctuation(df["twits"][1], marks=None), I tried using a for loop, but this failed.

for i in df["twits"]:
    textacy.preprocessing.remove_punctuation(df["twits"][i], marks=None)

I was thinking I might be able to pass all the different arguments into the for loop, but I am not sure how to proceed. Any ideas?
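
One possible direction, as a sketch rather than a tested answer: iterating over a Series yields its values, not its index labels, so df["twits"][i] in the loop looks up the tweet text itself as a label. Applying a per-string function with Series.apply avoids the indexing altogether. The names fog_index, add_fog_column, scorer and the model name "en_core_web_sm" are assumptions made for illustration. The textacy calls are the ones named in the question; their names and signatures may differ between textacy versions, so check them against the installed release.

```python
import pandas as pd


def fog_index(text, lang="en_core_web_sm"):
    """Gunning fog index of one string, using the three steps from the question.

    Assumes a textacy version that provides these calls and an installed
    spaCy model named by `lang` (both are assumptions).
    """
    # Imported here so the helper below can be used without textacy installed.
    import textacy
    import textacy.preprocessing
    import textacy.text_stats

    cleaned = textacy.preprocessing.remove_punctuation(text, marks=None)
    doc = textacy.make_spacy_doc(cleaned, lang=lang)
    return textacy.text_stats.TextStats(doc).gunning_fog_index


def add_fog_column(df, column="twits", scorer=fog_index):
    """Return a copy of df with a 'fog_index' column computed row by row."""
    result = df.copy()
    # Series.apply calls scorer once per cell, passing a single value each time.
    result["fog_index"] = result[column].apply(scorer)
    return result


# Usage, assuming textacy and the spaCy model are available:
# df = add_fog_column(df)
```

The scorer parameter is only there so the row-wise mechanics can be tried with any function that accepts one string, for example a simple word counter, before the textacy setup is in place.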

0 answers