r - corpus() in quanteda fails with a "duplicate row.names" error

Question

I just upgraded my version of quanteda, and I am now getting an error that I did not see before.

spec_txt <- textfile("spec/Spectator_Texts_purged_R.csv", textField = "Cleaned.Text")
spec_corp <- corpus(spec_txt)

This returns:

Error in data.frame(texts = x, row.names = names(x), check.rows = TRUE, : duplicate row.names: Spectator_Texts_purged_R.csv

I searched the CSV for dangling commas, and even cut the file down to three lines:

Issue,Signed,Text,Printer 
1,C,"This is test",A
2,R,"Test this is",B

and:

Text
"This is test"
"Test this is"

The error persists. Am I missing something?

score 0 · Accepted Answer


I had mistakenly upgraded quantedaData instead of quanteda. Once I upgraded quanteda itself, the error went away.
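For anyone hitting the same thing, a quick way to see which of the two packages actually got upgraded is to compare their installed versions. This is only a minimal sketch: `installed_version` is a hypothetical helper name, not part of quanteda, and it relies solely on base R's `installed.packages()`.

```r
# Hypothetical helper: return the installed version of a package,
# or NA if the package is not installed at all.
installed_version <- function(pkg) {
  ip <- installed.packages()
  if (pkg %in% rownames(ip)) unname(ip[pkg, "Version"]) else NA_character_
}

# Compare the two packages that are easy to mix up.
pkgs <- c("quanteda", "quantedaData")
versions <- vapply(pkgs, installed_version, character(1))
print(versions)

# If quanteda turns out to be the stale one, upgrade it explicitly:
# install.packages("quanteda")
```

If the quanteda entry still shows the old version number, the upgrade went to the wrong package.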

Answered 2016-08-31T13:38:51.717
