2

我有一些句子,从句子中我想将单词分开以获得行向量。但是这些单词正在重复以匹配我不想要的最大句子的行向量。我希望无论句子有多大,每个句子的行向量都只会是单词一次。

sentence <- c("case sweden", "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting", "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")
sentence <- cbind(sentence)
word_table <- do.call(rbind, strsplit(as.character(sentence), " "))
test <- cbind(sentence, word_table)

这是我现在得到的,在此处输入图像描述

这就是我想要的, 在此处输入图像描述

我的意思是不重复

4

1 回答 1

2

rawr的解决方案,

sentence <- c("case sweden", "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting", "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")
dd <- read.table(text = paste(sentence, collapse = '\n'), fill = TRUE)
test <- cbind(sentence, dd)

或者,

cc <- read.table(text = paste(gsub('\n', '', sentence), collapse = '\n'), fill = TRUE)
test1 <- cbind(sentence, cc)

谢谢。

于 2016-03-07T23:20:51.263 回答