machine-learning - Random forest, text classification

Question

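How can I use words as features to perform sentiment analysis on text with the random forest algorithm? My features are words, but random forest works with numbers, and that is where I am stuck.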

score 2 · Accepted Answer

I think scikit-learn can help you solve this. Take a look at the tutorial on the scikit-learn website here; it will be very useful.

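When working with text features, you can use CountVectorizer or DictVectorizer. Both turn text-based features into numeric vectors that a random forest can consume. Take a look at the feature extraction documentation, especially section 4.1.3, here.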

For more detail, you can find an example here. It is useful for classifying text documents.

score 0 · Accepted Answer

You can use CountVectorizer or TF-IDF in the preprocessing part of your random forest pipeline. Post an excerpt of your data and I will demonstrate.
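Since no data was posted, the following is only a rough sketch of what such a pipeline might look like. The training sentences, labels, and parameter values are invented placeholders, and TfidfVectorizer could be swapped for CountVectorizer.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline

# Hypothetical toy sentiment data; replace with your own texts and labels.
train_texts = [
    "I loved this film, it was great",
    "great acting and a wonderful story",
    "what a wonderful, lovely movie",
    "terrible plot and awful acting",
    "I hated it, boring and bad",
    "awful, boring, a waste of time",
]
train_labels = ["pos", "pos", "pos", "neg", "neg", "neg"]

# Step 1 turns raw text into TF-IDF weighted numeric features;
# step 2 trains a random forest on those features.
model = make_pipeline(
    TfidfVectorizer(),
    RandomForestClassifier(n_estimators=100, random_state=0),
)
model.fit(train_texts, train_labels)

# The pipeline applies the same vectorization to new text before predicting.
predictions = model.predict(["what a great film", "boring and awful"])
print(predictions)
```

Keeping the vectorizer inside the pipeline means the vocabulary is learned only from the training texts, and the same transformation is reused at prediction time.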
