0

我是 R 的新手,Rhipe 和 Hadoop 想从内容为的文件中读取数据,

<Author>fallriverma
<Content>Quality hotel at great price Very clean.
<Date>Nov 25, 2008
<Rating>5   5   5   5   5   5   5   5   
<Aspects>
1   8826(grat):1    
3   3(clean):1  19(price):1 187(quality):1  
0   
0   
0   
3   0(staff):1  12(friendly):1  14(helpful):1   
3   6(breakfast):1  46(free):1  333(selection):1    
0

<Author>yondaime1845
<Content>Its the best of the best for a reason One of the more affordable and better hotels in the city of seattle.
<Date>Jan 2, 2008
<Rating>5   5   5   5   5   5   5   5   
<Aspects>
4   41(city):1  374(reason):1   762(seattle):1  1062(affordable):1  
0   
0   
4   1(location):1   66(park):1  143(cheap):1    186(convenient):1   
0   
0   
4   5(time):1   9(service):1    12(friendly):1  608(employee):1 
0

我想从“8826(qrat):1”中读取它的作者和方面值,如8826,并想使用hadoop、rhipe和R按列显示它们

希望你的建议

提前致谢

4

1 回答 1

0

rhls("/user/notroot/input/"), 提供您所在位置的完整路径 -HDFS它会起作用

于 2013-04-08T06:56:08.850 回答