unix - 使用 cygwin 和 hadoop 创建文件

Question

我正在使用本教程来设置第一个 hadoop 项目：

http://hadoop.apache.org/docs/r0.18.3/mapred_tutorial.html#Source+Code

我对这部分感到困惑：

Sample text-files as input:

$ bin/hadoop dfs -ls /usr/joe/wordcount/input/ 
/usr/joe/wordcount/input/file01 
/usr/joe/wordcount/input/file02 

$ bin/hadoop dfs -cat /usr/joe/wordcount/input/file01 
Hello World Bye World 

$ bin/hadoop dfs -cat /usr/joe/wordcount/input/file02 
Hello Hadoop Goodbye Hadoop

如何创建文件 file01 和 file02 ？

当我运行上面的命令 () 时，我收到错误：“文件不存在 /usr/joe/wordcount/input/file01”

score 0 · Accepted Answer

您可以使用文件编辑器在 cygwin 上创建文件，vim或者emacs（不确定）。

或者您可以在下面复制粘贴来创建文件。

echo "Hello World Bye World" >  /usr/joe/wordcount/input/file01
echo "Hello Hadoop Goodbye Hadoop" > /usr/joe/wordcount/input/file02

还有其他更巴洛克式的技术可以将简单数据放入文件中，但是鉴于您的问题定义，这应该可以正常工作。

（另一种可能性，...。这些文件可能是测试或教程安装的一部分，而您错过了运行安装的步骤？如果创建这些文件后，遇到新的错误消息，请考虑 dbl-checking you确认所有安装正确的步骤）

IHTH

score 0 · Accepted Answer

ru 指向同一条路径？？否则从本地文件系统创建文件。然后使用copyFromLocal命令将其放入 HDFS。然后 cat 文件路径/文件名。这行得通。

bin/hadoop fs -copyFromLocal /path/to/file/in/localfilesys/file01 /usr/joe/wordcount/input
bin/hadoop fs -cat /usr/joe/wordcount/input/file01

unix - 使用 cygwin 和 hadoop 创建文件

2 回答 2

Related

Reference