13

Our hadoop cluster using snappy as default codec. Hadoop job reduce output file name is like part-r-00000.snappy. JSnappy fails to decompress the file bcz JSnappy requires the file start with SNZ. The reduce output file start with some bytes 0 somehow.

How could I decompress the file?

4

1 回答 1

18

使用“Hadoop fs -text”读取此文件并将其通过管道传输到 txt 文件。前任:

hadoop fs -text part-r-00001.snappy > /tmp/mydatafile.txt

于 2014-03-10T23:37:18.843 回答