2

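We are trying to figure out which Linux distribution is best suited for Nutch-Hadoop integration. We plan to use a cluster to crawl a large amount of content with Nutch. Let me know if you need any more clarification on this question.

Thanks.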


2 Answers

1

There is not much difference between the major Linux distributions in this case, but I'd recommend one that has Hadoop packages prepared. I'm using Cloudera's Hadoop distribution on Debian and it works very well.

Answered 2010-06-18T12:56:49.400
1

The hadoop and hbase packages will be in the next Debian stable release:

http://packages.debian.org/search?keywords=hadoop

Answered 2010-06-20T11:31:32.720