问题标签 [reducers]

问问题

For questions regarding programming in ECMAScript (JavaScript/JS) and its various dialects/implementations (excluding ActionScript). Note JavaScript is NOT the same as Java! Please include all relevant tags on your question; e.g., [node.js], [jquery], [json], [reactjs], [angular], [ember.js], [vue.js], [typescript], [svelte], etc.

840 问题

0 投票

2 回答

3751 浏览

java - 使用 MRUnit 测试多个输出

有没有办法测试一个用于写入多个输出文件的reduceMRUnit类MultipleOutputFormat？

2013-01-20T09:02:37.300

0 投票

1 回答

86 浏览

java - reducer 数量对集群节点数量的依赖性

我的 hadoop 程序使用一个映射器，它将输入数据分成一定数量的部分，这些部分在/usr/countcomputers.txt文件中设置（由映射器函数读取）。进一步在一个部分到达每个减速器。因此，在/usr/countcomputers.txt文件中设置的数字定义了减速器的数量。在这方面我有一个问题：reducers 仅在启动 TaskTracker 的恶魔的计算机上执行，或者在所有节点上执行，包括由 JobTracker 和 Secondary NameNode 启动的 NameNode 的哪些恶魔？对我来说，知道对这个问题的回答非常重要，因为/usr/countcomputers.txt文件中设置的数字取决于它，在程序中读取。

java linux hadoop mapreduce reducers

2013-02-06T17:14:25.800

0 投票

5 回答

539 浏览

java - 如何从文件中拆分给定的输入？

我已经编写了用于从文本文件传递整数输入的 Java 代码，例如1 10 39 59 20 60 38，当有空格时我必须拆分字符串。

输入在单行中给出input.txt

我的代码是：

分割线后，我将分离的值用于不同的任务。我的问题是如何拆分位于同一文件中的所有值（值也在不同的行中）并将它们存储在一个数组中？

假设如果以下是input.txt中给出的输入，那么如何拆分所有值并将它们存储在一个数组中？

示例输入：

预期输出：

当我将我的代码用于上述输入时，只考虑输入文件的最后一行 - 所有前面的行都被忽略。

java reducers

2013-02-25T16:41:26.830

0 投票

1 回答

1421 浏览

amazon-web-services - 如何计算映射器/减速器的数量以最大化运行在亚马逊云上的 mahout RecommenderJob 的性能？

根据 Amazon Elastic MapReduce 上使用/可用的实例，计算要使用的正确 hadoop 映射器和缩减器数量的最佳方法是什么？（使用 mahout-core-0.7 发行版的 RecommenderJob）

amazon-web-services hadoop mahout reducers mapper

2013-03-06T20:51:26.417

0 投票

1 回答

767 浏览

hadoop - Size of map output partitions?

Let's assume that we have 3 mappers (m1, m2 and m3) and 2 reducers (r1 and r2).

Each reducer fetches its input partitions from the generated files by each mapper.

From the job history, I can extract the total input for each reduce task, but I would like to know the contribution of each mapper to this reducer input ?

For example, the reducer r1 will receive an INPUT_r1 such as:

INPUT_r1 = ( partition fetched from m1 ) + ( partition fetched from m2 ) + ( partition fetched from m3 )

I would like to know the size of those partitions from mappers ?

hadoop mapper reducers

2013-04-09T17:45:48.523

0 投票

2 回答

94 浏览