Mapper/Reducer 1 --> (key,value)
/ | \
/ | \
Mapper/Reducer 2 | Mapper/Reducer 4
-> (oKey,oValue) | -> (xKey, xValue)
|
|
Mapper/Reducer 3
-> (aKey, aValue)
我有一个日志文件,我将其与 MR1 汇总。Mapper2、Mapper3、Mapper4 将 MR1 的输出作为它们的输入。作业是链式的。
MR1 输出:
User {infos of user:[{data here},{more data},{etc}]}
..
MR2 输出:
timestamp idCount
..
MR3 输出:
timestamp loginCount
..
MR4 输出:
timestamp someCount
..
我想合并 MR2-4 的输出:最终输出->
timestamp idCount loginCount someCount
..
..
..
有没有办法没有猪或蜂巢?我正在使用 Java。