我有一个用例,我需要计算两个字段的不同数量。
样本 :
x = LOAD 'testdata' using PigStorage('^A') as (a,b,c,d);
y = GROUP x BY a;
z = FOREACH y {
**bc = DISTINCT x.b,x.c;**
dd = DISTINCT x.d;
GENERATE FLATTEN(group) as (a), COUNT(bc), COUNT(dd);
};