I have a dataset containing the following 7 daily summary tables, each of which is small:
summary_broad_20130104_hourly 199 k rows 8395 KB
summary_broad_20130105_hourly 198 k rows 8347 KB
summary_broad_20130106_hourly 197 k rows 8319 KB
summary_broad_20130107_hourly 194 k rows 8190 KB
summary_broad_20130108_hourly 197 k rows 8338 KB
summary_broad_20130109_hourly 201 k rows 8490 KB
summary_broad_20130110_hourly 202 k rows 8534 KB
When I run a simple select count(*) across all of these tables (04-10), I get "Response too large to return":
$ bq query "select count(*) from summary_broad_20130104_hourly,summary_broad_20130105_hourly,summary_broad_20130106_hourly,summary_broad_20130107_hourly,summary_broad_20130108_hourly,summary_broad_20130109_hourly,summary_broad_20130110_hourly"
BigQuery error in query operation: Error processing job '987928363675:job_bb504ce6ea31416db3f8762d3c244d93': Response too large to return.
However, if I restrict the query to tables 04-06, it succeeds:
$ bq query "select count(*) from summary_broad_20130104_hourly,summary_broad_20130105_hourly,summary_broad_20130106_hourly"
+--------+
| f0_ |
+--------+
| 594721 |
+--------+
But as soon as I extend it to tables 04-07, I can reproduce the error:
$ bq query "select count(*) from summary_broad_20130104_hourly,summary_broad_20130105_hourly,summary_broad_20130106_hourly,summary_broad_20130107_hourly"
BigQuery error in query operation: Error processing job '987928363675:job_22ea222b4b0549a98bb9e90f1a2ac539': Response too large to return.
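Is this a bug, or am I doing something wrong? The data being queried is small (about 40 MB in the 04-07 case), and I would expect select count(*) to be a node-local aggregation with a negligible amount of intermediate data.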
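For reference, here is a rough tally of the input for the failing 04-07 case. It is only a sketch: the figures are copied from the table listing above, and the variable names are just for illustration.

```shell
# Sizes (KB) and approximate row counts (thousands) of tables 04-07,
# copied from the listing above.
total_kb=$((8395 + 8347 + 8319 + 8190))
total_krows=$((199 + 198 + 197 + 194))

# Integer division, so the MB figure is rounded down.
echo "input size: ${total_kb} KB (roughly $((total_kb / 1024)) MB)"
echo "input rows: about ${total_krows}k"
```

So the failing query reads a few tens of megabytes and well under a million rows, and its result is a single number.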