sql - 使用一个插入语句在多个分区中插入数据

Question

我有表 A 和表 B，其中 B 是使用名为 X 的字段的 A 的分区表。

当我想从A向B插入数据时，我通常执行如下语句：

INSERT INTO TABLE B PARTITION(X=x) SELECT <columnsFromA> FROM A WHERE X=x

现在我想要实现的是能够插入一个 X 的范围，比如说 x1、x2、x3 ......我怎样才能在一个语句中实现这一点？

score 2 · Accepted Answer

使用动态分区加载：

set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition.mode=nonstrict;

INSERT OVERWRITE TABLE table_B PARTITION(X)
select 
col_1,
col_2,
...
col_N,
X --partition column is the last one
 from 
      table_A
where X in ('x1', 'x2', 'x3'); --filter here

或者select * from table_A，如果 A 和 B 中的列顺序相同，则使用。分区列 (X) 应该是最后一个。

sql - 使用一个插入语句在多个分区中插入数据

1 回答 1

Related

Reference