File stored in Hive:
[
{
"occupation": "guitarist",
"fav_game": "football",
"name": "d1"
},
{
"occupation": "dancer",
"fav_game": "chess",
"name": "k1"
},
{
"occupation": "traveller",
"fav_game": "cricket",
"name": "p1"
},
{
"occupation": "drummer",
"fav_game": "archery",
"name": "d2"
},
{
"occupation": "farmer",
"fav_game": "cricket",
"name": "k2"
},
{
"occupation": "singer",
"fav_game": "football",
"name": "s1"
}
]
CSV file in Hadoop:
name,age,city
d1,23,delhi
k1,23,indore
p1,23,blore
d2,25,delhi
k2,30,delhi
s1,25,delhi
I queried each of them separately and both worked fine. Then I tried a join query:
select * from hdfs.`/demo/distribution.csv` d join hive.demo.`user_details` u on d.name = u.name
I got the following error:
org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: DrillRuntimeException: Join only supports implicit casts between 1. Numeric data 2. Varchar, Varbinary data 3. Date, Timestamp data Left type: INT, Right type: VARCHAR. Add explicit casts to avoid this error Fragment 0:0 [Error Id: b01db9c8-fb35-4ef8-a1c0-31b68ff7ae8d on IMPETUS-DSRV03.IMPETUS.CO.IN:31010]
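The error message asks for explicit casts. Here is a minimal sketch of what that might look like. It rests on an assumption the post does not confirm: that the CSV is read without header extraction. In that case Drill exposes each row as the `columns` array, and `d.name` refers to a column that does not exist, which Drill treats as a nullable INT. That would explain the "Left type: INT" in the error. If so, joining on `columns[0]` with an explicit cast may avoid the type mismatch:

```sql
-- Sketch only: assumes distribution.csv is read WITHOUT header extraction,
-- so the first field (name) is available as columns[0] and the header line
-- "name,age,city" is treated as an ordinary data row.
SELECT *
FROM hdfs.`/demo/distribution.csv` d
JOIN hive.demo.`user_details` u
  ON CAST(d.columns[0] AS VARCHAR) = CAST(u.name AS VARCHAR)
```

If header extraction is enabled for the file instead (for example through the text format's `extractHeader` option or a `.csvh` extension), `d.name` should already be a VARCHAR, and the original join condition may work as written.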