I am using Drill (1.3) on two nodes. Say:
- 192.xxx.xxx.xxx
- 192.yyy.yyy.yyy
I tried querying (from 192.xxx.xxx.xxx) on a csv file (1000 million records):
select count(*) from dfs.`home/impadmin/BiggerBoy.csv`
Also, I tried join query (from 192.xxx.xxx.xxx) on Hive
& Oracle
:
select * from hive.testDB.`catalog_sales` x inner join oracle.ILABUSER.`customer_address` y on y.CA_ADDRESS_SK = x.CS_BILL_ADDR_SK group by y.CA_CITY limit 100
Every time I got(from Drill UI):
Query Profile
STATE: COMPLETED
FOREMAN: 192.xxx.xxx.xxx
TOTAL FRAGMENTS: 1
Why the other node is not used. Then whats the benefit of using multiple nodes in this case.
Do Drill take care of this by itself or I need to configure something?
If anybody able to get multiple fragment please share your use case.