Hive clustered by on more than one column

执笔经年 2021-01-02 11:56

I understand that when a Hive table is declared CLUSTERED BY a single column, Hive applies a hash function to that bucketing column and then puts that row of data into one of the

2 answers
  •  感情败类
    2021-01-02 12:31

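    1. Yes, the number of files will still be 32.
    2. The hash function takes both columns ("continent" and "country") into account together. As far as I know, Hive does not literally join them into one string; it hashes each bucketing column and combines the per-column hashes into a single hash code, which is then taken modulo the number of buckets.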
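
For illustration, here is a rough Python sketch of how a row could be mapped to a bucket when bucketing on two columns. It assumes the legacy (pre-Hive 3) scheme, in which each column's hash is folded into a running value as 31 * h + column_hash and the non-negative result is taken modulo the bucket count. It also assumes ASCII string values. The function names are made up for this example and are not Hive APIs; newer Hive versions use a different (Murmur3-based) hash, so actual bucket numbers may differ.

```python
def string_hash(s: str) -> int:
    """Java-style 31-based string hash, kept within 32 bits (ASCII input assumed)."""
    h = 0
    for b in s.encode("utf-8"):
        h = (31 * h + b) & 0xFFFFFFFF
    return h


def bucket_for(values, num_buckets: int) -> int:
    """Combine per-column hashes into one hash code and map it to a bucket index."""
    h = 0
    for v in values:
        # Fold each bucketing column's hash into the running hash code.
        h = (31 * h + string_hash(v)) & 0xFFFFFFFF
    # Clear the sign bit so the result is non-negative, then pick a bucket.
    return (h & 0x7FFFFFFF) % num_buckets


# Hypothetical row values for the two bucketing columns, with 32 buckets.
bucket = bucket_for(["Asia", "India"], 32)
print(bucket)
```

Whatever the exact hash, the result is always an index between 0 and 31, which is why the number of bucket files stays at 32 no matter how many columns appear in CLUSTERED BY.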

    Hope it helps!!
