Hive (Big Data) - difference between bucketing and indexing
Question: What is the main difference between bucketing and indexing of a table in Hive?

Answer 1: The main difference is the goal:

Indexing

The goal of Hive indexing is to improve the speed of query lookups on certain columns of a table. Without an index, a query with a predicate such as 'WHERE tab1.col1 = 10' loads the entire table or partition and processes all the rows. If an index exists for col1, only a portion of the file needs to be loaded and processed. Indexes become even more essential when the