How does Hive 'alter table <table name> concatenate' work?
问题 I have n(large) number of small sized orc files which i want to merge into k(small) number of large orc files. This is done using alter table table_name concatenate command in Hive. I want to understand how does Hive implement this. I'm looking to implement this using Spark with any changes if required. Any pointers would be great. 回答1: As per the AlterTable/PartitionConcatenate: If the table or partition contains many small RCFiles or ORC files, then the above command will merge them into