This page says of collect_list:
Returns a list of objects with duplicates.
Is that list ordered? For example, the order of the query results?
built-in collect_list isn't guaranteed to be ordered, even if you do an order by first (even if it did ensure order, doing it this way is a waste of time). Just use brickhouse collect; it ensures the elements are ordered.
It's correct that collect_list isn't guaranteed to be ordered. The function sort_array will sort the result:
select a, b, sort_array(collect_list(c)) as sorted_c
from the_table
group by a, b
来源:https://stackoverflow.com/questions/31350150/is-hives-collect-list-ordered