The documentation of HIVE notes that LIMIT clause returns rows chosen at random. I have been running a SELECT table on a
HIVE
LIMIT
returns rows chosen at random
SELECT
To be safe you want to use
select * from table distribute by rand() sort by rand() limit 10000;
select * from table
distribute by rand()
sort by rand()
limit 10000;