I want to select rows from mytable in original rows with definite numbers. As we know, the key word \'limit\' will randomly select rows. The rows in mytable are in order. I
Rows in your table may be in order but...
Tables are being read in parallel, results returned from different mappers or reducers not in original order. That is why you should know the rule defining "original order".
If you know then you can use row_number()
or order by
. For example:
select * from table order by ... limit 10000;
Try:
SET mapred.reduce.tasks = 1
SELECT * FROM (
SELECT *, ROW_NUMBER() OVER () AS row_num
FROM table ) table1
SORT BY row_num LIMIT 10000