When collecting billions of rows, it is better (when possible) to consolidate, process, summarize, whatever, the data before storing. Keep the raw data in a file if you think you need to get back to it.
Doing that will eliminate most of your questions and concerns, plus speed up the processing.