I was asked by the interviewer to design a system to store gigabytes of data and the system also has to support some kind of query.
Description:
In my opinion, create a B+ tree using time as the key to help you quickly locate the range of records during given time period (t1,t2) in disk. Then using the records during (t1,t2) to build IP and URL hash table respectively.