I have a text file containing a long list of numbers (170k lines). These numbers represent the unique URLs found on each page I crawled. The text file looks like this: