I am doing file-to-file and file-to-SQL comparisons. Because my data is large, I am forced to read it in chunks using the chunksize option in pandas. For testing purposes I have used