I have a large dataset (about 300,000 rows), and I want to delete duplicates based on the values in two columns. I have tried to illustrate this with an example below. The code I have written now