I am new to machine learning and need help choosing the best approach.
I have a master dataset with millions of rows and the following columns:
Customer first name, last