Retail data is often user-based, and each user buys on multiple occasions. We need to randomly sample users (not individual purchase records/rows) and hope to reproduce the res
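
A minimal sketch of user-level sampling in plain Python, assuming each purchase record is a dict with a `user_id` field. The field name, the `sample_users` helper, and the toy data are all hypothetical and only for illustration. The idea is to draw from the unique user IDs with a fixed seed, then keep every row that belongs to a drawn user.

```python
import random


def sample_users(rows, user_key="user_id", frac=0.5, seed=42):
    """Return every row belonging to a reproducible random subset of users."""
    # Sort the unique IDs so the draw does not depend on row order
    # or on set iteration order.
    users = sorted({row[user_key] for row in rows})
    k = round(len(users) * frac)
    # A dedicated generator with a fixed seed picks the same users on each run.
    rng = random.Random(seed)
    chosen = set(rng.sample(users, k))
    # Keep all purchases of the chosen users, not a subset of their rows.
    return [row for row in rows if row[user_key] in chosen]


# Toy data (made up): four users, several purchases each.
purchases = [
    {"user_id": "u1", "item": "milk"},
    {"user_id": "u1", "item": "bread"},
    {"user_id": "u2", "item": "eggs"},
    {"user_id": "u3", "item": "tea"},
    {"user_id": "u3", "item": "jam"},
    {"user_id": "u3", "item": "rice"},
    {"user_id": "u4", "item": "salt"},
]

sample = sample_users(purchases, frac=0.5, seed=42)
```

Sampling rows directly would split a user's purchase history between the sample and the remainder, whereas sampling IDs keeps each history whole. Reusing the same seed with the same set of user IDs should return the same users on a given Python version.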