I have a two-column dataframe that I one hot encode into a sparse dataframe(as it exceeds my systems memory). The problem is when I try to groupby it takes forever. My goal