I have a spark dataset of 700 million rows which are partitioned by id.
id
Each row Is id and a value. I want to create a stratifi
value