How to do stratified sampling on two columns in a PySpark DataFrame?

刺人心 2021-02-11 09:08

I want to sample the data set below based on IDs and the comm_type they fall into. The same ID can have multiple comm_types. The data set is huge, so I want to do further analysis o
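One possible approach, sketched here under assumptions: combine the two columns into a single stratum key and pass it to `DataFrame.sampleBy`, which samples each stratum with its own fraction. The column name `ID`, the helper names `uniform_fractions` and `stratified_sample`, the `"||"` separator, and the temporary `_stratum` column are all hypothetical and chosen only for illustration. Note that `sampleBy` samples each row independently, so the per-stratum sizes are approximate rather than exact.

```python
def uniform_fractions(strata, fraction):
    """Map every stratum key to the same sampling fraction (hypothetical helper)."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return {key: fraction for key in strata}


def stratified_sample(df, cols, fraction, seed=42):
    """Keep roughly `fraction` of the rows in every combination of `cols`.

    `df` is assumed to be a pyspark.sql.DataFrame.
    """
    # Imported lazily so the helper above can be used without Spark installed.
    from pyspark.sql import functions as F

    # Build one composite key per row, e.g. "123||email" for (ID, comm_type).
    # Assumption: "||" does not occur inside the column values.
    key = F.concat_ws("||", *[F.col(c).cast("string") for c in cols])
    keyed = df.withColumn("_stratum", key)

    # Collect the distinct strata to the driver to build the fractions dict.
    strata = [row["_stratum"] for row in keyed.select("_stratum").distinct().collect()]
    fractions = uniform_fractions(strata, fraction)

    # Strata missing from `fractions` would be sampled at 0, so all are listed.
    return keyed.sampleBy("_stratum", fractions=fractions, seed=seed).drop("_stratum")
```

For instance, `stratified_sample(df, ["ID", "comm_type"], 0.1)` would keep about 10% of the rows for each (ID, comm_type) pair. Because the distinct strata are collected to the driver, this sketch suits a moderate number of ID and comm_type combinations. With a very large number of strata the fractions dictionary itself becomes large, and a different technique may be preferable.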
