SPARK DataFrame: How to efficiently split dataframe for each group based on same column values

前端 未结 3 1223
花落未央
花落未央 2020-12-31 10:41

I have a DataFrame generated as follows:

df.groupBy($\"Hour\", $\"Category\")
  .agg(sum($\"value\").alias(\"TotalValue\"))
  .sort($\"Hour\".asc,$\"TotalVal         


        
3条回答
  •  误落风尘
    2020-12-31 11:39

    This has been answered here for Spark (Scala):

    How can I split a dataframe into dataframes with same column values in SCALA and SPARK

    and here for pyspark:

    PySpark - Split/Filter DataFrame by column's values

提交回复
热议问题