PySpark: create a dictionary within groupBy
Question

Is it possible in PySpark to create a dictionary within groupBy.agg()? Here is a toy example:

    import pyspark
    from pyspark.sql import Row
    import pyspark.sql.functions as F

    # Start a local Spark context and wrap it in a session
    sc = pyspark.SparkContext()
    spark = pyspark.sql.SparkSession(sc)

    # Toy data: several (key, value) pairs per id
    toy_data = spark.createDataFrame([
        Row(id=1, key='a', value="123"),
        Row(id=1, key='b', value="234"),
        Row(id=1, key='c', value="345"),
        Row(id=2, key='a', value="12"),
        Row(id=2, key='x', value="23"),
        Row(id=2, key='y', value="123")])

    toy_data.show()

    +---+---+-----