Using pyspark, I\'d like to be able to group a spark dataframe, sort the group, and then provide a row number. So
Group Date A 2000 A 2002
Use window function:
from pyspark.sql.window import * from pyspark.sql.functions import row_number df.withColumn("row_num", row_number().over(Window.partitionBy("Group").orderBy("Date")))