Spark dataframe reducebykey like operation

前端 未结 3 846
星月不相逢
星月不相逢 2021-02-08 11:40

I have a Spark dataframe with the following data (I use spark-csv to load the data in):

key,value
1,10
2,12
3,0
1,20
         


        
3条回答
  •  一个人的身影
    2021-02-08 11:58

    I think user goks missed out on some part in the code. Its not a tested code.

    .map should have been used to convert the rdd to a pairRDD using .map(lambda x: (x,1)).reduceByKey. ....

    reduceByKey is not available on a single value rdd or regular rdd but pairRDD.

    Thx

提交回复
热议问题