How to find max value in pair RDD?

攒了一身酷 2020-12-01 14:30

I have a Spark pair RDD of (key, count) as shown below:

Array[(String, Int)] = Array((a,1), (b,2), (c,1), (d,3))

How do I find the key with the highest count?

4 Answers
  •  不思量自难忘°
    2020-12-01 15:07

    Use the Array.maxBy method on a local array:

    val a = Array(("a",1), ("b",2), ("c",1), ("d",3))
    val maxKey = a.maxBy(_._2)
    // maxKey: (String, Int) = (d,3)
    

    Or, on the RDD itself, use RDD.max with a custom Ordering:

    // rdd is the RDD[(String, Int)] from the question; pairs are compared by count only
    val maxKey2 = rdd.max()(new Ordering[(String, Int)] {
      override def compare(x: (String, Int), y: (String, Int)): Int =
        Ordering[Int].compare(x._2, y._2)
    })
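
The anonymous Ordering above can likely be written more compactly with Ordering.by. The sketch below is a minimal illustration on a local Array so that it runs without a Spark context. The names byCount, pairs, best and maxKeyOnly are made up for this example, and the rdd mentioned in the closing comment is assumed to be the RDD[(String, Int)] from the question.

```scala
// Ordering that compares (key, count) pairs by the count alone.
val byCount: Ordering[(String, Int)] = Ordering.by[(String, Int), Int](_._2)

// Same sample data as in the question, held in a local Array.
val pairs = Array(("a", 1), ("b", 2), ("c", 1), ("d", 3))

// Pass the ordering explicitly; this selects the pair with the largest count.
val best = pairs.max(byCount)

// Extract just the key from the winning pair.
val maxKeyOnly = best._1

// With Spark, the same ordering could be passed to the RDD (rdd is assumed here):
//   val bestOnRdd = rdd.max()(byCount)
```

Note that the result is the whole (key, count) pair, so take ._1 if only the key is needed.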
    
