approxQuantile gives an incorrect median in Spark (Scala)?

太阳男子 2020-12-19 09:53

I have this test data:

 val data = List(
        List(47.5335D),
        List(67.5335D),
        List(69.5335D),
        List(444.1235D),
        List(677.53         


        
3 answers
  •  慢半拍i (original poster)
    2020-12-19 10:14

    This is the result from my local run. Are you doing something similar?

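    // toDF on a local collection needs the session implicits.
    // Assumes a SparkSession named `spark`, as in spark-shell.
    import spark.implicits._

    val data = List(
      List(47.5335D),
      List(67.5335D),
      List(69.5335D),
      List(444.1235D),
      List(677.5335D)
    )

    // Flattening gives a List[Double]; toDF puts it in a single column named "value"
    val df = data.flatten.toDF

    // The third argument is the relative error; 0 requests an exact computation
    df.stat.approxQuantile("value", Array(0.5), 0)
    // res18: Array[Double] = Array(67.5335)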
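
As a point of comparison for the value above, here is a minimal sketch in plain Scala (no Spark involved) that computes the exact median of the same five numbers by sorting them. `exactMedian` is a hypothetical helper introduced only for illustration; it is not part of Spark or of the original post.

```scala
// Hypothetical helper (not from the original post): exact median of a local collection.
def exactMedian(values: Seq[Double]): Double = {
  require(values.nonEmpty, "median of an empty collection is undefined")
  val sorted = values.sorted
  val mid = sorted.length / 2
  // Odd count: take the middle element; even count: average the two middle elements
  if (sorted.length % 2 == 1) sorted(mid)
  else (sorted(mid - 1) + sorted(mid)) / 2.0
}

// Same five values as in the test data
val values = Seq(47.5335, 67.5335, 69.5335, 444.1235, 677.5335)
println(exactMedian(values)) // prints 69.5335
```

This only works for data small enough to collect locally, so it is useful as a sanity check on a test set rather than as a replacement for `approxQuantile` on a large DataFrame.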
    
