difference between rdd.collect().toMap to rdd.collectAsMap()?

前端 未结 2 385
死守一世寂寞
死守一世寂寞 2021-01-02 10:15

Is there any performance impact when I use collectAsMap on my RDD instead of rdd.collect().toMap ?

I have a key value rdd and I want to convert to HashMap as far I

2条回答
  •  悲哀的现实
    2021-01-02 11:01

    No difference. Avoid using collect() as much as you can as it destroys the concept of parallelism and collects the data on the driver.

提交回复
热议问题