Total size of serialized results of 16 tasks (1048.5 MB) is bigger than spark.driver.maxResultSize (1024.0 MB)

-上瘾入骨i 2020-12-05 02:43

I get the following error when I add --conf spark.driver.maxResultSize=2050 to my spark-submit command.

17/12/27 18:33:19 ERROR Tra

2 Answers
  •  孤城傲影
    2020-12-05 03:30

    Cause: actions such as RDD's collect() send a large amount of data to the driver, and the total serialized result exceeds spark.driver.maxResultSize (1024.0 MB in the error above).

    Solution: raise spark.driver.maxResultSize in one of three ways:
      - in SparkConf: conf.set("spark.driver.maxResultSize", "4g")
      - in spark-defaults.conf: spark.driver.maxResultSize 4g
      - when calling spark-submit: --conf spark.driver.maxResultSize=4g
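
The spark-submit variant of the solution can be sketched as a small helper that builds the command line. This is only an illustration: `spark_submit_args` and `my_job.py` are hypothetical names, and the check that rejects a value without a unit suffix (such as the 2050 in the question) is a choice made for this sketch, not something Spark itself enforces.

```python
import re
import shlex

# A positive integer followed by a size unit, e.g. "4g" or "2048m".
# Requiring the unit is a choice made for this sketch, not a Spark rule.
_SIZE_WITH_UNIT = re.compile(r"[1-9][0-9]*(k|m|g|t)b?", re.IGNORECASE)


def spark_submit_args(app_path, max_result_size="4g"):
    """Return a spark-submit argument list that sets spark.driver.maxResultSize."""
    if not _SIZE_WITH_UNIT.fullmatch(max_result_size):
        raise ValueError(
            f"expected a size with a unit such as '4g', got {max_result_size!r}"
        )
    return [
        "spark-submit",
        "--conf",
        f"spark.driver.maxResultSize={max_result_size}",
        app_path,
    ]


if __name__ == "__main__":
    # my_job.py is a placeholder application path.
    print(shlex.join(spark_submit_args("my_job.py")))
```

Writing the unit explicitly ("4g", "2048m") keeps the intended size unambiguous. Raising the limit only helps if the driver has enough memory to hold the collected result.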
