Spark java.lang.StackOverflowError

后端 未结 3 998
囚心锁ツ
囚心锁ツ 2020-12-16 19:07

I\'m using spark in order to calculate the pagerank of user reviews, but I keep getting Spark java.lang.StackOverflowError when I run my code on a big dataset (

3条回答
  •  渐次进展
    2020-12-16 19:45

    When your for loop grows really large, Spark can no longer keep track of the lineage. Enable checkpointing in your for loop to checkpoint your rdd every 10 iterations or so. Checkpointing will fix the problem. Don't forget to clean up the checkpoint directory after.

    http://spark.apache.org/docs/latest/streaming-programming-guide.html#checkpointing

提交回复
热议问题