What is the difference between a Spark checkpoint and persisting to disk?

北海茫月 2020-11-29 16:29

What is the difference between a Spark checkpoint and persisting to disk? Are both stored on the local disk?

4 answers
  •  渐次进展
    2020-11-29 16:45

    The checkpointing section of the documentation talks about writing data to a reliable storage system, e.g. HDFS. It is up to you, however, to tell Apache Spark where to write its checkpoint information.

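    Persisting, on the other hand, is about caching data, mostly in memory, as the persistence section of the documentation indicates. With a disk storage level such as DISK_ONLY, the cached blocks are written to the executors' local disks and the RDD keeps its lineage, whereas a checkpoint truncates the lineage.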

    So whether a checkpoint ends up on a local disk depends on the directory you gave to Apache Spark.
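To make the contrast concrete, here is a minimal PySpark sketch that runs the same small job once with `persist(StorageLevel.DISK_ONLY)` and once with `checkpoint()`. It is an illustration under assumptions, not taken from the answer above: the function name `compare_persist_and_checkpoint`, the app name, the `local[2]` master, and the temporary checkpoint directory are all hypothetical. On a real cluster the checkpoint directory would normally point at a reliable shared store such as HDFS.

```python
import tempfile


def compare_persist_and_checkpoint(checkpoint_dir):
    """Run one small job with persist(DISK_ONLY) and one with checkpoint()."""
    # Imported here so the file can be loaded on a machine without Spark.
    from pyspark import StorageLevel
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master("local[2]")                # assumption: local test run
        .appName("persist-vs-checkpoint")  # hypothetical app name
        .getOrCreate()
    )
    sc = spark.sparkContext
    try:
        # Checkpoint files are written wherever this directory points.
        # A local path only makes sense in local mode.
        sc.setCheckpointDir(checkpoint_dir)

        # persist(DISK_ONLY): blocks go to Spark's local scratch space on
        # each executor, the lineage is kept, and the blocks are dropped
        # on unpersist() or when the application ends.
        persisted = sc.parallelize(range(10), 2).map(lambda x: x * 2)
        persisted.persist(StorageLevel.DISK_ONLY)
        persisted_sum = persisted.sum()  # first action materializes the cache

        # checkpoint(): must be requested before the first action on the RDD.
        # The data is saved under the checkpoint directory and the lineage
        # is truncated once the action has run.
        checkpointed = sc.parallelize(range(10), 2).map(lambda x: x * 2)
        checkpointed.checkpoint()
        checkpointed_sum = checkpointed.sum()

        return {
            "persisted_sum": persisted_sum,
            "checkpointed_sum": checkpointed_sum,
            "persisted_is_checkpointed": persisted.isCheckpointed(),
            "is_checkpointed": checkpointed.isCheckpointed(),
            "checkpoint_file": checkpointed.getCheckpointFile(),
        }
    finally:
        spark.stop()


if __name__ == "__main__":
    # Hypothetical local directory, used only for this demonstration.
    for key, value in compare_persist_and_checkpoint(tempfile.mkdtemp()).items():
        print(key, "=", value)
```

Both runs compute the same result; what differs is where the data is stored and whether the lineage survives. The persisted blocks disappear with the application, while the files under the checkpoint directory stay until they are deleted.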
