What is the difference between spark checkpoint and persist to a disk. Are both these store in the local disk?
If you check the relevant part of the documentation, it talks about writing data to a reliable system, e.g. HDFS. But it is up to you to tell Apache Spark where to write its checkpoint information.
On the other hand, persisting is about caching data mostly in memory, as this part of the documentation clearly indicates.
So, it depends on what directory you gave to Apache Spark.