Apache Spark does not delete temporary directories

前端 未结 6 534
庸人自扰
庸人自扰 2020-11-27 15:48

After a spark program completes, there are 3 temporary directories remain in the temp directory. The directory names are like this: spark-2e389487-40cc-4a82-a5c7-353c0feefbb

6条回答
  •  刺人心
    刺人心 (楼主)
    2020-11-27 16:54

    for spark.local.dir, it will only move spark temp files, but the snappy-xxx file will still exists in /tmp dir. Though didn't find way to make spark automatically clear it, but you can set JAVA option:

    JVM_EXTRA_OPTS=" -Dorg.xerial.snappy.tempdir=~/some-other-tmp-dir"
    

    to make it move to another dir, as most system has small /tmp size.

提交回复
热议问题