What is the difference between memory_only and memory_and_disk caching level in spark?

后端 未结 2 679
自闭症患者
自闭症患者 2020-12-25 12:37

How is the behavior of memory_only and memory_and_disk caching level in spark differ?

2条回答
  •  爱一瞬间的悲伤
    2020-12-25 13:00

    As explained in the documentation, Persistence levels in terms of efficiency:

    Level                Space used  CPU time  In memory  On disk  Serialized
    -------------------------------------------------------------------------
    MEMORY_ONLY          High        Low       Y          N        N
    MEMORY_ONLY_SER      Low         High      Y          N        Y
    MEMORY_AND_DISK      High        Medium    Some       Some     Some
    MEMORY_AND_DISK_SER  Low         High      Some       Some     Y
    DISK_ONLY            Low         High      N          Y        Y
    

    MEMORY_AND_DISK and MEMORY_AND_DISK_SER spill to disk if there is too much data to fit in memory.

提交回复
热议问题