Reuse a cached Spark RDD

流过昼夜 提交于 2019-12-20 03:17:38

问题


Is there a possibility in Spark to re-use a cached RDD in another application (or in another run of the same application)?

JavaRDD<ExampleClass> toCache = ... // transformations on the RDD
toCache.cache();                    // can this be reused somehow in another application or further runs?

回答1:


No, Spark RDD cannot be used in other application or in another run.

You can connect Spark with for example Hazelcast or Apache Ignite to save RDDs in memory. Other application will have possibility to read data saved in first application



来源:https://stackoverflow.com/questions/41721963/reuse-a-cached-spark-rdd

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!