Is HDFS necessary for Spark workloads?

夕颜 2021-01-06 05:39

HDFS is not strictly necessary for Spark, but it is recommended in some places.

To help evaluate the effort spent in getting HDFS running:

What are the benefits of

4 answers
  •  暗喜 (original poster)
    2021-01-06 06:34

    The shortest answer is: "No, you don't need it". You can analyse data even without HDFS, but of course you then need to replicate the data on all your nodes.
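
To make the replication point concrete, here is a minimal PySpark sketch. It assumes PySpark is installed; the path `/data/events.txt` and the host `namenode:8020` are hypothetical placeholders, not values from the question. With a `file://` URI each executor opens the path on its own local disk, so the file must exist at the same location on every node, whereas an `hdfs://` URI is served by the cluster filesystem.

```python
from urllib.parse import urlparse


def needs_manual_replication(path: str) -> bool:
    """Return True when the URI explicitly targets the local filesystem.

    With a file:// URI every worker reads from its own disk, so the data
    must already be copied to the same path on all nodes. A path with no
    scheme is resolved against Hadoop's fs.defaultFS setting, so it is
    not treated as local here.
    """
    return urlparse(path).scheme == "file"


def count_lines(spark, path: str) -> int:
    """Read a text file through Spark and count its lines."""
    return spark.read.text(path).count()


if __name__ == "__main__":
    # Imported here so the helper above can be used without PySpark.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("local-vs-hdfs").getOrCreate()

    # Hypothetical locations, for illustration only.
    local_path = "file:///data/events.txt"
    hdfs_path = "hdfs://namenode:8020/data/events.txt"

    for path in (local_path, hdfs_path):
        if needs_manual_replication(path):
            print(f"{path}: copy this file to every worker node first")
        else:
            print(f"{path}: resolved by the configured filesystem")

    # Only the local read is executed; the HDFS read needs a running cluster.
    print(count_lines(spark, local_path))
    spark.stop()
```

On a single machine (local mode) the `file://` read works with no extra setup, which is why HDFS only starts to matter once the job runs on several nodes.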

    The long answer is quite counterintuitive, and I'm still trying to understand it with the help of the Stack Overflow community.

    Spark local vs hdfs permormance
