Difference between DataFrame, Dataset, and RDD in Spark

后端 未结 15 1426
慢半拍i
慢半拍i 2020-11-22 15:53

I\'m just wondering what is the difference between an RDD and DataFrame (Spark 2.0.0 DataFrame is a mere type alias for Dataset[Row]

15条回答
  •  眼角桃花
    2020-11-22 16:21

    You can use RDD's with Structured and unstructured where as Dataframe/Dataset can only process Structured and Semi Structured Data (It is having proper schema)

提交回复
热议问题