Avro vs. Parquet

自闭症患者 2020-12-07 09:39

I'm planning to use one of the Hadoop file formats for my Hadoop-related project. I understand Parquet is efficient for column-based queries and Avro for full ...

7 answers
  • 2020-12-07 10:21

    Avro is a row-based format. If you want to retrieve the data as a whole, you can use Avro.

    Parquet is a column-based format. If your data has many columns but you are only interested in a subset of them, you can use Parquet.

    HBase is useful when the data is updated frequently. Avro is fast at retrieving whole records; Parquet is much faster when a query reads only a subset of the columns.
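
To make the row vs. column distinction concrete, here is a rough sketch in plain Python. It is only a toy in-memory model, not the actual Avro or Parquet on-disk format, and the sample records and function names are made up for illustration.

```python
# Toy in-memory illustration only. Real Avro and Parquet files add schemas,
# encoding, and compression on top of these basic layouts.

# Hypothetical sample records.
records = [
    {"id": 1, "name": "alice", "score": 90},
    {"id": 2, "name": "bob", "score": 75},
    {"id": 3, "name": "carol", "score": 82},
]

# Row-oriented layout (the Avro idea): all fields of a record sit together.
row_store = [(r["id"], r["name"], r["score"]) for r in records]

# Column-oriented layout (the Parquet idea): all values of a column sit together.
column_store = {
    "id": [r["id"] for r in records],
    "name": [r["name"] for r in records],
    "score": [r["score"] for r in records],
}


def full_record_from_rows(index):
    """Fetch one whole record: a single lookup in the row layout."""
    return row_store[index]


def full_record_from_columns(index):
    """Fetch one whole record: one lookup per column in the columnar layout."""
    return tuple(column_store[name][index] for name in ("id", "name", "score"))


def average_score_from_rows():
    """Average one field: every full row is visited in the row layout."""
    return sum(row[2] for row in row_store) / len(row_store)


def average_score_from_columns():
    """Average one field: only the 'score' column is touched."""
    scores = column_store["score"]
    return sum(scores) / len(scores)
```

Both layouts give the same answers. The difference is how much data has to be touched: reading a whole record is one step in the row layout, while aggregating a single field only needs one column in the columnar layout.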
