How does Hive stores data and what is SerDe?

后端未结

关注

 4  1231

执笔经年 2021-02-04 12:50

when querying a table, a SerDe will deserialize a row of data from the bytes in the file to objects used internally by Hive to operate on that row of data. when

4条回答

萌比男神i (楼主)

2021-02-04 13:47

I think the above has the concepts serialise and deserialise back to front. Serialise is done on write, the structured data is serialised into a bit/byte stream for storage. On read, the data is deserialised from the bit/byte storage format to the structure required by the reader. eg Hive needs structures that look like rows and columns but hdfs stores the data in bit/byte blocks, so serialise on write, deserialise on read.

0 讨论(0)

查看其它4个回答
发布评论:

提交评论
- 加载中...