I\'m planning to use one of the hadoop file format for my hadoop related project. I understand parquet is efficient for column based query and avro for full
Just for a description on Parquet, you can refer here: http://bigdata.devcodenote.com/2015/04/parquet-file-format.html
I intend to write very soon on Avro and a comparison between the 2 as well. Will post it here once done.