Inspect Parquet from command line

前端 未结 9 1482
再見小時候
再見小時候 2020-12-07 20:26

How do I inspect the content of a Parquet file from the command line?

The only option I see now is

$ hadoop fs -get my-path local-file
$ parquet-tool         


        
9条回答
  •  旧巷少年郎
    2020-12-07 20:45

    If you're using HDFS, the following commands are very useful as they are frequently used (left here for future reference):

    hadoop jar parquet-tools-1.9.0.jar schema hdfs://path/to/file.snappy.parquet
    hadoop jar parquet-tools-1.9.0.jar head -n5 hdfs://path/to/file.snappy.parquet
    

提交回复
热议问题