I use pyarrow to create and analyse Parquet tables with biological information and I need to store some metadata, e.g. which sample the data comes from, how it was obtained and processed.
Parquet seems to support file-wide metadata, but I cannot find how the write it via pyarrow. The closest thing I could find is how to write row-group metadata, but this seems like an overkill, since my metadata is the same for all row groups in the file.
Is there any way to write file-wide Parquet metadata with pyarrow?
来源:https://stackoverflow.com/questions/52122674/how-to-write-parquet-metadata-with-pyarrow