Convert Parquet to CSV

我的未来我决定 提交于 2020-01-01 09:18:52

问题


How to convert Parquet to CSV from a local file system (e.g. python, some library etc.) but WITHOUT Spark? (trying to find as simple and minimalistic solution as possible because need to automate everything and not much resources).

I tried with e.g. parquet-tools on my Mac but data output did not look correct.

Need to make output so that when data is not present in some columns - CSV will have corresponding NULL (empty column between 2 commas)..

Thanks.


回答1:


You can do this by using the Python packages pandas and pyarrow (pyarrow is an optional dependency of pandas that you need for this feature).

import pandas as pd
df = pd.read_parquet('filename.parquet')
df.to_csv('filename.csv')

When you need to make modifications to the contents in the file, you can standard pandas operations on df.



来源:https://stackoverflow.com/questions/51215166/convert-parquet-to-csv

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!