Using AWS Glue to convert very large csv.gz files (30-40 GB each) to Parquet

醉梦人生 2021-01-24 07:19

There are lots of questions like this, but nothing seems to help. I am trying to convert quite large csv.gz files to Parquet and keep getting various errors like

\'C         


        
2 answers
  •  栀梦 (original poster)
    2021-01-24 07:51

    How many DPUs are you using? This article gives a nice overview of DPU capacity planning. Hope that helps. AWS publishes no definitive rule for how many DPUs you need to process a given data size.
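
Since there is no official sizing rule, a rough back-of-the-envelope estimate can at least frame the problem. The sketch below is not from the linked article. It relies on one known property, that gzip is not a splittable format, so Spark reads each .gz file in a single task regardless of how many DPUs the job has. Everything else is an assumption chosen for illustration: the compression ratio of 5, the 128 MiB target partition size, and the helper names `initial_read_tasks` and `plan_partitions`.

```python
import math

MIB = 1024 ** 2
GIB = 1024 ** 3


def initial_read_tasks(num_gz_files: int) -> int:
    """Number of tasks that can read the input in parallel.

    gzip cannot be split, so each .gz file is decompressed by exactly
    one task no matter how large the file is.
    """
    return num_gz_files


def plan_partitions(
    compressed_bytes: int,
    assumed_ratio: float = 5.0,               # assumption: CSV shrinks ~5x under gzip
    target_partition_bytes: int = 128 * MIB,  # assumption: ~128 MiB per partition
) -> int:
    """Estimate how many partitions to spread one decompressed file over."""
    uncompressed_bytes = compressed_bytes * assumed_ratio
    return max(1, math.ceil(uncompressed_bytes / target_partition_bytes))


if __name__ == "__main__":
    size = 40 * GIB  # one file at the upper end of the sizes in the question
    print("parallel read tasks for 1 file:", initial_read_tasks(1))
    print("suggested partitions after read:", plan_partitions(size))
```

Under these assumptions, the estimate suggests that adding DPUs alone will not speed up the initial read of a single file. The partition count could be passed to `DataFrame.repartition` right after the read so that later stages and the Parquet write are spread across workers. Measure the real compression ratio on a sample before trusting the numbers.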
