Spark 2.0 deprecates 'DirectParquetOutputCommitter', how to live without it?

情歌与酒 2021-01-31 10:57

Recently we migrated from "EMR on HDFS" to "EMR on S3" (EMRFS with consistent view enabled), and we found that the Spark 'SaveAsTable' (Parquet format) writes to S3 were ~4x
