Extremely slow S3 write times from EMR/ Spark

前端 未结 6 1143
梦如初夏
梦如初夏 2020-12-23 12:20

I\'m writing to see if anyone knows how to speed up S3 write times from Spark running in EMR?

My Spark Job takes over 4 hours to complete, however the cluster is onl

6条回答
  •  予麋鹿
    予麋鹿 (楼主)
    2020-12-23 12:33

    How large is the file(s) you are writing too? Having one core writing to a very large file is going to be much slower than splitting the file up and have multiple workers write out smaller files.

提交回复
热议问题