Running EMR Spark With Multiple S3 Accounts

前端 未结 4 1246
隐瞒了意图╮
隐瞒了意图╮ 2020-12-29 11:12

I have an EMR Spark Job that needs to read data from S3 on one account and write to another.
I split my job into two steps.

  1. read data from the S3 (no

4条回答
  •  夕颜
    夕颜 (楼主)
    2020-12-29 11:20

    For controlling access of the resources, generally IAM roles are managed as a standard practice. Assume roles are used when you want to access resources in a different account. If you or your organisation follow the same then you should follow https://aws.amazon.com/blogs/big-data/securely-analyze-data-from-another-aws-account-with-emrfs/. The basic idea here is to use a credentials provider with which the access is obtained by EMRFS to access objects in S3 buckets. You can go one step further and make the ARN for STS and buckets parameterized for the JAR created in this blog.

提交回复
热议问题