I have a bucket with thousands of files in it. How can I search the bucket? Is there a tool you can recommend?
You can search by prefix directly in the AWS Console bucket view.
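The Console filter only matches on key prefix; the equivalent from the command line is aws s3 ls with a prefix path (the bucket and prefix below are placeholders):

aws s3 ls s3://your-bucket/some-prefix/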
When you have thousands or millions of files, another way to get at the files you want is to copy them to another location using a distributed copy, run as a Hadoop job on EMR. The cool thing is that AWS provides s3-dist-cp, their S3-optimized version of DistCp, which lets you group the files you want with a regular expression via the --groupBy option. You can use it, for example, in a custom step on EMR:
[
    {
        "ActionOnFailure": "CONTINUE",
        "Args": [
            "s3-dist-cp",
            "--s3Endpoint=s3.amazonaws.com",
            "--src=s3://mybucket/",
            "--dest=s3://mytarget-bucket/",
            "--groupBy=MY_PATTERN",
            "--targetSize=1000"
        ],
        "Jar": "command-runner.jar",
        "Name": "S3DistCp Step Aggregate Results",
        "Type": "CUSTOM_JAR"
    }
]
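If you save that JSON to a file, you can submit it to a running cluster with aws emr add-steps (the cluster ID and file name here are placeholders):

aws emr add-steps --cluster-id j-XXXXXXXXXXXXX --steps file://./s3distcp-step.json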
Try this command:
aws s3api list-objects --bucket your-bucket --prefix sub-dir-path --output text --query 'Contents[].{Key: Key}'
Then you can pipe the output into grep to pull out specific file types and do whatever you want with them.
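For instance, to keep only the .csv keys under that prefix (the extension is just an illustration):

aws s3api list-objects --bucket your-bucket --prefix sub-dir-path --output text --query 'Contents[].{Key: Key}' | grep '\.csv$'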
Fast forward to 2020: using aws-okta for 2FA, the following command worked fine, though it was slow as hell iterating through all of the objects and folders in this particular bucket (270,000+).
aws-okta exec dev -- aws s3 ls my-cool-bucket --recursive | grep needle-in-haystax.txt
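If you know part of the key's path, you can cut the runtime considerably by scoping the listing to a prefix instead of the whole bucket (the prefix here is a placeholder):

aws-okta exec dev -- aws s3 ls s3://my-cool-bucket/some/prefix/ --recursive | grep needle-in-haystax.txt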