S3 data partitioning for bucket logging files
问题 I have a s3 bucket "ABC" and logging is enabled at this bucket and logs stored in "ABC-logs". There are many files comes in "ABC-logs" per day. Now I want to segregate these logs year wise. For ex: s3://ABC-logs/year=2015 s3://ABC-logs/year=2016 s3://ABC-logs/year=2017 What is the best way to do this. I thought to do it via awscli but each year end, I will have to change bucket logging folder. 回答1: The traditional way to do this is via an Amazon EMR cluster . You can use Hive to create an