Write parquet from AWS Kinesis firehose to AWS S3

后端 未结 3 664
孤城傲影
孤城傲影 2020-12-24 02:07

I would like to ingest data into s3 from kinesis firehose formatted as parquet. So far I have just find a solution that implies creating an EMR, but I am looking for somethi

3条回答
  •  感情败类
    2020-12-24 02:42

    Good news, this feature was released today!

    Amazon Kinesis Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries

    To enable, go to your Firehose stream and click Edit. You should see Record format conversion section as on screenshot below:

    See the documentation for details: https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html

提交回复
热议问题