What is most efficient way to write from kafka to hdfs with files partitioning into dates

后端 未结 5 1951
囚心锁ツ
囚心锁ツ 2020-12-28 20:53

I\'m working on project that should write via kafka to hdfs. Suppose there is online server that writes messages into the kafka. Each message includes timestamp in it. I w

5条回答
  •  醉酒成梦
    2020-12-28 21:51

    if you are using Apache Kafka 0.9 or above, you can use the Kafka Connect API.

    check out https://github.com/confluentinc/kafka-connect-hdfs

    This is a Kafka connector for copying data between Kafka and HDFS.

提交回复
热议问题