Compute differences between succesive records in Hadoop with Hive Queries

前端 未结 3 1615
后悔当初
后悔当初 2021-01-12 04:26

I have a Hive table that holds data of customer calls. For simplicity consider it has 2 columns, first column holds the customer ID and the second column holds the timestamp

3条回答
  •  萌比男神i
    2021-01-12 05:10

    You can use explicit MAP-REDUCE with other programming language like Java or Python. Where emit from map {cutomer_id,call_time} and in reducer you will get {customer_id,list{time_stamp}} and in reducer you can sort these time stamps and can process the data.

提交回复
热议问题