COLLECT_SET() in Hive, keep duplicates?

后端 未结 9 1532
离开以前
离开以前 2020-12-12 17:06

Is there a way to keep the duplicates in a collected set in Hive, or simulate the sort of aggregate collection that Hive provides using some other method? I want to aggregat

9条回答
  •  半阙折子戏
    2020-12-12 18:01

    Check out the Brickhouse collect UDAF ( http://github.com/klout/brickhouse/blob/master/src/main/java/brickhouse/udf/collect/CollectUDAF.java )

    It also supports collecting into a map. Brickhouse also contains many useful UDF's not in the standard Hive distribution.

提交回复
热议问题