Using map/reduce java classes in hive queries

后端 未结 3 885
执念已碎
执念已碎 2021-01-20 19:34

I read like, in hive queries we can use map/reduce scripts.

Can we use java map/reduce classes in Hive queries. If so, can you please provide me the sample.

3条回答
  •  忘掉有多难
    2021-01-20 19:58

    You can start writing you custom Mapper and Reducer following the Link provided by @Moiz Arafat.

    After that make a jar out of it (say myjar.jar).

    Working of such custom Mapper and Reducer require hive-contrib.jar, which can be found in the lib folder of your hive installation.

    Before running a query which uses these Mapper/Reducer, you should add these two jars to hive.

      hive> add file path/to/hive-contrib.jar;
      hive> add file path/to/myjar.jar;
    

    Please use add file and NOT add jar.

    Now you can write your query like :

        hive> FROM (
        FROM src
        MAP inkey, inval
        USING 'java -cp myjar.jar:hive-contrib.jar com.myco.mymr.MyMapper'
        AS k, v
        CLUSTER BY k) mapout
        REDUCE k, v
        USING 'java -cp myjar.jar:hive-contrib.jar com.myco.mymr.MyReducer'
    

提交回复
热议问题