elephantbird

Elephant-bird mvn package error

匆匆过客 提交于 2019-12-05 11:01:58
I have installed hadoop 2.2 in my system. I want to use Elephant-Bird jar. Am getting following error while runnning "mvn package". Error: [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:2.3.2:compile (default-compile) on project elephant-bird-core: Compilation failure: Compilation failure: [ERROR] /usr/lib/hadoop/elephant_bird/core/target/generated-sources/thrift/com/twitter/elephantbird/thrift/test/TestListInList.java: [9,39] error: package org.apache.commons.lang3.builder does not exist [ERROR] /usr/lib/hadoop/elephant_bird/core/target/generated-sources/thrift

Loading data from HDFS does not work with Elephantbird

安稳与你 提交于 2019-12-03 21:54:50
I am trying to process data with elephantbird in pig but I don't succeed in loading the data. Here is my pig script: register 'lib/elephant-bird-core-3.0.9.jar'; register 'lib/elephant-bird-pig-3.0.9.jar'; register 'lib/google-collections-1.0.jar'; register 'lib/json-simple-1.1.jar'; twitter = LOAD 'statuses.log.2013-04-01-00' USING com.twitter.elephantbird.pig.load.JsonLoader('-nestedLoad'); DUMP twitter; The output I get is [main] INFO org.apache.pig.Main - Apache Pig version 0.11.0-cdh4.3.0 (rexported) compiled May 27 2013, 20:48:21 [main] INFO org.apache.pig.Main - Logging error messages

Use elephant-bird with hive to read protobuf data

半城伤御伤魂 提交于 2019-12-02 06:23:05
问题 I have a similar problem like this one The followning are what I used: CDH4.4 (hive 0.10) protobuf-java-.2.4.1.jar elephant-bird-hive-4.6-SNAPSHOT.jar elephant-bird-core-4.6-SNAPSHOT.jar elephant-bird-hadoop-compat-4.6-SNAPSHOT.jar The jar file which include the protoc compiled .class file. And I flow Protocol Buffer java tutorial create my data "testbook". And I use hdfs dfs -mkdir /protobuf_data to create HDFS folder. Use hdfs dfs -put testbook /protobuf_data to put "testbook" to HDFS. Then