Question
I am new to Apache Flume. I need to install Flume on top of an HDFS cluster environment. I googled it, and everyone says to use the Cloudera distribution, but I need to install and configure it from source.
Can anyone suggest where to start and how to customize the Flume agent and sink services?
Answer 1:
I have just installed Apache Flume 1.3 on Ubuntu.
You need to download the binary archive, extract it, and create a config file, which uses the same format as a Java properties file.
Installing and running agents is an easy process; just read this
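As a rough illustration of the properties-style config file mentioned above, here is a minimal sketch that writes one to disk. The agent name a1, the component names r1, c1 and k1, the port, and the file name example.conf are all illustrative choices, not anything the answer specifies. The commented-out HDFS sink lines use a hypothetical NameNode address.

```shell
#!/bin/sh
# Sketch: write a minimal single-agent Flume config in Java-properties syntax.
# Agent name (a1), component names (r1, c1, k1) and file name are illustrative.
cat > example.conf <<'EOF'
# Name the components of agent a1
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# Source: listen for lines of text on a local TCP port
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444

# Channel: buffer events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000

# Sink: log events to the console, which is handy for a first test
a1.sinks.k1.type = logger
# On a cluster, an HDFS sink could be used instead.
# The NameNode host and port below are hypothetical:
# a1.sinks.k1.type = hdfs
# a1.sinks.k1.hdfs.path = hdfs://namenode:8020/flume/events

# Wire the source and the sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
EOF
```

Assuming the file sits in the extracted Flume directory, the agent could then be started with `bin/flume-ng agent --conf conf --conf-file example.conf --name a1 -Dflume.root.logger=INFO,console`.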
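Answer 2:
Installation steps
# Step 1: go to http://flume.apache.org/download.html
# and download the binary tarball apache-flume-1.4.0-bin.tar.gz, for example from this mirror:
# http://apache.mesi.com.ar/flume/1.4.0/apache-flume-1.4.0-bin.tar.gz
# Extract the archive, then rename the extracted directory (not the tarball) to flume
tar -xvzf apache-flume-1.4.0-bin.tar.gz
mv apache-flume-1.4.0-bin flume
chown -R hduser:hadoop flume
# Go to the flume conf directory (adjust the path to where you extracted it)
cd flume/conf
# Flume reads flume-env.sh, so copy the template before editing it
cp flume-env.sh.template flume-env.sh
vi flume-env.sh
# Set JAVA_HOME to your Java path
# Set FLUME_CLASSPATH to the Flume classpath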
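The two settings named at the end of the steps above (the Java path and the Flume classpath) might end up looking like the sketch below. The conf directory default, the JDK path and the jar path are all hypothetical and need to be replaced with the values for your machine. Note that this sketch overwrites any existing flume-env.sh in that directory.

```shell
#!/bin/sh
# Sketch: create flume-env.sh with the two settings named in the steps above.
# FLUME_CONF_DIR defaults to ./flume/conf here; that default is an assumption.
FLUME_CONF_DIR="${FLUME_CONF_DIR:-./flume/conf}"
mkdir -p "$FLUME_CONF_DIR"

# Overwrites an existing flume-env.sh in that directory.
cat > "$FLUME_CONF_DIR/flume-env.sh" <<'EOF'
# Java installation used to start the agent (hypothetical path)
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64

# Extra jars to put on the agent's classpath (hypothetical path)
FLUME_CLASSPATH="/path/to/extra/jars/*"
EOF
```

The quoted heredoc delimiter keeps the shell from expanding anything inside the file, so the wildcard is written literally and left for Java to interpret as a classpath entry.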
Answer 3:
Here is an example tutorial that I've put together that uses Flume, Apache, Flask and Kibana in a Vagrant environment.
There is a video at the bottom to give a better intro:
https://github.com/mross462/Flume-Elasticsearch
Answer 4:
The course Moving Data into Hadoop, offered by Big Data University at http://bigdatauniversity.com/bdu-wp/bdu-course/moving-data-into-hadoop/, gives an introduction to Flume and includes a lab exercise that helps with both configuring and running Flume.
Source: https://stackoverflow.com/questions/14170282/how-to-install-and-configure-apache-flume