问题
I want to install Impala on an Ubuntu instance. So far, none of the methods below didn't work.
How can I install a stable version of Impala in Ubuntu?
Failed method nr. 1: apt-get
First I tried to install binaries using
sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store
However, there are problems with the public key of Impala's repository:
Err:8 https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release
404 Not Found [IP: 151.101.192.167 443]
Err:4 http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease
The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 327574EE02A818DD
Reading package lists... Done
E: The repository 'https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release' does not have a Release file.
N: Updating from such a repository can't be done securely, and is therefore disabled by defau
lt.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease: The following signatures couldn't be verified because the public key is not available:
NO_PUBKEY 327574EE02A818DD
E: The repository 'http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1
InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
Failed method nr. 2: Building it
I did git clone https://git-wip-us.apache.org/repos/asf/impala.git
, then ran bin/bootstrap_build.sh
.
However, the file bootstrap_build.sh references a non-existent shell script in the last line ./buildall.sh -notests -so
.
Update 1: When I run sudo dpkg --install impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
I get the following dependencies errors:
dpkg: dependency problems prevent configuration of impala:
impala depends on bigtop-utils (>= 0.7); however:
Package bigtop-utils is not installed.
impala depends on hadoop; however:
Package hadoop is not installed.
impala depends on hadoop-hdfs; however:
Package hadoop-hdfs is not installed.
impala depends on hadoop-yarn; however:
Package hadoop-yarn is not installed.
impala depends on hadoop-mapreduce; however:
Package hadoop-mapreduce is not installed.
impala depends on hbase; however:
Package hbase is not installed.
impala depends on hive (>= 0.12.0+cdh5.1.0); however:
Package hive is not installed.
impala depends on zookeeper; however:
Package zookeeper is not installed.
impala depends on libhdfs0; however:
Package libhdfs0 is not installed.
impala depends on avro-libs; however:
Package avro-libs is not installed.
impala depends on parquet; however:
Package parquet is not installed.
impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
Package sentry is not installed.
dpkg: error processing package impala (--install):
dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
impala
Update 2: Installing Hadoop according to this tutorial did not fix the Hadoop dependency problems.
回答1:
It's easier to install Impala if you grab packages that are bundled with a release since these are tied to a set of software versions. Pick a CDH version (like cdh5.14.4 for example) then install the appropriate packages. So looking at:
http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/
you can wget
the packages and then run dpkg install
to install them:
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-catalog_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-dbg_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-server_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-shell_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-state-store_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
dpkg --install impal*.deb
To setup the Impala on Ubuntu16, do the following:
wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/cloudera.list'
mv cloudera.list /etc/apt/sources.list.d/cloudera.list
wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/archive.key'
apt-key add archive.key`
Then perform the original instructions:
sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store
References
- Installing the Latest CDH 5 Release - On Ubuntu or Debian Systems
- Installing the Latest CDH 5 Release - Step 2: Optionally Add a Repository Key
回答2:
I will suggest to use github repo but stable branch i.e. 2.x
And while installing from dpkg won't install all dependencies, you have two options:
install all dependencies from error
I have used the script / heredoc below to generate a command. It can be used with other errors by changing the error messages in the heredoc.
use
apt-get install ./impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
This is the easier method as everything is taken care of.
$ echo "sudo apt-get install " `grep "Package" << Devidas | awk '{print $2}' | tr '\r\n' ' ' ; echo ' '
dpkg: dependency problems prevent configuration of impala:
impala depends on bigtop-utils (>= 0.7); however:
Package bigtop-utils is not installed.
impala depends on hadoop; however:
Package hadoop is not installed.
impala depends on hadoop-hdfs; however:
Package hadoop-hdfs is not installed.
impala depends on hadoop-yarn; however:
Package hadoop-yarn is not installed.
impala depends on hadoop-mapreduce; however:
Package hadoop-mapreduce is not installed.
impala depends on hbase; however:
Package hbase is not installed.
impala depends on hive (>= 0.12.0+cdh5.1.0); however:
Package hive is not installed.
impala depends on zookeeper; however:
Package zookeeper is not installed.
impala depends on libhdfs0; however:
Package libhdfs0 is not installed.
impala depends on avro-libs; however:
Package avro-libs is not installed.
impala depends on parquet; however:
Package parquet is not installed.
impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
Package sentry is not installed.
dpkg: error processing package impala (--install):
dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
impala
Devidas`
This should result in something like:
sudo apt-get install bigtop-utils hadoop hadoop-hdfs hadoop-yarn hadoop-mapreduce \
hbase hive zookeeper libhdfs0 avro-libs parquet sentry
来源:https://stackoverflow.com/questions/52502493/how-to-install-impala-on-ubuntu