How to install Impala on Ubuntu? [closed]

Submitted by 半世苍凉 on 2020-01-02 10:57:08

Question


I want to install Impala on an Ubuntu instance. So far, none of the methods below have worked.

How can I install a stable version of Impala in Ubuntu?

Failed method nr. 1: apt-get

First I tried to install binaries using

sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store

However, there are problems with the public key of Impala's repository:

Err:8 https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release
  404  Not Found [IP: 151.101.192.167 443]
Err:4 http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease
  The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 327574EE02A818DD
Reading package lists... Done
E: The repository 'https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release' does not have a Release file.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease: The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 327574EE02A818DD
E: The repository 'http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.

Failed method nr. 2: Building it

I ran git clone https://git-wip-us.apache.org/repos/asf/impala.git, then ran bin/bootstrap_build.sh.

However, the last line of bootstrap_build.sh, ./buildall.sh -notests -so, references a shell script that does not exist.

Update 1: When I run sudo dpkg --install impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb I get the following dependency errors:

dpkg: dependency problems prevent configuration of impala:
 impala depends on bigtop-utils (>= 0.7); however:
  Package bigtop-utils is not installed.
 impala depends on hadoop; however:
  Package hadoop is not installed.
 impala depends on hadoop-hdfs; however:
  Package hadoop-hdfs is not installed.
 impala depends on hadoop-yarn; however:
  Package hadoop-yarn is not installed.
 impala depends on hadoop-mapreduce; however:
  Package hadoop-mapreduce is not installed.
 impala depends on hbase; however:
  Package hbase is not installed.
 impala depends on hive (>= 0.12.0+cdh5.1.0); however:
  Package hive is not installed.
 impala depends on zookeeper; however:
  Package zookeeper is not installed.
 impala depends on libhdfs0; however:
  Package libhdfs0 is not installed.
 impala depends on avro-libs; however:
  Package avro-libs is not installed.
 impala depends on parquet; however:
  Package parquet is not installed.
 impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
  Package sentry is not installed.
dpkg: error processing package impala (--install):
 dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
 impala

Update 2: Installing Hadoop according to this tutorial did not fix the Hadoop dependency problems.


Answer 1:


It's easier to install Impala if you grab the packages bundled with a CDH release, since these are tied to a consistent set of software versions. Pick a CDH version (cdh5.14.4, for example), then install the appropriate packages. Looking at:

http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/

you can wget the packages and then run dpkg --install to install them:

wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-catalog_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-dbg_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-server_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-shell_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-state-store_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb

sudo dpkg --install impala*.deb
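
The six wget commands above differ only in the package name, so the URLs can be generated in a loop. Below is a minimal sketch, assuming the same archive location and version string as in the commands above; the function name impala_urls and the variables BASE and VERSION are made up for illustration.

```shell
#!/bin/sh
# Sketch: print the download URL of each Impala package for one CDH release.
# BASE and VERSION are copied from the wget commands above; adjust both
# together if you pick a different CDH release.
BASE='http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala'
VERSION='2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64'

# impala_urls is a hypothetical helper name, not part of any Impala tooling.
impala_urls() {
    for pkg in impala impala-catalog impala-dbg impala-server impala-shell impala-state-store; do
        printf '%s/%s_%s.deb\n' "$BASE" "$pkg" "$VERSION"
    done
}

# Uncomment to download every package in one go:
# impala_urls | xargs -n 1 wget
```

Printing the URLs first makes it easy to check them against the directory listing before anything is downloaded.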

To set up Impala on Ubuntu 16.04, do the following:

wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/cloudera.list'
sudo mv cloudera.list /etc/apt/sources.list.d/cloudera.list
wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/archive.key'
sudo apt-key add archive.key

Then run the original commands:

sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store

References

  • Installing the Latest CDH 5 Release - On Ubuntu or Debian Systems
  • Installing the Latest CDH 5 Release - Step 2: Optionally Add a Repository Key



Answer 2:


I suggest building from the GitHub repository, but from a stable branch, i.e. 2.x.

Installing with dpkg does not pull in dependencies, so you have two options:

  1. Install every dependency listed in the error output.

    I used the script / heredoc below to generate the command. It can be reused for other errors by changing the error messages in the heredoc.

  2. Use apt-get install ./impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb

    This is the easier method, as apt-get takes care of the dependencies.

$ echo "sudo apt-get install " `grep "Package" << Devidas | awk '{print $2}' | tr '\r\n' ' ' ; echo ' '
dpkg: dependency problems prevent configuration of impala:
 impala depends on bigtop-utils (>= 0.7); however:
  Package bigtop-utils is not installed.
 impala depends on hadoop; however:
  Package hadoop is not installed.
 impala depends on hadoop-hdfs; however:
  Package hadoop-hdfs is not installed.
 impala depends on hadoop-yarn; however:
  Package hadoop-yarn is not installed.
 impala depends on hadoop-mapreduce; however:
  Package hadoop-mapreduce is not installed.
 impala depends on hbase; however:
  Package hbase is not installed.
 impala depends on hive (>= 0.12.0+cdh5.1.0); however:
  Package hive is not installed.
 impala depends on zookeeper; however:
  Package zookeeper is not installed.
 impala depends on libhdfs0; however:
  Package libhdfs0 is not installed.
 impala depends on avro-libs; however:
  Package avro-libs is not installed.
 impala depends on parquet; however:
  Package parquet is not installed.
 impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
  Package sentry is not installed.
dpkg: error processing package impala (--install):
 dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
 impala
Devidas`

This should result in something like:

sudo apt-get install bigtop-utils hadoop hadoop-hdfs hadoop-yarn hadoop-mapreduce \
    hbase hive zookeeper libhdfs0 avro-libs parquet sentry
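
The same extraction can be wrapped in a small function that reads dpkg's error output from standard input, so the messages do not have to be pasted into a heredoc. This is only a sketch: the name missing_packages is made up for illustration, and it assumes dpkg prints its messages in English (hence LC_ALL=C in the usage comment).

```shell
#!/bin/sh
# Sketch: list the packages that dpkg reports as missing.
# missing_packages is a hypothetical helper name.
missing_packages() {
    # Lines of interest look like: "  Package hadoop is not installed."
    # seen[] drops duplicates while keeping the original order.
    awk '$1 == "Package" && /is not installed/ && !seen[$2]++ { print $2 }' |
        paste -s -d ' ' -
}

# Possible usage (dpkg writes its errors to stderr, so redirect it):
# deps=$(sudo LC_ALL=C dpkg --install impala_*.deb 2>&1 | missing_packages)
# sudo apt-get install $deps
```

Note that apt-get can only install these packages if a repository that provides them, such as the Cloudera one, is configured.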


Source: https://stackoverflow.com/questions/52502493/how-to-install-impala-on-ubuntu