问题
when I
postgres=# CREATE EXTERNAL TABLE csv_hdfs_lineitem (like a) LOCATION (
'gphdfs://xxxxx/gptest/lineitem.csv'
) FORMAT 'text' (delimiter E'|' null E'\\N' escape E'off' fill missing fields)
ENCODING 'UTF8'
;
it shows
ERROR: protocol "gphdfs" does not exist
I want to know how to configure greenplum to support gphdfs protocol
回答1:
- you need to install hadoop client to all gpdb nodes and add class_path
- setup 2 guc, gp_hadoop_target_version and gp_hadoop_home pointing to the hadoop distribution and binary.
- restart gpdb
- grant protocol access to gpadmin.
- try gphdfs external table.
For detail, check the following link
http://gpdb.docs.pivotal.io/43110/admin_guide/load/topics/g-one-time-hdfs-protocol-installation.html#topic20
来源:https://stackoverflow.com/questions/41156655/error-protocol-gphdfs-does-not-exist