ERROR: protocol “gphdfs” does not exist

Posted by 六月ゝ 毕业季﹏ on 2019-12-12 03:29:17

Question


When I run

postgres=#   CREATE EXTERNAL TABLE csv_hdfs_lineitem (like a) LOCATION (
    'gphdfs://xxxxx/gptest/lineitem.csv'
) FORMAT 'text' (delimiter E'|' null E'\\N' escape E'off' fill missing fields)
ENCODING 'UTF8'
;

it shows:

ERROR: protocol "gphdfs" does not exist

How do I configure Greenplum to support the gphdfs protocol?


Answer 1:


  1. Install the Hadoop client on all GPDB nodes and add it to the class path.
  2. Set two GUCs (server configuration parameters): gp_hadoop_target_version, which identifies the Hadoop distribution, and gp_hadoop_home, which points to the Hadoop installation directory.
  3. Restart GPDB.
  4. Grant access to the gphdfs protocol to gpadmin.
  5. Try creating the gphdfs external table again.
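
The steps above might look roughly like the following sketch. The values 'hdp2' and '/usr/lib/hadoop' are assumptions for illustration only; use the distribution identifier and installation path that match your cluster, as listed in the documentation for your GPDB version. Steps 2 and 3 are run from a shell on the master host, so they appear here only as comments.

```sql
-- Steps 2 and 3 (shell on the master host, as gpadmin; example values are assumptions):
--   gpconfig -c gp_hadoop_target_version -v "'hdp2'"
--   gpconfig -c gp_hadoop_home -v "'/usr/lib/hadoop'"
--   gpstop -r

-- After the restart, confirm that both parameters are set as expected.
SHOW gp_hadoop_target_version;
SHOW gp_hadoop_home;

-- Step 4: grant read and write access on the gphdfs protocol.
-- gpadmin is the role named in the answer; substitute the role that owns the external table.
GRANT SELECT ON PROTOCOL gphdfs TO gpadmin;
GRANT INSERT ON PROTOCOL gphdfs TO gpadmin;

-- Step 5: re-run the CREATE EXTERNAL TABLE statement from the question.
```

If the same error still appears after these steps, the protocol may not be installed on that GPDB version, so it is worth checking the linked installation page against the release you are running.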

For details, see the following link:

http://gpdb.docs.pivotal.io/43110/admin_guide/load/topics/g-one-time-hdfs-protocol-installation.html#topic20



Source: https://stackoverflow.com/questions/41156655/error-protocol-gphdfs-does-not-exist
