How to use google cloud storage in dataflow pipeline run from datalab

后端 未结 2 549
心在旅途
心在旅途 2021-01-22 00:51

We\'ve been running a Python pipeline in datalab that reads image files from a bucket in google cloud storage (importing google.datalab.storage). Originally we were using Direct

2条回答
  •  长情又很酷
    2021-01-22 01:53

    The most likely issue is that you need to have Dataflow install the datalab pypi module.

    Typically you would do this by listing "datalab" in the requirements.txt file you upload to Dataflow. See https://cloud.google.com/dataflow/pipelines/dependencies-python

提交回复
热议问题