How to create a Dataflow pipeline from Pub/Sub to GCS in Python

谎友^ 2020-12-20 18:21

I want to use Dataflow to move data from Pub/Sub to GCS. Specifically, I want Dataflow to accumulate messages over a fixed period (15 minutes, for example), then wr

2 Answers
  •  甜味超标
    2020-12-20 18:51

    I ran into the same error and found a workaround, though not a fix:

    TypeError: Cannot convert GlobalWindow to apache_beam.utils.windowed_value._IntervalWindowBase [while running 'test-file-out/Write/WriteImpl/WriteBundles']
    

    It happens both when running locally with DirectRunner and on Dataflow with DataflowRunner.

    Reverting to apache-beam[gcp]==2.9.0 allows my pipeline to run as expected.
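
For the batching the question describes, here is a rough sketch of the idea behind fixed 15-minute windows, written in plain Python rather than Beam so it runs anywhere. It is not the pipeline from this thread: `window_bounds`, `output_name`, `batch_by_window`, the `gs://my-bucket/output/` prefix, and the file-naming scheme are all hypothetical names chosen for illustration, and messages are assumed to arrive as `(epoch_seconds, payload)` pairs.

```python
from collections import defaultdict
from datetime import datetime, timezone

WINDOW_SECONDS = 15 * 60  # 15-minute fixed windows, as in the question


def window_bounds(timestamp, size=WINDOW_SECONDS):
    """Return the [start, end) fixed window, in epoch seconds, containing timestamp."""
    start = timestamp - (timestamp % size)
    return start, start + size


def output_name(prefix, start, end):
    """Build one file name per window (hypothetical naming scheme)."""
    fmt = "%Y-%m-%dT%H-%M"
    start_str = datetime.fromtimestamp(start, tz=timezone.utc).strftime(fmt)
    end_str = datetime.fromtimestamp(end, tz=timezone.utc).strftime(fmt)
    return f"{prefix}{start_str}_{end_str}.txt"


def batch_by_window(messages, prefix="gs://my-bucket/output/"):
    """Group (timestamp, payload) pairs so each window maps to one output file."""
    batches = defaultdict(list)
    for timestamp, payload in messages:
        start, end = window_bounds(timestamp)
        batches[output_name(prefix, start, end)].append(payload)
    return dict(batches)
```

In an actual Beam pipeline, the window assignment would normally be done with `beam.WindowInto(FixedWindows(15 * 60))` (from `apache_beam.transforms.window`) rather than by hand, and each window's group of messages would then be written to its own file. The sketch above only shows which messages end up together and how a per-window file name could be derived.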
