Why is my fusion breaker losing or holding back data?
问题 I am working on a streaming Dataflow pipeline that consumes messages of batched items from PubSub and eventually writes them to Datastore. For better parallelism, and also for timely acknowledgement of the messages pulled from the PubSub, I unpack the batches into individual items and add a fusion breaker right after it. So the pipeline looks like this ... PubSubIO -> deserialize -> unpack -> fusion break -> validation/transform -> DatastoreIO. Here is my fusion breaker, largely copied from