Cloud Dataflow - how does Dataflow do parallelism?
问题 My question is, behind the scene, for element-wise Beam DoFn (ParDo), how does the Cloud Dataflow parallel workload? For example, in my ParDO, I send out one http request to an external server for one element. And I use 30 workers, each has 4vCPU. Does that mean on each worker, there will be 4 threads at maximum? Does that mean from each worker, only 4 http connections are necessary or can be established if I keep them alive to get the best performance? How can I adjust the level of