问题
Say we have one worker with 4 CPU cores. How does parallelism configured in Dataflow worker machines? Do we parallelize beyond # of cores?
Where would this type of information be available?
回答1:
One worker thread is used per core, and each worker thread independently processes a chunk of the input space.
来源:https://stackoverflow.com/questions/47777639/google-cloud-dataflow-worker-threading