Cloud Dataflow - how does Dataflow do parallelism?

与世无争的帅哥 提交于 2019-12-06 08:07:39
  1. Yes, based on this answer.
  2. No, you can establish more connections. Based on my answer, you can use a async http client to have more concurrent requests. As this answer also describes, you need to collect the results from these asynchronous calls and output it synchronously in any @ProcessElement or @FinishBundle.
  3. See 2.
  4. Since your resource usage is low, it indicates that the worker spends most of its time waiting for a response. I think with the described approach above, you can utilize your resources far better and you can achieve the same performance with far less workers.
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!