Spark Streaming: stateless overlapping windows vs. keeping state

后端未结

关注

 2  1758

轮回少年 2021-02-01 18:42

What would be some considerations for choosing stateless sliding-window operations (e.g. reduceByKeyAndWindow) vs. choosing to keep state (e.g. via updateStateByKey or the new m

2条回答

情深已故 (楼主)

2021-02-01 18:56
I think one of other drawbacks of third approach is that the RDDs are not received chronologically..considering running them on a cluster..
```
ongoingEventsStream.foreachRDD { /*accumulate state in casssandra*/ }
```
also what about check-pointing and driver node failure..In that case do u read the whole data again? curious to know how you wanna handle this?

I guess maybe mapwithstate is a better approach why you consider all these scenario..
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...