Real-time computing framework Spark: Lightning-fast cluster computing
Real-time processing framework

Real-time processing denotes processing, transforming and analyzing data on the fly.

Spark: Apache Spark has an advanced DAG execution engine that supports acyclic data flow and in-memory computing.

Spark Streaming: a real-time stream data processor. (Is it the same kind of thing as Apache Storm, the distributed realtime computation system? See the comparison article on Spark Streaming vs Apache Storm.) Spark Streaming is an extension of the core Spark API that enables scalable, high-throughput, fault-tolerant stream processing of live data streams. Data can be ingested from many sources like Kafka,