How to execute multiple queries in parallel and distributed?
问题 I am using spark 2.4.1 version and java 8. I have scenario like: Will be provided a list of classifiers from a property file to process. These classifiers determines the data what to pull and process. Something like the below: val classifiers = Seq("classifierOne","classifierTwo","classifierThree"); for( classifier : classifiers ){ // read from CassandraDB table val acutalData = spark.read(.....).where(<classifier conditition>) // the data varies depend on the classifier passed in // this