Does the driver need constant access to the master node? Or is it only required to get initial resource allocation? What happens if master is not available after Spark cont
The first, and for the time being probably the most serious, consequence of a master failure or a network partition is that your cluster won't be able to accept new applications. This is why the Master is considered a single point of failure when the cluster is used with its default configuration.
Master loss will be acknowledged by the running applications, but otherwise these should continue to work more or less as if nothing had happened, with two important exceptions:
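For context on the non-default case: Spark's standalone mode can run standby Masters coordinated through ZooKeeper (the Master daemons are started with spark.deploy.recoveryMode=ZOOKEEPER and spark.deploy.zookeeper.url), and applications then list every Master in a single master URL. Below is a minimal sketch of building such a URL. The host names master1 and master2 are hypothetical, and the helper standalone_master_url is illustrative only, not part of Spark.

```python
def standalone_master_url(masters):
    """Build a Spark standalone master URL from (host, port) pairs.

    With several Masters the URL takes the form
    spark://host1:port1,host2:port2, so the application can register
    with whichever Master is currently the leader.
    """
    if not masters:
        raise ValueError("at least one master is required")
    return "spark://" + ",".join(f"{host}:{port}" for host, port in masters)


# Hypothetical host names; 7077 is the default standalone Master port.
url = standalone_master_url([("master1", 7077), ("master2", 7077)])
print(url)  # spark://master1:7077,master2:7077
```

The resulting string is what you would pass as the master, for example to SparkConf().setMaster(url) or to spark-submit --master.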
Yes, the driver and master communicate constantly throughout the SparkContext's lifetime. That allows the driver to:
rdd.count()
won't terminate until the job is completed) and get their result.
A disconnect between the driver and the master will fail the job.
Below are the steps a Spark application performs when it starts:
So, yes, a master failure will leave the executors unable to communicate with it, so they will stop working. It will also leave the driver unable to reach the master for job status, so your application will fail.