How to prevent dask client from dying on worker exception?
Question

I don't understand the resiliency model in dask distributed.

Problem

An exception raised by a worker kills an embarrassingly parallel dask operation. All workers and clients die if any worker encounters an exception.

Expected Behavior

The documentation at http://distributed.dask.org/en/latest/resilience.html#user-code-failures suggests that exceptions should be contained to workers and that subsequent tasks would go on without interruption:

"When a function raises an error that error is kept and