Context
I have a python spark job transforming a dataframe into json and writing it to elastic search. The time to write the (almost) identical numbe