Pyspark : forward fill with last observation for a DataFrame
可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试): 问题: Using Spark 1.5.1, I've been trying to forward fill null values with the last known observation for one column of my DataFrame. It is possible to start with a null value and for this case I would to backward fill this null value with the first knwn observation. However, If that too complicates the code, this point can be skipped. In this post , a solution in Scala was provided for a very similar problem by zero323 . But, I don't know Scala and I don't succeed to ''translate'' it in Pyspark API code. It's possible to do it with Pyspark ?