Why does column 1st_from_end contain null:
1st_from_end
from pyspark.sql.functions import split df = sqlContext.createDataFrame([(\'a b c d\',)], [\'s\',]) d
Create your own udf would look like this
def get_last_element(l): return l[-1] get_last_element_udf = F.udf(get_last_element) df.select(get_last_element(split(df.s, ' ')).alias('1st_from_end')