I have a PySpark dataframe that includes timestamps in a column (call the column \'dt\'), like this:
2018-04-07 16:46:00 2018-03-06 22:18:00
One simple way to do it with string manipulation:
from pyspark.sql.functions import lit, concat df = df.withColumn('date', concat(df.date.substr(0, 10), lit(' 00:00:00')))