问题
I have a data frame in pyspark
like sample below. I would like to duplicate a column in the data frame and rename to another column name.
Name Age Rate
Aira 23 90
Ben 32 98
Cat 27 95
Desired output is :
Name Age Rate Rate2
Aira 23 90 90
Ben 32 98 98
Cat 27 95 95
How can I do it?
回答1:
Just
df.withColumn("Rate2", df["Rate"])
or (in SQL)
SELECT *, Rate AS Rate2 FROM df
来源:https://stackoverflow.com/questions/50399361/duplicate-a-column-in-pyspark-data-frame