I have a dataframe in PySpark. Some of its numerical columns contain 'nan', so when I read the data and check the schema of the dataframe, those columns come back with 'string' type instead of an integer type. How can I convert them to integers?
from pyspark.sql.types import IntegerType

# Cast each string column to integer and overwrite the original column
data_df = data_df.withColumn("Plays", data_df["Plays"].cast(IntegerType()))
data_df = data_df.withColumn("drafts", data_df["drafts"].cast(IntegerType()))
You can run a loop over the columns instead, but this is the simplest way to convert a string column to an integer.
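If you have many columns to convert, a minimal sketch of the loop approach could look like the following (the int_cols list of column names is a hypothetical placeholder; adjust it to your own schema):

from pyspark.sql.types import IntegerType

# Hypothetical list of string-typed columns that should become integers
int_cols = ["Plays", "drafts"]

for col_name in int_cols:
    # Overwrite each column with its value cast to IntegerType
    data_df = data_df.withColumn(col_name, data_df[col_name].cast(IntegerType()))

Note that any value that cannot be parsed as an integer (such as 'nan') becomes null after the cast, so you may want to handle those rows afterwards, e.g. with fillna() or dropna().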