Using Spark 1.6, I have a Spark DataFrame column (named, let's say, `col1`) with values A, B, C, DS, DNS, E, F, G and H, and I want to create a new column by mapping these values to new ones.
The simplest solution is probably the `DataFrame.replace` method (available since Spark 1.4, so it works on 1.6 as well): http://spark.apache.org/docs/2.4.0/api/python/pyspark.sql.html#pyspark.sql.DataFrame.replace
```python
mapping = {'A': '1', 'B': '2'}
df2 = df.replace(to_replace=mapping, subset=['yourColName'])
```