I am working with a PySpark DataFrame with n columns. I have a set of m columns (m < n), and my task is to compute, for each row, the maximum value across those m columns.
For example: given columns c1, c2, and c3, I want a new column holding the row-wise maximum of the three.
You can also use the pyspark built-in greatest for the row-wise maximum (least is its counterpart for the row-wise minimum):

from pyspark.sql.functions import greatest, col
df = df.withColumn('max', greatest(col('c1'), col('c2'), col('c3')))
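For completeness, here is a minimal, self-contained sketch (assuming a local SparkSession and hypothetical sample columns c1, c2, c3) that shows greatest and least side by side, plus how to apply them to an arbitrary list of m column names:

from pyspark.sql import SparkSession
from pyspark.sql.functions import greatest, least, col

spark = SparkSession.builder.master('local[*]').appName('rowwise-max').getOrCreate()

# Hypothetical sample data with three numeric columns.
df = spark.createDataFrame([(1, 7, 4), (9, 2, 5), (3, 3, 8)], ['c1', 'c2', 'c3'])

# greatest/least take two or more columns and work row-wise;
# they skip nulls and return null only if every input is null.
result = (df
          .withColumn('max', greatest(col('c1'), col('c2'), col('c3')))
          .withColumn('min', least(col('c1'), col('c2'), col('c3'))))
result.show()
# Expected: (1, 7, 4) -> max=7, min=1; (9, 2, 5) -> max=9, min=2; (3, 3, 8) -> max=8, min=3

# For an arbitrary set of m column names, unpack the list
# (greatest requires at least two columns):
m_cols = ['c1', 'c2', 'c3']  # hypothetical list of column names
df = df.withColumn('max', greatest(*[col(c) for c in m_cols]))

Note that this differs from an aggregation like F.max, which operates down a column; greatest and least compare values across columns within each row.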