Show distinct column values in a PySpark DataFrame (Python)

忘了有多久 2020-12-23 10:55

Please suggest a PySpark DataFrame alternative to pandas df['col'].unique().

I want to list out all the unique values in a pyspark dataframe column.

9 answers

無奈伤痛 (original poster)

2020-12-23 11:31

If you want to select the distinct rows across all columns of a DataFrame (df), then:

df.select('*').distinct().show(10,truncate=False)
