View RDD contents in Python Spark?

醉酒成梦 2020-11-29 03:40

Running a simple app in pyspark.

from operator import add
f = sc.textFile("README.md")
wc = f.flatMap(lambda x: x.split(' ')).map(lambda x: (x, 1)).reduceByKey(add)

6 Answers

一个人的身影 (OP)

2020-11-29 04:26
You can collect the entire RDD, which returns a Python list of its elements (here, (word, count) tuples), and print that list:
```
print(wc.collect())
```
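If the RDD is large, collecting all of it onto the driver may be slow or run out of memory. A minimal sketch of a lighter alternative, assuming each element is a (word, count) tuple as in the question: `take(n)` returns only the first n elements. The helper name `format_sample` is hypothetical, not part of PySpark.
```python
def format_sample(rdd, n=10):
    """Return up to n elements of an RDD as printable lines.

    Calls rdd.take(n), which returns only the first n elements to the
    driver, instead of rdd.collect(), which returns every element.
    Assumes each element is a (word, count) pair.
    """
    return ["%s: %s" % (word, count) for word, count in rdd.take(n)]


# In the pyspark shell, with wc defined as in the question:
#     for line in format_sample(wc, 5):
#         print(line)
```
Which elements come back first depends on how the data is partitioned, so treat the result as a sample rather than a sorted view.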
