Is there a way to convert a Spark DataFrame (not an RDD) to a pandas DataFrame?
I tried the following:
var some_df = Seq( ("A", "no"), ("B", "yes"), ("B",
The following should work:
some_df = sc.parallelize([("A", "no"), ("B", "yes"), ("B", "yes"), ("B", "no")]).toDF(["user_id", "phone_number"])
pandas_df = some_df.toPandas()
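One thing to keep in mind is that toPandas() collects every row onto the driver, so it is only safe when the result fits in driver memory. Below is a minimal sketch of the same conversion outside the shell, where `sc` is not predefined. The helper name `to_pandas_capped`, the `max_rows` default, and the `demo` function are made up for illustration and are not part of Spark; the local master setting is also just an assumption for trying it on one machine.

```python
def to_pandas_capped(spark_df, max_rows=100_000):
    """Collect at most `max_rows` rows of a Spark DataFrame into pandas.

    toPandas() pulls all rows onto the driver, so limit() is applied first
    as a guard. The default cap is an arbitrary illustrative value.
    """
    return spark_df.limit(max_rows).toPandas()


def demo():
    # Imported here so the helper above can be defined without starting Spark.
    from pyspark.sql import SparkSession

    # Assumption: a single-threaded local session, just for experimenting.
    spark = (
        SparkSession.builder
        .master("local[1]")
        .appName("to-pandas-demo")
        .getOrCreate()
    )

    # Same sample data as in the answer, built without going through an RDD.
    some_df = spark.createDataFrame(
        [("A", "no"), ("B", "yes"), ("B", "yes"), ("B", "no")],
        ["user_id", "phone_number"],
    )

    pandas_df = to_pandas_capped(some_df)
    print(type(pandas_df))
    print(pandas_df)

    spark.stop()
```

Calling demo() needs pyspark and pandas installed. If you really do want the whole dataset, plain some_df.toPandas() as in the answer is enough.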