Spark 2.0.x dump a csv file from a dataframe containing one array of type string

前端 未结 6 1589
难免孤独
难免孤独 2020-11-29 07:07

I have a dataframe df that contains one column of type array

df.show() looks like

|ID|ArrayOfString|Age|Gender|
+--+-------         


        
6条回答
  •  再見小時候
    2020-11-29 07:41

    No need for a UDF if you already know which fields contain arrays. You can simply use Spark's cast function:

    import org.apache.spark.sql.functions._
    val dumpCSV = df.withColumn("ArrayOfString", col("ArrayOfString").cast("string"))
                    .write
                    .csv(path="/home/me/saveDF")
    

    Hope that helps.

提交回复
热议问题