Converting RDD[org.apache.spark.sql.Row] to RDD[org.apache.spark.mllib.linalg.Vector]

前端 未结 3 1693
天命终不由人
天命终不由人 2021-01-11 21:36

I am relatively new to Spark and Scala.

I am starting with the following dataframe (single column made out of a dense Vector of Doubles):

scala> v         


        
3条回答
  •  清歌不尽
    2021-01-11 21:50

    import org.apache.spark.mllib.linalg.Vectors
    
    scaledDataOnly
       .rdd
       .map{
          row => Vectors.dense(row.getAs[Seq[Double]]("features").toArray)
         }
    

提交回复
热议问题