Joining two RDD[String] - Spark Scala
Question:

I have two RDDs:

rdd1 [String,String,String]: Name, Address, Zipcode
rdd2 [String,String,String]: Name, Address, Landmark

I am trying to join these two RDDs using rdd1.join(rdd2), but I am getting an error:

error: value fullOuterJoin is not a member of org.apache.spark.rdd.RDD[String]

The join should combine the two RDD[String] inputs, and the output RDD should look something like:

rddOutput: Name, Address, Zipcode, Landmark

In the end, I want to save the result as a JSON file. Can someone help me with this?

Answer 1:

As said in the