Implementing WordCount in Scala

Submitted by 我怕爱的太早我们不能终老 on 2019-12-26 23:21:26
import org.apache.spark.rdd.RDD
import org.apache.spark.{SparkConf, SparkContext}

object WordCount {
  def main(args: Array[String]): Unit = {
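    // Run locally on all available cores; the app name is what shows up in the Spark UI.
    val config: SparkConf = new SparkConf().setMaster("local[*]").setAppName("WordCount")
    val sc = new SparkContext(config)

    // Read the input file as an RDD with one element per line.
    val lines: RDD[String] = sc.textFile("in/word.txt")
    // Split each line on single spaces and flatten the pieces into one RDD of words.
    val words: RDD[String] = lines.flatMap(_.split(" "))
    // Pair every word with an initial count of 1.
    val wordToOne: RDD[(String, Int)] = words.map((_, 1))
    // Sum the counts for each distinct word.
    val wordToSum: RDD[(String, Int)] = wordToOne.reduceByKey(_ + _)

    // Bring the results back to the driver and print each (word, count) pair.
    val result: Array[(String, Int)] = wordToSum.collect()
    result.foreach(println)

    // Equivalent one-line version:
    // lines.flatMap(_.split(" ")).map((_, 1)).reduceByKey(_+_).collect().foreach(println)

    // Release Spark resources once the job is done.
    sc.stop()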
  }
}
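
To see what each stage of the pipeline above does without starting Spark, the same steps can be sketched with plain Scala collections. This is only an illustration under stated assumptions: `LocalWordCount` and `countWords` are hypothetical names that do not appear in the original post, the input is a small in-memory `Seq` rather than `in/word.txt`, and `groupBy` followed by a sum stands in for Spark's `reduceByKey`. The sketch also drops empty strings, which the Spark version does not do.

```scala
object LocalWordCount {
  // Hypothetical helper (not part of the original post): counts words in
  // in-memory lines, mirroring the flatMap -> map -> reduceByKey steps.
  def countWords(lines: Seq[String]): Map[String, Int] =
    lines
      .flatMap(_.split(" "))   // split each line on single spaces and flatten
      .filter(_.nonEmpty)      // extra step: skip empty tokens from blank lines or repeated spaces
      .map(word => (word, 1))  // pair every word with a count of 1
      .groupBy(_._1)           // collect the pairs that share the same word
      .map { case (word, pairs) => (word, pairs.map(_._2).sum) } // add up the 1s per word

  def main(args: Array[String]): Unit = {
    val sample = Seq("hello spark", "hello scala") // assumed sample input
    // Sort by word so the printed order is stable.
    countWords(sample).toSeq.sortBy(_._1).foreach(println)
    // prints:
    // (hello,2)
    // (scala,1)
    // (spark,1)
  }
}
```

The main difference is where the work happens: `groupBy` here runs in a single JVM, while `reduceByKey` in Spark combines counts within each partition before shuffling them, which is why the RDD version scales to inputs that do not fit on one machine.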