Apache Spark Naive Bayes based Text Classification
可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试): 问题: im trying to use Apache Spark for document classification. For example i have two types of Class (C and J) Train data is : C, Chinese Beijing Chinese C, Chinese Chinese Shanghai C, Chinese Macao J, Tokyo Japan Chinese And test data is : Chinese Chinese Chinese Tokyo Japan // What is ist J or C ? How i can train and predict as above datas. I did Naive Bayes text classification with Apache Mahout, however no with Apache Spark. How can i do this with Apache Spark? 回答1: Yes, it doesn't look like there is any simple tool to do that in Spark yet.