mahout

Mahout Plugin for ruby on rails

我与影子孤独终老i 提交于 2019-12-03 09:17:03
问题 I want to use Apache Mahout in my project on Ruby on Rails for implementing recommendations and collaborative filtering. In Particular my requirements are: suggesting related tags. suggesting related articles. based on user's preferences prompt him for review of articles. based on geographical location, and other meta information of a user, suggest him similar users. I am open to using any other solution (other than mahout) if it integrates with rails easily and fulfills my requirements. 回答1:

how can I compile/using mahout for hadoop 2.0?

匿名 (未验证) 提交于 2019-12-03 08:52:47
可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试): 由 翻译 强力驱动 问题: The latest release mahout 0.9 is only built on hadoop 1.x. (mvn clean install) How can I compile mahout for hadoop 2.0.x? Because When I was running the commands: hadoop jar mahout-examples-0.9-SNAPSHOT-job.jar org.apache.mahout.cf.taste.hadoop.item.RecommenderJob -s SIMILARITY_COOCCURENCE -i test -o result I always got the error message IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected. Thanks! 回答1: To compile Mahout to work with 2.x since it isn't released in a package

Web page recommender system

杀马特。学长 韩版系。学妹 提交于 2019-12-03 06:25:20
问题 I am trying to build a recommender system which would recommend webpages to the user based on his actions(google search, clicks, he can also explicitly rate webpages). To get an idea the way google news does it, it displays news articles from the web on a particular topic. In technical terms that is clustering, but my aim is similar. It will be content based recommendation based on user's action. So my questions are: How can I possibly trawl the internet to find related web-pages? And what

Hadoop家族学习路线图(转)

纵然是瞬间 提交于 2019-12-03 06:09:27
Hadoop家族学习路线图 Hadoop家族系列文章 ,主要介绍Hadoop家族产品,常用的项目包括Hadoop, Hive, Pig, HBase, Sqoop, Mahout, Zookeeper, Avro, Ambari, Chukwa,新增加的项目包括,YARN, Hcatalog, Oozie, Cassandra, Hama, Whirr, Flume, Bigtop, Crunch, Hue等。 从2011年开始,中国进入大数据风起云涌的时代,以Hadoop为代表的家族软件,占据了大数据处理的广阔地盘。开源界及厂商,所有数据软件,无一不向Hadoop靠拢。Hadoop也从小众的高富帅领域,变成了大数据开发的标准。在Hadoop原有技术基础之上,出现了Hadoop家族产品,通过“大数据”概念不断创新,推出科技进步。 作为IT界的开发人员,我们也要跟上节奏,抓住机遇,跟着Hadoop一起雄起! 关于作者: 张丹(Conan), 程序员Java,R,PHP,Javascript weibo:@Conan_Z blog: http://blog.fens.me email: bsspirit@gmail.com 转载请注明出处: http://blog.fens.me/hadoop-family-roadmap/ 前言 使用Hadoop已经有一段时间了,从开始的迷茫

Recommendation Engines for Java applications [closed]

痴心易碎 提交于 2019-12-03 05:13:48
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 5 years ago . I was wondering if there is any open source recommendation engine available? It should suggest something like Amazon and Netflix. I have heard of a framework called Apache Mahout - Taste. I am trying it next week. It would be great if you can share your valuable thoughts. 回答1: I'm the developer of Mahout / Taste

Support Vector Machine for Java?

跟風遠走 提交于 2019-12-03 03:14:11
问题 I'd like to write a "smart monitor" in Java that sends out an alert any time it detects oncoming performance issues. My Java app is writing data in a structured format to a log file: <datetime> | <java-method> | <seconds-to-execute> So, for example, if I had a Widget#doSomething(String) method that took 812ms to execute, it would be logged as: 2013-03-24 11:39:21 | Widget#doSomething(String) | 812 As performance starts to degrade (such as during a major collection, during peak loads, or if

Run cvb in mahout 0.8

≡放荡痞女 提交于 2019-12-03 01:48:23
The current Mahout 0.8-SNAPSHOT includes a Collapsed Variational Bayes (cvb) version for Topic Modeling and removed the Latent Dirichlet Analysis (lda) approach, because cvb can be parallelized way better. Unfortunately there is only documentation for lda on how to run an example and generate meaningful output. Thus, I want to: preprocess some texts correctly run the cvb0_local version of cvb inspect the results by looking at the top n words in each of the generated topics So here are the subsequent Mahout commands I had to call in a linux shell to do it. $MAHOUT_HOME points to my mahout/bin

Clustering — Sparse vector and Dense Vector

爱⌒轻易说出口 提交于 2019-12-03 00:28:12
For clustering, Mahout input needs to be in vector form. There are two types of vector implementations. One is Sparse Vector and another is Dense Vector. What is difference between two ? Usage scenarios for Sparse and Dense ? Concept-wise, most of the values in a sparse vector are zero, in a dense vector they are not. Same for dense and sparse matrices. The terms sparse and dense generally describe these properties, not only in Mahout. In Mahout the DenseVector assumes not too many zero entries and therefore "Implements vector as an array of doubles" ( org.apache.mahout.math.DenseVector ). In

What's difference between item-based and content-based collaborative filtering?

岁酱吖の 提交于 2019-12-03 00:14:14
问题 I am puzzled about what the item-based recommendation is, as described in the book "Mahout in Action". There is the algorithm in the book: for every item i that u has no preference for yet for every item j that u has a preference for compute a similarity s between i and j add u's preference for j, weighted by s, to a running average return the top items, ranked by weighted average How can I calculate the similarity between items? If using the content, isn't it a content-based recommendation?

基于 Apache Mahout 构建社会化推荐引擎-实战

一笑奈何 提交于 2019-12-02 22:24:04
参考文章 IBM<<基于 Apache Mahout 构建社会化推荐引擎>>, 文章的链接: http://www.ibm.com/developerworks/cn/java/j-lo-mahout/ 安装条件: 1) JDK,使用1.6版本。 MyEclipse /Eclipse 2) Maven,使用2.0.11版本或以上。 3) Apache Mahout,使用0.5版本。 4) Tomcat 7.0 或者其他版本。 MySQL 以下会具体说明。 一、 Taste的安装与简单的 Demo 实现 下载得到Apache Mhout的发布版本。 http://mahout.apache.org/ 建议下载0.5版本 mahout-distribution-0.5-src.zip,解压缩。 下载Maven,并且配置环境,见 http://blog.csdn.net/highram/article/details/7190839 在MyEclipse或者Eclipse中构建Mahout,见 http://www.cnblogs.com/dlts26/archive/2011/09/13/2174889.html 启动jetty服务器: cmd 在 taste-web目录下敲命令:mvn jetty:run-war 输入url: http://localhost:8080/mahout