Datasets for Apache Mahout

Posted by 旧城冷巷雨未停 on 2019-12-22 08:56:20

Question


I am looking for datasets that can be used to implement the recommendation system use case of Apache Mahout. I only know of the MovieLens data sets from the GroupLens Research group.

Does anyone know of any other datasets that can be used for implementing a recommendation system? I am particularly interested in item-based data sets, though other datasets are most welcome.


Answer 1:


This is Sebastian from Mahout.

There is a dataset from a Czech dating website available that might be of interest to you: http://www.occamslab.com/petricek/data/

By the way, the term "item-based" refers to a particular collaborative filtering approach, not to the dataset itself. The data usually comes in the common form of user-item-rating triples, which most collaborative filtering approaches work with.
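To make the distinction concrete, here is a minimal Python sketch (not Mahout code). It assumes the triples are stored one per line as comma-separated `userID,itemID,rating`; the sample values and the helper names `parse_triples`, `item_vectors` and `cosine_similarity` are made up for illustration. The same triples are simply pivoted by item, and an item-based approach then compares those per-item rating vectors. Cosine similarity is used here only as one possible similarity measure.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical sample data: one "userID,itemID,rating" triple per line.
SAMPLE = """\
1,101,5.0
1,102,3.0
2,101,4.0
2,102,2.0
3,101,1.0
3,103,5.0
"""


def parse_triples(text):
    """Parse comma-separated user,item,rating lines into (int, int, float) tuples."""
    triples = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        user, item, rating = line.split(",")
        triples.append((int(user), int(item), float(rating)))
    return triples


def item_vectors(triples):
    """Pivot the triples into item -> {user: rating}."""
    vectors = defaultdict(dict)
    for user, item, rating in triples:
        vectors[item][user] = rating
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity between two sparse rating vectors keyed by user."""
    dot = sum(a[u] * b[u] for u in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


triples = parse_triples(SAMPLE)
vectors = item_vectors(triples)
sim_101_102 = cosine_similarity(vectors[101], vectors[102])
sim_102_103 = cosine_similarity(vectors[102], vectors[103])
print(sim_101_102, sim_102_103)
```

A user-based approach would pivot the very same triples by user instead, which is why the dataset itself is neither "item-based" nor "user-based".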

We would love to hear about your experimental results and experiences (if you want to share them) on our user mailing list at user@mahout.apache.org




Answer 2:


While searching for data sets, I found a few sites that list publicly available data sets which can be used for data mining. Some of these can be used with Mahout too.

Bixo Labs

UCI Datasets

KDnuggets




Answer 3:


You can look at the iPinYou RTB bidding data set. Quora: http://qr.ae/OrqgM, data release: http://contest.ipinyou.com/data-release.html



Source: https://stackoverflow.com/questions/3490785/datasets-for-apache-mahout
