Algorithm for ranking popular blog posts

北战南征 提交于 2019-12-06 09:13:29

问题


I am building a blog aggregator like Techmeme that finds most popular posts from several blogs. Unlike Techmeme, first, I aggregate blog posts from a variety of RSS feeds, then save the headlines and relevant URLs in database. After that, I have to find what the most popular blog posts are.

For defining top blog post headlines, I track Facebook and Twitter share counts for every post of every blog and I rank the blog posts for their share counts. But that isn't the best solution because some bloggers can cheat via increasing their sharing counts with fraudulent shares.

So my question is what criterias could I use to define what the most popular posts are? What would be a better algorithm for ranking blog posts?


回答1:


Since the term 'popular' in this context is vague I would define the popularity of posts according to my criterias. Combine all suggested answers and make a reasonable reputation system for the blog posts. For instance, basically I would do something like this.

  • facebook share x 2
  • twitter share x 3
  • pagerank of the domain x 2
  • 50 000 / global alexa rating
  • and so on

Finally, you may sum up all these and compare. Moreover, you can develop some criterias take into account of size of size of posts, number of images within the post, etc.




回答2:


It may be possible to estimate the joint distribution of shares across different sources. It's hard to detect fraudulence for marginalized (i.e. single) metrics, but it's harder to fake a holistic "organic" profile.




回答3:


How about using variation of PageRank?

here is the more details. http://pr.efactory.de/e-pagerank-algorithm.shtml http://en.wikipedia.org/wiki/PageRank?PHPSESSID=e371f8cacb91eff0c852a0e001893a9a



来源:https://stackoverflow.com/questions/9570384/algorithm-for-ranking-popular-blog-posts

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!