reddit

The Ultimate GitHub Guide: How to "Mine" GitHub

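别来无恙 submitted on 2019-11-28 22:41:44
"Should I use GitHub for my projects?" Aspiring data scientists often ask me this question. There is only one answer: "Absolutely!" GitHub is an extremely valuable platform for data scientists who want to stand out. You can treat it as an online résumé for showing your code to recruiters and other professionals. GitHub hosts open-source projects from top technology giants such as Google, Facebook, IBM, and NVIDIA, which has drawn even more attention to the platform. Whether you are a beginner in data science or a professional, you should have a GitHub account. To save you the time of hunting for high-value repositories (and there are many), I'm happy to list them in this article and hand them to you directly. This collection covers a variety of use cases: computer vision (object detection and segmentation), a PyTorch implementation of Google AI's record-breaking BERT model for NLP, extracting the latest research papers and their abstracts, and more. Why include Reddit discussions in this article? I personally find Reddit an extremely rewarding platform, because it offers rich content, ideas from many machine learning and deep learning experts, some "startling" topics, open-source resources, and so on. I could go on about why I like it, which shows how highly I regard Reddit: its discussions are that unique and valuable. You can look at the top GitHub repositories and Reddit discussions (from April onward) that we have covered each month: January: https://www.analyticsvidhya.com/blog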

Reddit-style nested/threaded/indented comments for Rails?

情到浓时终转凉″ submitted on 2019-11-28 13:53:44
Question: I'm wondering if someone has already built a system for threaded comments (for lack of a better term) in Rails or if I need to build it myself. In case it's not clear, what I'm referring to is a comment system like Reddit's that automatically indents replies so that they appear like branches of a tree (preferably with voting just as Reddit does). If someone could point me to code that does this, it would be greatly appreciated. Or perhaps there is an open source project that includes this

How much memory will a list with one million elements take up in Python?

廉价感情. submitted on 2019-11-28 13:26:18
There are more than a million subreddits on Reddit, according to redditmetrics.com. I wrote a script that repeatedly queries this Reddit API endpoint until all the subreddits are stored in an array, all_subs:

    all_subs = []
    for sub in <repeated request here>:
        all_subs.append({"name": display_name, "subscribers": subscriber_count})

The script has been running for close to ten hours, and it's about halfway done (it gets rate-limited every three or four requests). When it's finished, I expect an array like this:

    [ { "name": "AskReddit", "subscribers": 16751677 }, { "name": "news", "subscribers":
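
One way to get a rough answer without waiting for the full crawl is to build a smaller list of the same shape and extrapolate. The sketch below is only an estimate under stated assumptions: `deep_size` and `build_sample` are hypothetical helpers, the sample names and counts are made-up stand-ins for real API data, and the result depends on the Python version and platform. It relies on `sys.getsizeof`, which reports only an object's own size, so the helper walks into containers and counts each distinct object once.

```python
import sys


def deep_size(obj, seen=None):
    """Rough recursive size in bytes; each distinct object is counted once."""
    if seen is None:
        seen = set()
    if id(obj) in seen:
        return 0
    seen.add(id(obj))
    size = sys.getsizeof(obj)  # size of the object itself, not its contents
    if isinstance(obj, dict):
        size += sum(deep_size(k, seen) + deep_size(v, seen) for k, v in obj.items())
    elif isinstance(obj, (list, tuple, set, frozenset)):
        size += sum(deep_size(item, seen) for item in obj)
    return size


def build_sample(n):
    # Hypothetical stand-in data: real names and counts would come from the Reddit API.
    return [{"name": f"subreddit_{i}", "subscribers": i} for i in range(n)]


sample = build_sample(10_000)
per_entry = deep_size(sample) / len(sample)
estimated_total_mib = per_entry * 1_000_000 / (1024 ** 2)
print(f"~{per_entry:.0f} bytes per entry, "
      f"~{estimated_total_mib:.0f} MiB for one million entries")
```

The list itself holds only references to its elements, so most of the memory goes to the per-entry dicts and strings. If the total turns out to be too large, storing each entry as a tuple instead of a dict is one common way to shrink it.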

How much memory will a list with one million elements take up in Python?

大城市里の小女人 submitted on 2019-11-27 07:41:54
Question: There are more than a million subreddits on Reddit, according to redditmetrics.com. I wrote a script that repeatedly queries this Reddit API endpoint until all the subreddits are stored in an array, all_subs:

    all_subs = []
    for sub in <repeated request here>:
        all_subs.append({"name": display_name, "subscribers": subscriber_count})

The script has been running for close to ten hours, and it's about halfway done (it gets rate-limited every three or four requests). When it's finished, I expect an