I am going to write an small application which requires twitter data. Can some one give me a good data source for twitter data. It would be great if data is large as the application is going to run a data mining algorithm.
Why not use the APIs from Twitter? You can build a data set with the Streaming API or just iterate through a dictionary and use the Search API
You should use the Twitter Streaming API Sample resource to collect data from Twitter. The Search API is not intended for automated search.
This short tutorial shows how to get and filter data using Twitter API. You set language, keyword and other parameters.
If you need to integrate a simple widget in your web pages you can get JavaScript source at twitter web site.
You can get a twitter dataset from http://dmml.asu.edu/datasets/
You could try jetwick (open source)
At the moment we have 10 mio tweets per week, but you can improve that with more servers ;)
来源:https://stackoverflow.com/questions/4251768/twitter-public-dataset