Hot off the press, a script that is able to gather large-ish amounts of geo tagged tweet data from Twitter's public API. Running for 24 hours will yield ~300MB (~6 Million Tweets) in the following format timestamp tweet_id tweet_language country lat lng 1367706951 330813049193762816 es ES 39.4643 -0.3548 For more details check out: twitter-geo-data on github.