Foursquare Dataset

1. NYC Restaurant Rich Dataset (Check-ins, Tips, Tags)

Location based social networks have attracted millions of users and massively contains their digital footprints. We have crawled a part of these digital footprints from Foursquare in order to study the problems of personalized location recommendation and search. This dataset includes check-in, tip and tag data of restaurant venues in NYC collected from Foursquare from 24 October 2011 to 20 February 2012. It contains 3112 users and 3298 venues with 27149 check-ins and 10377 tips (written in English).

Please download the dataset here and check the readme file here

Please cite our paper if you publish material based on those datasets.


2. NYC and Tokyo Check-in Dataset

This dataset contains check-ins in NYC and Tokyo collected for about 10 month (from 12 April 2012 to 16 February 2013). It contains 227,428 check-ins in New York city and 573,703 check-ins in Tokyo. Each check-in is associated with its time stamp, its GPS coordinates and its semantic meaning (represented by fine-grained venue-categories). This dataset is originally used for studying the spatial-temporal regularity of user activity in LBSNs.

Please download the dataset here and check the readme file here

Please cite our paper if you publish material based on those datasets.


3. Global-scale Check-in Dataset

This dataset includes long-term (about 18 months from April 2012 to September 2013) global-scale check-in data collected from Foursquare. It contains 33,278,683 checkins by 266,909 users on 3,680,126 venues (in 415 cities in 77 countries). Those 415 cities are the most checked 415 cities by Foursquare users in the world, each of which contains at least 10K check-ins. Please see the references for more details about data collection and processing.

Please download the dataset here (about 775MB zipped) and check the readme file here

Please cite our paper if you publish material based on this dataset.


4. User Profile Dataset 

This dataset includes some user profile data for privacy study (i.e., gender, #friends, #followers). It contains 18,201 and 11,874 users who have checked in New York City and Tokyo, respectively. The corresponding user check-in data can be found in the global-scale check-in dataset I published. The two dataset can be linked by the anonymized user ID (the unique key). Please see the references for more details about data collection and processing.

Please download the dataset here and check the readme file here

Please cite our paper if you publish material based on this dataset.


5. Global-scale Check-in Dataset with User Social Networks

This dataset includes long-term (about 22 months from Apr. 2012 to Jan. 2014) global-scale check-in data collected from Foursquare, and also two snapshots of user social networks before and after the check-in data collection period (see more details in our paper). The check-in dataset contains 22,809,624 checkins by 114,324 users on 3,820,891 venues. The social network data contains 363,704 (old) and 607,333 (new) friendships. Due to frequent requests, we also include the raw check-in dataset containing 90,048,627 checkins by 2,733,324 users on 11,180,160 venues.

Please download the dataset here (~2.68GB zipped, ~4.08GB unzipped) and check the readme file here

Please cite our paper if you publish material based on this dataset.