bayomim/foursquare-dataset

foursquare-dataset

A Foursquare dataset for contextual suggestion

We collected this dataset for our participation in the TREC 2016 Contextual Suggestion track: https://sites.google.com/site/treccontext/

The dataset comprises 228,778 attractions such as parks, restaurants, museums, etc.

Update (2018-07-23): a new set of 969 attractions was added.
        The dataset now contains 229,747 attractions.

The dataset is in JSON format and it was collected from 2017-01-30 to 2017-02-22.

Update: the last 969 attractions in the dataset were collected on 2018-07-23

Attractions in this dataset are from different cities in the US.

The dataset is for research use ONLY.

Please cite our paper if you publish material based on this dataset.

@inproceedings{bayomi2016adapt_tcd,
   title={ADAPT\_TCD: An Ontology-Based Context Aware Approach for Contextual Suggestion.},
   author={Bayomi, Mostafa and Lawless, S{\'e}amus},
   booktitle={TREC},
   year={2016}
 }

Our paper is available here: https://trec.nist.gov/pubs/trec25/papers/ADAPT_TCD-CX.pdf

Each object in the JSON file has data from the Contextual Suggestion track (e.g. attraction ID, city, city location, etc.).

Each object contains a "foursquare" object that has all the data associated with the attraction on Foursquare including users' reviews.

User names and IDs are anonymised.

The dataset is available here:

https://drive.google.com/file/d/1rk8QxxR9GhxcHr7ZRXI8oD_5a6UijDW2/view?usp=sharing
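Once the file is downloaded, it can be read one object at a time. The following is a minimal sketch, not part of the original collection code. It assumes the file is either a single JSON array or has one JSON object per line (the default layout of a MongoDB export); this README does not state which. The function name `iter_attractions` and the file name used in the usage note are hypothetical.

```python
import json
from typing import Iterator


def iter_attractions(path: str) -> Iterator[dict]:
    """Yield attraction objects one by one from the downloaded file.

    Assumption: the file is either one JSON array or one JSON object
    per line. Adjust if the actual layout differs.
    """
    with open(path, encoding="utf-8") as f:
        # Look at the first non-whitespace character to guess the layout.
        first = f.read(1)
        while first and first.isspace():
            first = f.read(1)
        f.seek(0)

        if first == "[":
            # A single array: simple, but the whole file is loaded into memory.
            yield from json.load(f)
        else:
            # One object per line: only one object is held in memory at a time.
            for line in f:
                line = line.strip()
                if line:
                    yield json.loads(line)
```

For example, `sum(1 for _ in iter_attractions("foursquare_dataset.json"))` would count the objects in a file saved under that (hypothetical) name.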

A sample of an object in the dataset is as follows:

{"_id": {"$oid":"58907c200eb9c5a0205f634f"}, // a unique id assigned by MongoDB.
"id":"TRECCS-00652087-350", // Attraction's ID provided by TREC Contextual Suggestion track
"orig_url":"http://www.yelp.com/biz/norm-waitt-sr-ymca-south-sioux-city", // Attraction's URL provided by TREC Contextual Suggestion track
"title":"Norm Waitt Sr YMCA", // Attraction's title provided by TREC Contextual Suggestion track
"city_id":350, // Attraction's City ID in the TREC Contextual Suggestion track dataset
"city_name":"Sioux City", // Attraction's City name in the TREC Contextual Suggestion track dataset
"state":"IA", // Attraction's state in the TREC Contextual Suggestion track dataset
"lat":42.49999, // Latitude of the CITY (not the attraction) in the TREC Contextual Suggestion track dataset
"long":-96.40031, // Longitude of the CITY (not the attraction) in the TREC Contextual Suggestion track dataset
"foursquare":{ // This is the data we collected from Foursquare about this attraction
      "timestamp":{"$date":"2017-01-30T15:51:11.287Z"}, // The time when the data was collected
      "title":"Norm Waitt Sr. YMCA", //  Attraction's title in Foursquare
      "foursqaure_url":"https://foursquare.com/v/norm-waitt-sr-ymca/4c61d1ac4be3b7138adcd3e8", //  Attraction's URL in Foursquare. Note the last part of the URL is the attraction's ID in Foursquare
      "rating":8.7, // Attraction's rating
      "ratingCount":17, // Number of users who rated this attraction in Foursquare
      "reviewCount":3, // Number of users who wrote reviews of this attraction in Foursquare
      "photosCount":0, // Number of photos associated with this attraction
      "categories":["Gym / Fitness Center"], // Category of the attraction
      "openingTimes":[ // Open and close times
            {"days":"Mon–Fri","hours":["5:00 AM–10:00 PM"]},
            {"days":"Sat","hours":["6:00 AM–6:00 PM"]},
            {"days":"Sun","hours":["Noon–6:00 PM"]}],
      "venue_url":"nwsymca.org", // Original URL of the attraction
      "address":"601 Riverview DrSouth Sioux City, NE 68776United States",
      "description":"The NWSYMCA has been an essential part of Siouxland community for over 125 years. As a diverse, non-profit organization, all members, volunteers, staff \u0026 donors are joined together by the shared commitment youth development, healthy living and social responsibility.", // A description of the attraction in Foursquare
      "otherLocations":{"length":0,"locations":[]}, // Other locations of this attraction, if any
      "features":[ // Features this attraction has
            {"feature":"Credit Cards","value":"Yes"},
            {"feature":"Wi-Fi","value":"Free"}],
      "menus":[], // Dishes on the menu (if a restaurant or similar)
      "drinks":[], // Drinks this place offers (if a restaurant or similar)
      "tagCloudArray":[], // Prominent keywords extracted from the users' reviews
      "reviews":[ // Users' reviews
            {
            "userName":"2", // userName and userId have the same value: the anonymised ID
            "userId":"2",
            "tipDate":"January 10, 2016", // The date when this review was written
            "tipText":"Great classes for non-members. We really enjoy aqua tots with our son." // The review text
            },
            {
            "userName":"3",
            "userId":"3",
            "tipDate":"March 29, 2011",
            "tipText":"Need a new workout? Try a group class (power, step, ride, centergy) ... there's something for everyone."
            },
            .... // more reviews
            ]} // End of "foursquare" object
}
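
The fields in the sample above can be accessed as sketched below. This is an illustration only, under a few assumptions: the `//` comments in the sample are annotations for this README and are not present in the data; the key `foursqaure_url` is spelled exactly as in the sample; and `timestamp` and `tipDate` always follow the formats shown. The helper names are hypothetical.

```python
import json
from datetime import date, datetime

# A trimmed copy of the sample object, without the explanatory comments.
SAMPLE = json.loads("""
{
  "id": "TRECCS-00652087-350",
  "city_name": "Sioux City",
  "foursquare": {
    "timestamp": {"$date": "2017-01-30T15:51:11.287Z"},
    "foursqaure_url": "https://foursquare.com/v/norm-waitt-sr-ymca/4c61d1ac4be3b7138adcd3e8",
    "rating": 8.7,
    "reviews": [
      {"userId": "2", "tipDate": "January 10, 2016",
       "tipText": "Great classes for non-members. We really enjoy aqua tots with our son."},
      {"userId": "3", "tipDate": "March 29, 2011",
       "tipText": "Need a new workout? Try a group class (power, step, ride, centergy) ... there's something for everyone."}
    ]
  }
}
""")


def foursquare_venue_id(record: dict) -> str:
    """Return the last path segment of the Foursquare URL (the venue ID)."""
    url = record["foursquare"]["foursqaure_url"]  # key spelled as in the sample
    return url.rstrip("/").rsplit("/", 1)[-1]


def collected_at(record: dict) -> datetime:
    """Parse the MongoDB-style {"$date": ...} timestamp into a datetime."""
    raw = record["foursquare"]["timestamp"]["$date"]
    # Replace the trailing "Z" so that fromisoformat accepts it on older Pythons.
    return datetime.fromisoformat(raw.replace("Z", "+00:00"))


def review_dates(record: dict) -> list[date]:
    """Parse each review's tipDate, assuming the "Month D, YYYY" format."""
    return [
        datetime.strptime(r["tipDate"], "%B %d, %Y").date()
        for r in record["foursquare"]["reviews"]
    ]


print(foursquare_venue_id(SAMPLE))
print(collected_at(SAMPLE).date())
print(review_dates(SAMPLE))
```

Note that `lat` and `long` at the top level describe the city, not the attraction, so they should not be used as venue coordinates.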
