-
Catalogue Entry: Multi-aspect Reviews
These datasets include reviews with multiple rated dimensions. The most comprehensive of these are beer review datasets from Ratebeer and Beeradvocate, which include sensory...
-
Catalogue Entry: Modeling heart rate and activity data for personalized fitness recommendation
This is a collection of workout logs from users of EndoMondo. Data includes multiple sources of sequential sensor data such as heart rate logs, speed, GPS, as well as sport...
File available for download in the following formats:
- JSON
-
Catalogue Entry: Marketing Bias data
These datasets contain attributes about products sold on ModCloth and Amazon which may be sources of bias in recommendations (in particular, attributes about how the products...
File available for download in the following formats:
- CSV
-
Catalogue Entry: Learning to Discover Social Circles in Ego Networks
These datasets contain social connections and "circles" from Facebook, Twitter, and Google Plus.
-
Catalogue Entry: Homeless Shelter Use by Age and Community with Implications
The dataset illustrates three key results: that people experience sheltered homelessness in vastly different ways, that the experience varies by their age, and that these...
File available for download in the following formats:
-
Catalogue Entry: Google Restaurants
This is a multi-modal dataset of restaurants from Google Local (Google Maps). Data includes images and reviews posted by users, as well as other metadata for each restaurant.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Google Local Reviews (2021)
This dataset contains review information from Google Maps (ratings, text, images, etc.), business metadata (address, geographic info, descriptions, category information, price,...
File available for download in the following formats:
- JSON
-
Catalogue Entry: Clothing_Fit_Data
These datasets contain measurements of clothing fit from ModCloth and RentTheRunway.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Behance Community Art Data
Likes and image data from the community art website Behance. This is a small, anonymized version of a larger proprietary dataset.
-
Catalogue Entry: Amazon Question and Answer Data
This dataset contains Question and Answer data from Amazon, totaling around 1.4 million answered questions. This dataset can be combined with Amazon product review data,...
File available for download in the following formats:
- JSON
-
Catalogue Entry: Amazon Product Reviews
This is a large crawl of product reviews from Amazon. This dataset contains 82.83 million unique reviews, from around 20 million users.
File available for download in the following formats:
- JSON