-
Catalogue Entry: Trucking industry, employment statistics, by province and territory
This table contains 56 series, with data for the years 2009-2010 (not all combinations necessarily have data for all years), and was last released on 2015-02-06. This table...
File available for download in the following formats:
- CSV
-
Catalogue Entry: Video Game Data
Step charts from the video game Dance Dance Revolution, and audio files from the NES platform.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Understanding the interplay between titles, content, and communities in social media
Reddit post submissions (in particular, resubmissions of the same content), along with metadata.
File available for download in the following formats:
- CSV
-
Catalogue Entry: Steam Video Game and Bundle Data
These datasets contain reviews from the Steam video game platform, and information about which games were bundled together.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Speech Recognition and Multi-Speaker Diarization of Long Conversations
This dataset contains program transcripts from This American Life. Data includes full program transcripts and associated audio.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Social Recommendation Data
These datasets include ratings as well as social (or trust) relationships between users. Data are from LibraryThing (a book review website) and Epinions (general consumer reviews).
-
Catalogue Entry: Pinterest Fashion Compatibility
This dataset contains images (scenes) of fashion products, labeled with bounding boxes and links to the corresponding products.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Interview: Large-scale Modeling of Media Dialog with Discourse Patterns and...
This dataset contains interview transcripts from National Public Radio (NPR). Data includes full interview transcripts and news article headlines.
File available for download in the following formats:
- CSV
-
Catalogue Entry: Google Local Reviews (2018)
These datasets contain reviews about businesses from Google Local (Google Maps). Data includes geographic information for each business as well as reviews.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Goodreads Spoilers
These datasets contain reviews from the Goodreads book review website, along with annotated "spoiler" information from each review.
File available for download in the following formats:
- JSON
-
Catalogue Entry: Goodreads Book Reviews
These datasets contain reviews from the Goodreads book review website, and a variety of attributes describing the items. Critically, these datasets have multiple levels of user...
File available for download in the following formats:
- JSON
-
Catalogue Entry: Generating Personalized Recipes from Historical User Preferences
These datasets contain recipe details and reviews from Food.com (formerly GeniusKitchen). Data includes cooking recipes and review texts.
File available for download in the following formats:
- CSV
-
Catalogue Entry: DogWhistle: Cant Understanding Data
DogWhistle is a Chinese dataset collected from the historical records of an online game. It provides hidden words and the cant for them, with human answers. The dataset is...
File available for download in the following formats:
- CSV