Yelp Open Dataset

An all-purpose dataset for learning

The Yelp dataset is a subset of our businesses, reviews, and user data for personal, educational, and academic use. It is available as JSON files; use it to teach students about databases, to learn NLP, or as sample production data while you learn how to make mobile apps.

The Dataset


  • 5,996,996 reviews
  • 188,593 businesses
  • 280,992 pictures
  • 10 metropolitan areas
  • 1,185,348 tips by 1,518,169 users
  • Over 1.4 million business attributes like hours, parking, availability, and ambience
  • Aggregated check-ins over time for each of the 188,593 businesses

Get Started

Visit the documentation for information on the structure of the dataset and how to get started.
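
As a rough starting point before reading the documentation, here is a minimal Python sketch for reading one of the JSON files. It assumes the file holds one JSON object per line (newline-delimited JSON); check the documentation for the actual layout. The helper names `iter_records` and `count_by_field` are hypothetical and not part of the dataset or of any Yelp tooling.

```python
import json
from collections import Counter
from pathlib import Path
from typing import Iterator


def iter_records(path: str) -> Iterator[dict]:
    """Yield one dict per non-empty line of a newline-delimited JSON file.

    Assumption: each line is a complete JSON object. Reading line by line
    keeps memory use flat even for files with millions of records.
    """
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


def count_by_field(path: str, field: str) -> Counter:
    """Count how often each value of `field` appears across all records.

    Records that lack the field are counted under the key None.
    """
    return Counter(record.get(field) for record in iter_records(path))
```

For example, `count_by_field("reviews.json", "stars")` would tally reviews per star rating, assuming a file named `reviews.json` whose records carry a `stars` field. Both names are placeholders; substitute the file and field names given in the documentation.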

The Dataset Challenge

If you're a student, we have just the thing for you. Ever had a theory about how food trends start, or who the trendsetters are? We run a competition that gives you the chance to explore our dataset in depth and win some money too! For more information, visit our dataset challenge page.