Curated Collections
High-quality datasets tailored to industry needs.
Home
About
Courses
Datasets
Blog
Policy
Contact
AWS Public Datasets
- Datasets available through Amazon Web Services.
Awesome Public Datasets on GitHub
- A curated list of public datasets.
CERN Open Data Portal
- Data from particle physics experiments.
Data.world
- Collaborative data community with diverse datasets.
Enigma Public
- Public data from governments, companies, and organizations.
Figshare
- Platform for sharing research datasets across disciplines.
FiveThirtyEight
- Data and code behind their articles.
Google Cloud Public Datasets
- Hosted datasets for use with Google Cloud.
Harvard Dataverse
- A repository for research data.
Kaggle Competitions
- Datasets from data science competitions.
Microsoft Azure Open Datasets
- Datasets hosted on Azure, covering various domains.
Papers with Code Datasets
- Datasets tied to machine learning research papers.
Zenodo
- Open repository for research datasets, hosted by CERN.
Specialized Collections
Dryad
- Curated medical and biological research data.
EIA Open Data
- Electricity and power generation data from the U.S. Energy Information Administration.
OpenML
- Machine learning datasets and experiments.