Datasets

Discover and download high-quality datasets for your AI/ML projects

8 Datasets

Featured Datasets

ImageNet-1K
Computer Vision / Images
Large-scale image classification dataset with 1.2M images across 1,000 categories
Tags: classification, computer-vision, benchmark
Size: 150 GB
Downloads: 2.3M
Rating: 4.9/5
Updated: 1/15/2024

COCO 2017
Computer Vision / Images
Object detection, segmentation, and captioning dataset with 330K images
Tags: object-detection, segmentation, captioning
Size: 25 GB
Downloads: 1.8M
Rating: 4.8/5
Updated: 2/1/2024

All Datasets (8)

Common Crawl
Natural Language / Text
Web crawl data for training large language models and NLP research
Tags: nlp, language-modeling, web-data
Size: 2.5 TB
Downloads: 890K
Rating: 4.7/5
Updated: 1/30/2024

OpenWebText
Natural Language / Text
Open-source recreation of GPT-2's training dataset with 40 GB of text
Tags: gpt, language-modeling, text-generation
Size: 40 GB
Downloads: 1.2M
Rating: 4.6/5
Updated: 1/20/2024

Kaggle House Prices
Tabular / CSV
Regression dataset for predicting house prices with 79 features
Tags: regression, real-estate, beginner-friendly
Size: 460 KB
Downloads: 3.1M
Rating: 4.5/5
Updated: 2/10/2024

MNIST Handwritten Digits
Computer Vision / Images
Classic dataset of 70,000 handwritten digits for image classification
Tags: classification, digits, beginner-friendly
Size: 11 MB
Downloads: 5.2M
Rating: 4.4/5
Updated: 1/5/2024

Stanford Sentiment Treebank
Natural Language / Text
Fine-grained sentiment analysis dataset with 215,154 phrases
Tags: sentiment-analysis, nlp, classification
Size: 2.3 MB
Downloads: 780K
Rating: 4.3/5
Updated: 1/12/2024

Cityscapes
Computer Vision / Images
Urban scene understanding dataset with pixel-level annotations
Tags: segmentation, autonomous-driving, urban
Size: 11 GB
Downloads: 420K
Rating: 4.7/5
Updated: 2/5/2024