Awesome List of Dataset
Dataset
Art Dataset
Dataset
- The Open-Source Movement Comes to Medical Datasets
- Mozilla Foundation - Mozilla Common Voice Adds 16 New Languages and 4,600 New Hours of Speech
Drug Dataset
Dataset Zoo
- Deeplite/deeplite-torch-zoo Pytorch
Dataset
Dataset
Dataset
Dataset
- google-research-datasets/wit: WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
- PUBLIC DATA: 2021 AI Index Report - Google Drive
Dataset Tools
- Scale AI: The Data Platform for AI : High quality training and validation data for AI applications
- Aquarium - Data Management For ML : ML data management platform
- Labelbox: The leading training data platform for data labeling : Save time by creating and managing your training data, people, and processes in a single place
Cell Tower Dataset
- Cellular Tower and Signal Map
- OpenCelliD - Largest Open Database of Cell Towers & Geolocation - by Unwired Labs
Twitter Dataset
Dataset
- EleutherAI EleutherAI is a grassroots AI research group aimed at democratizing and open sourcing AI research.
- The Pile : The Pile is a 825 GiB diverse, open source language modelling data set that consists of 22 smaller, high-quality datasets combined together.