My Favorite Data Sets for ML Use
- Enron Emails: https://data.world/brianray/enron-email-dataset
- Registry of Open Data on AWS: https://registry.opendata.aws/
- EEG Data:
  - EEG Motor Movement / Imagery Dataset: https://physionet.org/pn4/eegmmidb/
- MRI Image Data:
  - http://www.dspace.cam.ac.uk/bitstream/1810/243410/47/wctM1517.nii
    - Metadata for this file:
      - Filename: wctM1517.nii
      - BrainID: M1517
      - FileType: CorticalThickness
      - BrainType: Exskull
      - ModelType: R6/2
      - CAG: 250
      - Age: 18
      - Sex: M
- EHR Dataset:
- NFL Dataset analysis:
- FIMI Datasets:
- Retail and other interesting datasets: IBM’s Data Generator
- Quandl: The data source for financial, economic and alternative datasets
- Data.gov
- 1000 Genomes data:
- Political TV Ad Dataset:
- Open Secrets Dataset: Center for Responsive Politics
- Fisheye Dataset:
- Internet Archive: Many social and community-related datasets
- Heart Rate Database:
- UC Irvine Machine Learning Repository:
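If you want the MRI file's metadata from the list above in machine-readable form, a minimal Python sketch might look like this. The function name `parse_metadata` and the choice to treat `CAG` and `Age` as integers are assumptions made for illustration; the values themselves are copied from the list.

```python
from typing import Dict

# Metadata bullets copied from the MRI entry in the list above.
METADATA_TEXT = """\
- Filename: wctM1517.nii
- BrainID: M1517
- FileType: CorticalThickness
- BrainType: Exskull
- ModelType: R6/2
- CAG: 250
- Age: 18
- Sex: M
"""

# Assumption: these two fields are numeric; everything else stays a string.
INT_FIELDS = {"CAG", "Age"}


def parse_metadata(text: str) -> Dict[str, object]:
    """Turn '- Key: Value' lines into a dictionary (hypothetical helper)."""
    record: Dict[str, object] = {}
    for line in text.splitlines():
        # Drop the leading bullet marker and surrounding whitespace.
        line = line.strip().lstrip("-").strip()
        if not line or ":" not in line:
            continue
        # Split on the first colon only, so values may contain colons.
        key, value = line.split(":", 1)
        key, value = key.strip(), value.strip()
        record[key] = int(value) if key in INT_FIELDS else value
    return record


record = parse_metadata(METADATA_TEXT)
print(record["BrainID"], record["CAG"])
```

A plain dictionary keeps the sketch dependency-free; the same record could be passed on to whatever tool is used to load the `.nii` file itself.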
Helpful EEG Dataset Repositories
Examples of online EEG data repositories.
- Health Statistics and Data:
- Predict Site for EEG:
- Resting State Data:
- European Datasets
- DataLad datasets
- Mike Cohen – Data sets