UC Irvine Machine Learning Repository
Subject:
Computer Science:
- Machine Learning
Keywords:
machine learning, repository, open access, open science
Accessible to:
Everybody
Attention:
New website is under development

Data source type
Open source: Repository
License
Open access
Analysis unit
Other
Geographical coverage
Global
Time period coverage
Varies with each dataset.
Frequency
Other
Format
Various formats
Description
The University of California Irvine Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms.
The repository maintains about 600 datasets across various subject areas, including Computer Science, Life Sciences, Physical Sciences, Business, Social Sciences, Games and more. Datasets can be searched by data characteristics (e.g. tabular, time-series, sequential, text) or by the task they are intended for (e.g. classification, regression, clustering).
Note that a new version of the website is under development.
Distributor
University of California Irvine
Acknowledgement
UCI MLR gratefully acknowledges funding support from the National Science Foundation.
Cite as
Citation instructions are provided with each dataset in the repository.
Contact
Contact information is provided on the repository's website. You may also contact us at [email protected] for inquiries about using data from the repository.