Data sources

UC Irvine Machine Learning Repository

Subject:
Computer Science:
  • Machine Learning
Keywords: machine learning, repository, open access, open science
Accessible to: Everybody, Aalto Students & Staff, Aalto users
Attention:
New website is under development.
A black ant-eater within a hexagon partly filled with yellow colour

Data source type

Open source: Repository

License

Open access

Analysis unit

Other

Geographical coverage

Global

Time period coverage

Varies with each dataset.

Frequency

Other

Format

Various formats

Description

The University of California Irvine Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms. The repository maintains about 600 datasets from various subjects, including e.g.,

  • Computer Science
  • Life Sciences
  • Physical Sciences
  • Business
  • Social Sciences
  • Games

You can search data based on characteristics (e.g. tabular, time-series, sequential, text) or the associated tasks the data is meant for (classification, regression, clustering). Note that a new version of the website is under development.

Distributor

University of California Irvine

Acknowledgement

UCI MRL gratefully acknowledges funding support from the National Science Foundation.

Cite as

Citation instructions are provided at each dataset in the repository.

Contact

Contact information is provided on the repository's website. You may also contact us at [email protected] for inquiries about using data from the repository.

Share
URL copied!