Classical machine learning and statistics datasets from the UCI Machine Learning Repository and other sources.
The datasets package defines two different kinds of datasets:
file-embed
) embeddedin the package as pure values and do not require network or IO to download the data set. This includes Iris, Anscombe and OldFaithful.
Numeric.Datasets.getDataset
and are cached in a local temporary directory.
> import Numeric.Datasets (getDataset) > import Numeric.Datasets.Iris (iris) > import Numeric.Datasets.Abalone (abalone) > > main = do > -- The Iris data set is embedded > print (length iris) > print (head iris) > -- The Abalone dataset is fetched > abas <- getDataset abalone > print (length abas) > print (head abas).