dsfo icon indicating copy to clipboard operation
dsfo copied to clipboard

Datasets for OCaml

Build Status

TL;DR: Download (anyhow) and interact (ocaml) with common machine learning datasets.

  • MNIST

    # Mnist.download `Train ;;
    - : unit = ()
    # Mnist.download `Test ;;
    - : unit = ()
    # let train = Mnist.data `Train ;;
    val train : (float, Bigarrayo.float64_elt, Bigarrayo.fortran_layout) Bigarrayo.A2.t =  ...
    # let test = Mnist.data `Test ;;
    val test : (float, Bigarrayo.float64_elt, Bigarrayo.fortran_layout) Bigarrayo.A2.t =  ...
    
  • CIFAR10

# let t3 = Cifar10.data (`Train 3) ;;
# Graphics.open_graph "" ;;
# let display n =
    let tr, label = Cifar10.decode t3 n in
    Vcifar10.draw_and_inspect ~zoom:3 ~label (Vcifar10.aligned_colors tr) ;;

This is a small library to simplify processing and interacting with a few common machine learning data sets. The code will call out to utilities such as curl to do the work.