Package: VISET: Computer Vision Datasets  
Author: Jeffrey Byrne <jeff@visym.com>  
URL: https://bitbucket.org/visym/viset/

VISET is a python package for creating computer vision datasets.
VISET generates redistributable HDF5 files for sharing datasets,
and provides a common python programming interface for 
downloading and caching datasets for typical evaluation tasks.

VISET supports iterating over training and testing splits in a pythonic way:

dbfile = viset.caltech.Caltech101().export()    # create database file (once)
db = viset.dataset.Viset(dbfile, split='2fold')  # download dataset
for (image,annotation) in db['train']:                # iterate over training set
    ...train classifier

VISET is useful for sharing large annotated datasets defined by URLs
and large image archive files.  Researchers can post their VISET dataset
file (e.g. caltech101.h5) files online to share an annotated dataset for
easy reuse.

VISET currently exports:

1. ImageNet-Fall2011  
2. Caltech101  
3. Caltech256  
4. ETHZ shapes
5. ETHZ extended shapes
6. MNIST
7. LabelMe3 
8. Weizmann Horses (single scale)
9. Weizmann Horses (multiscale scale)
10. Pascal VOC 2012 
... more coming!

See demo_viset.py for an example of usage.

Try it:

sh> pip install viset

Inspired by the excellent work at:

1. http://jaberg.github.io/skdata/
2. http://mldata.org




