VLAD

This is an extended VLAD implementation based on the original implementation by @jorjasso.

Major changes:

- Descriptor extraction (describe.py) is multi-threaded. Set the number of worker threads with the --threads argument for a roughly linear speedup.
- The visual dictionary is now built with MiniBatchKMeans instead of regular KMeans, which breaks down on datasets of more than a few thousand images.
  - I recommend a large --batch-size, such as 10,000. The number of descriptors is a few orders of magnitude greater than the number of images, so a batch size of 10k with 10k images is fine.
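
The threading change can be pictured as a pool of workers that each handle one image. The following is a minimal sketch, not the code in describe.py: `describe_image` and `describe_all` are hypothetical names, and the per-image function is a trivial stand-in for the real descriptor extraction.

```python
from concurrent.futures import ThreadPoolExecutor


def describe_image(path):
    # Hypothetical stand-in for per-image descriptor extraction;
    # a real version would load the image and run ORB, SIFT or SURF here.
    return path, len(path)


def describe_all(paths, threads=4):
    # Executor.map preserves input order, so results line up with `paths`.
    with ThreadPoolExecutor(max_workers=threads) as pool:
        return list(pool.map(describe_image, paths))


results = describe_all(["a.jpg", "bb.jpg", "ccc.jpg"], threads=2)
```

Threads only give a speedup when the per-image work releases the interpreter lock, which is assumed here for the native extraction code.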

Synopsis

Python implementation of VLAD for a CBIR system. Reference:

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez and C. Schmid, "Aggregating Local Image Descriptors into Compact Codes," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 9, pp. 1704-1716, Sept. 2012. doi: 10.1109/TPAMI.2011.235

Code Example

The following query (-q queries/0706.3046-img-1-22.jpg) retrieves the seven most similar images (-r 7) using ORB descriptors (-d ORB), a visual vocabulary of 16 words (-dV visualDictionary/visualDictionary16ORB.pickle), and a ball-tree index (-i ballTreeIndexes/index_ORB_W16.pickle):

python query.py  -q queries/0706.3046-img-1-22.jpg -r 7 -d ORB -dV visualDictionary/visualDictionary16ORB.pickle -i ballTreeIndexes/index_ORB_W16.pickle

You must first compute the descriptors, visual dictionaries, VLAD descriptors, and ball-tree indexes. See the section "Computing VLAD features for a new dataset" below for details.

More example queries:

SIFT

python query.py  -q queries/0706.3046-img-1-22.jpg -r 7 -d SIFT -dV visualDictionary/visualDictionary16SIFT.pickle -i ballTreeIndexes/index_SIFT_W16.pickle

python query.py  -q queries/1403.3290-img-5-14.jpg -r 10 -d SIFT -dV visualDictionary/visualDictionary64SIFT.pickle -i ballTreeIndexes/index_SIFT_W64.pickle   

python query.py  -q queries/0801.2442-img-2-21.jpg -r 3 -d SIFT -dV visualDictionary/visualDictionary256SIFT.pickle -i ballTreeIndexes/index_SIFT_W256.pickle

SURF

python query.py  -q queries/1409.1047-img-3-06.jpg -r 7 -d SURF -dV visualDictionary/visualDictionary256SURF.pickle -i ballTreeIndexes/index_SURF_W256.pickle

python query.py  -q queries/0903.1780-img-1-32.jpg -r 7 -d SURF -dV visualDictionary/visualDictionary256SURF.pickle -i ballTreeIndexes/index_SURF_W256.pickle

python query.py  -q queries/1409.1047-img-3-06.jpg -r 7 -d SURF -dV visualDictionary/visualDictionary16SURF.pickle -i ballTreeIndexes/index_SURF_W16.pickle

ORB

python query.py  -q queries/0706.3046-img-1-22.jpg -r 7 -d ORB -dV visualDictionary/visualDictionary16ORB.pickle -i ballTreeIndexes/index_ORB_W16.pickle

python query.py  -q queries/1506.05863-img-3-21.jpg -r 7 -d ORB -dV visualDictionary/visualDictionary16ORB.pickle -i ballTreeIndexes/index_ORB_W16.pickle

Computing VLAD features for a new dataset

Example: VLAD with ORB descriptors, a visual dictionary of 2 visual words, and a ball tree as the index. (2 visual words is too few to be useful; try 16, 32, 64, or 256 instead.)

Remark: create these folders before running the scripts: ballTreeIndexes/, descriptors/, visualDictionary/, VLADdescriptors/
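
The folders from the remark above can also be created from Python. This is a small sketch that assumes the folders live relative to the repository root; `make_output_dirs` is a hypothetical helper, not part of this project.

```python
from pathlib import Path

# Folder names taken from the remark above; assumed relative to the repo root.
OUTPUT_DIRS = ["ballTreeIndexes", "descriptors", "visualDictionary", "VLADdescriptors"]


def make_output_dirs(root="."):
    created = []
    for name in OUTPUT_DIRS:
        path = Path(root) / name
        # exist_ok=True makes the call safe to repeat.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Calling `make_output_dirs()` from the repository root creates any folders that are missing and leaves existing ones untouched.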

Compute descriptors from a dataset. The supported descriptors are ORB, SIFT, and SURF:

python describe.py --dataset dataset --descriptor descriptorName --output output

*Example :

python describe.py --dataset dataset --descriptor ORB --output descriptors/descriptorORB

Construct a visual dictionary from the descriptors in path -d, with -w visual words:

python visualDictionary.py  -d descriptorPath -w numberOfVisualWords -o output

*Example :

python visualDictionary.py -d descriptors/descriptorORB.pickle  -w 2 -o visualDictionary/visualDictionary2ORB
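
To illustrate what building the dictionary involves, here is a minimal sketch using scikit-learn's `MiniBatchKMeans`. It is not the code in visualDictionary.py: the descriptors are synthetic random data, `build_dictionary` is a hypothetical name, and the parameter values are chosen for the toy example only.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans

# Synthetic stand-in for stacked local descriptors (rows = descriptors).
rng = np.random.default_rng(0)
descriptors = rng.random((5000, 32)).astype(np.float32)


def build_dictionary(descriptors, n_words, batch_size=10000):
    # n_init is set explicitly so behavior does not depend on the
    # default of the installed scikit-learn version.
    kmeans = MiniBatchKMeans(n_clusters=n_words, batch_size=batch_size,
                             n_init=3, random_state=0)
    kmeans.fit(descriptors)
    return kmeans


# A smaller batch size than the recommended 10,000 suits this toy data.
dictionary = build_dictionary(descriptors, n_words=16, batch_size=1000)
```

Each row of `dictionary.cluster_centers_` is one visual word, so the array has one row per word and one column per descriptor dimension.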

Compute VLAD descriptors from the visual dictionary:

python vladDescriptors.py  -d dataset -dV visualDictionaryPath --descriptor descriptorName -o output

*Example :

python vladDescriptors.py  -d dataset -dV visualDictionary/visualDictionary2ORB.pickle --descriptor ORB -o VLADdescriptors/VLAD_ORB_W2
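
The core of VLAD is to assign each local descriptor to its nearest visual word and sum the residuals per word. The sketch below shows that idea in NumPy under stated assumptions: `vlad` is a hypothetical function, and the final L2 normalization is an assumed choice that may differ from what vladDescriptors.py does.

```python
import numpy as np


def vlad(descriptors, centers):
    # Assign every descriptor to its nearest visual word (Euclidean distance).
    dists = np.linalg.norm(descriptors[:, None, :] - centers[None, :, :], axis=2)
    nearest = dists.argmin(axis=1)

    # Sum the residuals (descriptor minus center) for each visual word.
    k, d = centers.shape
    v = np.zeros((k, d), dtype=np.float64)
    for word in range(k):
        members = descriptors[nearest == word]
        if len(members):
            v[word] = (members - centers[word]).sum(axis=0)

    # Flatten to a single k*d vector and L2-normalize (assumed normalization).
    v = v.ravel()
    norm = np.linalg.norm(v)
    return v / norm if norm > 0 else v


centers = np.array([[0.0, 0.0], [10.0, 10.0]])
descriptors = np.array([[1.0, 0.0], [0.0, 1.0], [9.0, 10.0]])
code = vlad(descriptors, centers)
```

With k visual words and d-dimensional descriptors, the result always has k*d entries regardless of how many descriptors the image produced, which is what makes the vectors comparable across images.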

Build an index from the VLAD descriptors using a ball-tree data structure:

python indexBallTree.py  -d VLADdescriptorPath -l leafSize -o output

*Example :

python indexBallTree.py  -d VLADdescriptors/VLAD_ORB_W2.pickle -l 40 -o ballTreeIndexes/index_ORB_W2
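
As a rough picture of the indexing step, the sketch below builds a scikit-learn `BallTree` over synthetic vectors and runs one nearest-neighbor query. It is an illustration only, not the code in indexBallTree.py; the random data stands in for real VLAD vectors.

```python
import numpy as np
from sklearn.neighbors import BallTree

# Synthetic stand-in for one VLAD vector per image (rows = images).
rng = np.random.default_rng(0)
vlad_vectors = rng.random((100, 64))

# leaf_size mirrors the -l argument; it affects build and query speed,
# not the results of a query.
tree = BallTree(vlad_vectors, leaf_size=40)

# Retrieve the 7 nearest images to the first vector, closest first.
distances, indices = tree.query(vlad_vectors[:1], k=7)
```

Because the query vector is itself in the index, its own row comes back first with distance zero.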

Query:

python query.py  --query image --descriptor descriptor -dV visualDictionaryPath --index indexTree --retrieve retrieve

*Example :

```shell
python query.py  -q queries/0706.3046-img-1-22.jpg -r 11 -d ORB -dV visualDictionary/visualDictionary2ORB.pickle -i ballTreeIndexes/index_ORB_W2.pickle
```

Motivation

This project is part of the pipeline of the DOCRetrieval project.

Installation

First install conda, then:

conda create --name openCV-Python numpy scipy scikit-learn matplotlib python=3
source activate openCV-Python
conda install -c menpo opencv3=3.1.0

Contributor

[email protected]
