Hierarchical-Localization Doppelgangers support

There is a repository, which aims to avoid incorrect image registration called 'Doppelgangers': link. Implementing it into the pipeline should be straightforward, since it is a binary classifier, also I am currently working on it.

My pipeline is the following:

netvlad
pairs from retrieval
feature extractor (superglue)
feature matcher (lightglue)
!!!! Doppelgangers for each feature pair generates probability that they are a correct match
for pairs with probability < thresh delete them from matches.hdf5
run sfm
??? profit

This issue is for mainly tracking pull request, as well, as maybe some help I will need along the way

Sep 19 '23 17:09 awarebayes

Yes we already looked into this - it looks very promising. After discussing with the author @RuojinCai and experiments by @veichta, it looks like the model might not generalize well to types of matches (especially sparse matches) that are different than it was trained on (dense LoFTR). If someone can train a new model for DISK/SP+LightGlue (or, better, that generalizes to any sparse matches), then I'd be happy to integrate it (and add it to COLMAP down the road).

For the integration, I see two alternate designs:

Create a new step that takes feature and match files and outputs a new match file from which incorrect image pairs are removed.

python -m hloc.remove_flase_pairs --features path/to/features.h5 --matches path/to/matches.h5 --output path/to/cleaned_matches.h5

Create a new step that adds a doppelganger_score attribute to each image pair group in the match file:

python -m hloc.add_doppelganger_score --features path/to/features.h5 --matches path/to/matches.h5

And skip importing pairs with doppelganger_score < threshold in hloc.triangulation.import_matches.

Design 2. makes it easier to run the reconstruction with different thresholds without running the doppelganger network again, but it is more invasive - so I'd go for 1. for now.

cc @Phil26AT

Sep 20 '23 08:09 sarlinpe

The doppelganger score is very weird, it's just logits of size (2,), and not probabilities. We can of course softmax them

Sep 20 '23 10:09 awarebayes

Here is what I ended up doing: https://github.com/awarebayes/doppelgangers-hloc#usage-with-hloc

I need some more data from hloc, in my matches.h5, I created a merge request: https://github.com/cvg/Hierarchical-Localization/pull/309

I tested it with lightglue, somewhat works. Interesting remarks about superpoint+lightglue.

As for the final stage, I overwrite matches.h5 directly and delele 'bad' pairs: https://github.com/awarebayes/doppelgangers-hloc/blob/main/doppelgangers/utils/overwrite_hloc.py#L8

I believe it is possible to train doppelgangers since the architecture is a very simple CNN, and dataset is public and not really large.

Sep 20 '23 10:09 awarebayes

And here is how I use HLOC matches and features to load the data: https://github.com/awarebayes/doppelgangers-hloc/blob/main/doppelgangers/datasets/hloc_dataset.py

Sep 20 '23 10:09 awarebayes

I will try to train doppelgangers with superpoint + lightglue today

Sep 20 '23 12:09 awarebayes