vertical stitching ?
Is it possible to stitch images vertically like those ?

You should be. Try with
stitch * --wave_correct_kind vert
or
stitch * --wave_correct_kind no
You should be. Try with
stitch * --wave_correct_kind vertor
stitch * --wave_correct_kind no
Both produced this (vert just turned it horizontally) (sorry for the wrong title :) ). The third image is missing

I have got a result using
stitch * --confidence_threshold 0 --wave_correct_kind vert --detector sift
Apperently the matching confidences between the first and the other images are not that good. And with the default detector orb the stitching would fail. sift seems to be better for your use case.
Feel free to follow the tutorial for more insights!

--confidence_threshold 0 --wave_correct_kind vert --detector sift
This works way better ! But should all the frames have to be matched to the first one ? I'm trying to create something like a map from the drone footage and most of the frames will have nothing to do with the first one. Should I stitch the next frame to the result of the previous for example ? Thank you for your time !
I'd rather be interested in your dataset. Is it a grid and you know the relation of the Images? E.g. geographic bounding boxes? How many Images are there? Do you know for sure that the Images overlap?
I'd rather be interested in your dataset. Is it a grid and you know the relation of the Images? E.g. geographic bounding boxes? How many Images are there? Do you know for sure that the Images overlap?
I'm testing it with a random 4k (downsized to 1024x768) drone video by taking snapshots and assuring overlap with the previous frame. In reality (when I have a drone) there would be GPS data too and by using it you can approximate the position of one image to another. But that would require a slightly different algorithm for stitch I guess (if you include the GPS data) ?
PS: I tested it with 12 images and it still works very well.

What I could Imagine is that if you have a really big amount of Images it would be good to stitch some parts of the scene seperately and then stitching those results together into a final Panorama.
What I could Imagine is that if you have a really big amount of Images it would be good to stitch some parts of the scene seperately and then stitching those results together into a final Panorama.
Is there a way to hint the algorithm where to search for the matching features ? Like if you have a map of like 10,000 x 10,000 images to tell it to search from 9000x0 to 10,000x768 and to match it with the new image or it's lot a more complicated than that ?
no I think you will be fine like having a grid of images and first stitch the "upper left" images, then "upper right"... and after that you can stitch the 4 results together, you just need to have enough overlap.
x x x x