MiZhenxing comments

Results 8 comments of


                                            MiZhenxing

RuntimeError: The size of tensor a (31) must match the size of tensor b (32) at non-singleton dimension 3

The height and width of the original image should be divisible by 32 because the depth map is 1/4 size and the 3D cost network performs "stride=2" three times.

Why num_depth is (sample_interval*(2i+1)/2 + depth_min)

Hi, we use the mid-point of each bin as the sample depths. So `sample_interval * (i + i + 1) / 2.0` actually computes the mid-point of i-th bin.

Tutel with pytorch automatic mixed precision package

Thanks for your quick response. Sorry I didn't notice that example 🤣. I will do some testing on my code.

"forward_one_depth" and "forward_all_depth"

Hi, thank you for your question. `forward_one_depth` only computes the depth map of one binary tree depth. `forward_all_depth` computes the depth maps of all binary tree depths and gets the...

"forward_one_depth" and "forward_all_depth"

Hi, in our code we actually only support using model = "one" in training and only support model = "all" in testing. This is related to "Memory-efﬁcient Training." in Paper...

"forward_one_depth" and "forward_all_depth"

Hi, sorry to be late. In Figure 2, you can see an upsample operation from Stage 2 to Stage 3 (We use "stage" to refer binary depth in the paper....

How to use Gipuma

Hi, you can try the gipuma fusion in [CasMVSNet](https://github.com/alibaba/cascade-stereo/blob/master/CasMVSNet/gipuma.py)

Train on private dataset - CascadeStereo

The size of images should be divisible by 32. You could use fuctions like scale_mvs_input to change the size of images and also the intrisic parameters of your cameras.