UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
I use upernet_mae_swin_tiny_256_mask...py, but when I load checkpoint-99-model.pth as the pretrained backbone, it reports: mmseg - WARNING - The model and loaded state dict do not match exactly...
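That warning usually appears when the pretraining checkpoint still carries decoder weights and key prefixes that the mmseg backbone does not expect. Below is a minimal conversion sketch, assuming a standard MAE-style checkpoint layout; the key names and the prefix filtering are assumptions for illustration, not taken from the repo:

```python
# Hypothetical sketch: keep only encoder weights from a MAE pretraining
# checkpoint and drop prefixes so mmseg can match the backbone keys.
import torch

ckpt = torch.load("checkpoint-99-model.pth", map_location="cpu")
state_dict = ckpt.get("model", ckpt)          # some checkpoints nest under "model"

backbone_state = {}
for k, v in state_dict.items():
    # drop decoder / mask-token weights the segmentation backbone never uses
    if k.startswith("decoder") or k.startswith("mask_token"):
        continue
    backbone_state[k.replace("module.", "")] = v

torch.save(backbone_state, "swin_tiny_backbone.pth")
```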
Sorry to bother you again. I have downloaded your fine-tuning weights and tested the detection part, but the following error appears: RuntimeError: GFL: FPN: Default process group...
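The truncated error looks like the usual "Default process group has not been initialized" failure that shows up when a config with SyncBN layers is run in a single, non-distributed process. A minimal workaround sketch, assuming single-process inference (this is a generic PyTorch workaround, not the repo's official fix):

```python
# Hypothetical single-process setup so SyncBN layers find a process group.
import torch.distributed as dist

if not dist.is_initialized():
    dist.init_process_group(
        backend="gloo",
        init_method="tcp://127.0.0.1:29500",
        rank=0,
        world_size=1,
    )
```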
Hi @implus, thanks for the great work and the released code! I have checked the `configs` in both `DET` and `SEG`, but found there are no configs for `ViT`, which is...
For faster training, I want to use fp16 for the Swin Tiny MAE pretraining. If you have run experiments with fp16, I would like to know your configuration, such as grad...
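If the repo ships no fp16 config, a generic torch.cuda.amp loop is one way to try it. A minimal sketch, assuming a standard PyTorch training loop; `model`, `loader`, and `optimizer` are placeholders, not the repo's objects, and the clipping value is only an example:

```python
# Hypothetical mixed-precision epoch with gradient clipping via GradScaler.
import torch

def train_one_epoch_fp16(model, loader, optimizer, device="cuda", clip_norm=5.0):
    scaler = torch.cuda.amp.GradScaler()
    model.train()
    for images, mask in loader:
        images = images.to(device, non_blocking=True)
        optimizer.zero_grad()
        with torch.cuda.amp.autocast():
            loss = model(images, mask)                     # pretraining loss
        scaler.scale(loss).backward()
        scaler.unscale_(optimizer)                         # unscale before clipping
        torch.nn.utils.clip_grad_norm_(model.parameters(), clip_norm)
        scaler.step(optimizer)
        scaler.update()
```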
Thank you for the code! I'm pretraining Swin Tiny on my own dataset (around 799K training samples). The training loss is decreasing slowly and reached around 0.115 at epoch 100. Can you please...
Hi @implus, could you kindly provide the full checkpoints (including the decoder) of Swin-T and PVT-S? Many thanks!
Why are the visible patches in the decoder simply concatenated with the mask tokens for the invisible patches, rather than being placed back at their original positions? There seems to be a problem with this section...
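For reference, in the original MAE decoder the concatenation is followed by a gather with `ids_restore`, which puts every token back at its original spatial position. The sketch below mirrors that standard mechanism; it is an assumption about what UM-MAE does, not a quote from its code:

```python
# Hypothetical MAE-style token unshuffle: append mask tokens, then gather
# with ids_restore so each token returns to its original position.
import torch

def restore_token_order(x_visible, mask_token, ids_restore):
    # x_visible: [B, N_vis, D], mask_token: [1, 1, D], ids_restore: [B, N]
    B, N_vis, D = x_visible.shape
    N = ids_restore.shape[1]
    mask_tokens = mask_token.expand(B, N - N_vis, D)
    x = torch.cat([x_visible, mask_tokens], dim=1)                    # [B, N, D]
    x = torch.gather(x, 1, ids_restore.unsqueeze(-1).repeat(1, 1, D))  # unshuffle
    return x
```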
In the MAE paper, Kaiming He notes that if ViT-MAE sees the mask token during training but cannot see it during testing, an inconsistency exists, which results in worse accuracy. But...
Hi, thanks for your code. There may be a bug in [this line](https://github.com/implus/UM-MAE/blob/main/mask_transform.py#L69). I tried your code on my own tasks, where the image shape is [382, 382], but args.token_size=16 means self.num_patches=256,...
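One way to avoid that mismatch is to derive the mask grid from the actual input size instead of hard-coding it. The helper below is an illustrative assumption (patch size 16, square inputs), not the repo's code:

```python
# Hypothetical helper: compute the mask grid side length from the input size.
def mask_grid_size(input_size: int, patch_size: int = 16) -> int:
    assert input_size % patch_size == 0, "input must be divisible by patch size"
    return input_size // patch_size      # e.g. 256 -> 16 tokens per side (256 patches)

print(mask_grid_size(256))  # 16
```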