SegFormer
Use of different optimizers in SegFormer
Keeping all other parameters unchanged, if I use SGD instead of the AdamW optimizer for a SegFormer model (modified with a Swin backbone), the results are very poor. With AdamW I get decent results. Does SegFormer only work with AdamW?
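To make the comparison concrete, here is a minimal sketch of what "changing only the optimizer" means. The helper `make_optimizer_cfg` and every numeric value are hypothetical placeholders for illustration, not my actual settings and not any library's API.

```python
# Hypothetical sketch: plain dicts, not tied to any framework's config format.
def make_optimizer_cfg(opt_type, lr, weight_decay):
    """Describe an optimizer setup; only the optimizer type differs between runs."""
    cfg = {"type": opt_type, "lr": lr, "weight_decay": weight_decay}
    if opt_type == "SGD":
        # Assumed momentum value; SGD needs it, AdamW does not use it.
        cfg["momentum"] = 0.9
    return cfg

# Placeholder values: the same learning rate and weight decay in both runs.
adamw_cfg = make_optimizer_cfg("AdamW", lr=6e-5, weight_decay=0.01)
sgd_cfg = make_optimizer_cfg("SGD", lr=6e-5, weight_decay=0.01)
```

Note that in a setup like this the learning rate carries over unchanged from AdamW to SGD, which may be part of what I should be revisiting.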