Dongxu
We currently use the default AdamW parameters: https://github.com/salesforce/LAVIS/blob/main/lavis/runners/runner_base.py#L103. This should be made more flexible. It would also be good to have a standalone optimizer class, with an associated registry.
Currently, when using the iteration-based runner, checkpointing always happens after each inner epoch, coupling it to the number of iterations per inner epoch. Ideally, the frequency of checkpointing, logging, and lr scheduling should be configurable independently of the inner-epoch length.
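The decoupling could be sketched like this, with `ckpt_freq` and `log_freq` as hypothetical config keys (they do not exist in the current runner) counted in iterations rather than inner epochs:

```python
# Sketch: drive checkpointing/logging off their own iteration counters,
# independent of how many iterations make up an inner epoch.
def train_loop(total_iters, ckpt_freq, log_freq, step_fn, save_fn, log_fn):
    for it in range(1, total_iters + 1):
        step_fn(it)
        if it % log_freq == 0:
            log_fn(it)       # log on its own schedule
        if it % ckpt_freq == 0:
            save_fn(it)      # checkpoint on its own schedule
```

With this shape, changing `iters_per_inner_epoch` no longer silently changes how often checkpoints are written.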
For partial model training, e.g. BLIP2, it is not necessary to checkpoint the entire model; saving the trainable weights is sufficient. This is currently not supported by the runner.
ViT (when not finetuned) and LLM weights are not stored in the current BLIP2 checkpoints. The default load_checkpoint() in BaseModel raises a warning on these missing keys, which causes confusion. Proposed solution...
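One possible shape for the fix, sketched under the assumption that each model can declare which key prefixes are intentionally absent from its checkpoints (`expected_missing_prefixes` is a hypothetical attribute, and the prefix strings are illustrative):

```python
import logging

def report_missing(missing_keys,
                   expected_missing_prefixes=("visual_encoder.", "llm_model.")):
    """Warn only about missing keys that are NOT expected to be absent.

    Keys under the expected prefixes (e.g. frozen ViT/LLM towers omitted
    from a partial checkpoint) are silently ignored.
    """
    unexpected = [k for k in missing_keys
                  if not k.startswith(tuple(expected_missing_prefixes))]
    if unexpected:
        logging.warning("Missing keys in checkpoint: %s", unexpected)
    return unexpected
```

load_checkpoint() could route the `missing_keys` returned by `load_state_dict(..., strict=False)` through such a filter, so users only see warnings for genuinely unexpected gaps.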