Rohan Varma
Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom):
* __->__ #83055
* #83035
* #82892

NamedTuple support is blocking MultiModal adoption. TODO: add test.
### 🐛 Describe the bug

Sometimes, `_post_backward_hook` will not fire if gradients were not accumulated on the FSDP-managed parameter, such as if all parameters in an FSDP module were...
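Below is a minimal repro sketch of that scenario, assuming a single-rank `gloo` group on CPU and a nested FSDP unit whose output is discarded; the module and variable names are illustrative, not taken from the original report.

```python
# Hypothetical repro sketch (assumption: a single-process "gloo" group on CPU is
# enough to exercise the post-backward path); names are illustrative.
import os

import torch
import torch.distributed as dist
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
dist.init_process_group("gloo", rank=0, world_size=1)


class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.used = nn.Linear(8, 8)
        # Every parameter in this inner FSDP unit ends up with no gradient.
        self.unused = FSDP(nn.Linear(8, 8))

    def forward(self, x):
        self.unused(x)  # output discarded, so no gradient reaches `unused`
        return self.used(x)


model = FSDP(Model())
model(torch.randn(4, 8)).sum().backward()
# Expectation per the report: post-backward handling for `unused`'s parameters
# may not run, since its gradients were never accumulated.
dist.destroy_process_group()
```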
i.e. different clients can train different models
### 🐛 Describe the bug

```
class M(nn.Module):
    def __init__(self):
        super().__init__()
        self.a = nn.Linear(10, 10)
        self.b = nn.Linear(10, 10)

    def forward(self, x):
        a = self.a(x)
        b = self.b(x)
        return (a,...
```
### 🚀 The feature

Add tests to ensure the right sharded grad scaler, no_sync ctx manager, etc. are picked up when using composable FSDP (a rough sketch follows below).

### Motivation, pitch

....
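A sketch of what such a test could look like, covering only the grad-scaler part of the idea: `get_grad_scaler` here is a hypothetical helper standing in for whatever selection logic the composable-FSDP path actually uses, so the real utility and its signature may differ.

```python
# Sketch only: `get_grad_scaler` is a hypothetical stand-in for the real selection
# logic; the point is just to pin down which scaler each mode gets.
import torch
from torch.distributed.fsdp.sharded_grad_scaler import ShardedGradScaler


def get_grad_scaler(use_fsdp: bool) -> torch.cuda.amp.GradScaler:
    # Hypothetical selection logic: sharded scaler under FSDP, plain scaler otherwise.
    return ShardedGradScaler() if use_fsdp else torch.cuda.amp.GradScaler()


def test_grad_scaler_selection() -> None:
    assert isinstance(get_grad_scaler(use_fsdp=True), ShardedGradScaler)
    assert not isinstance(get_grad_scaler(use_fsdp=False), ShardedGradScaler)
```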
The API enforces that the wrapping policy be just a set of modules, which is sufficient for a few use cases, but the underlying API offers more generality in terms...
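For illustration, a minimal sketch of the difference, assuming PyTorch's FSDP `auto_wrap_policy` protocol: a fixed set of module classes via `ModuleWrapPolicy` versus an arbitrary callable policy (the size threshold and module class below are made up for the example).

```python
# Minimal sketch assuming PyTorch's FSDP auto_wrap_policy protocol; the threshold
# and module class are illustrative only.
import torch.nn as nn
from torch.distributed.fsdp.wrap import ModuleWrapPolicy

# Set-of-module-classes style (what the current API exposes):
set_policy = ModuleWrapPolicy({nn.TransformerEncoderLayer})


# More general callable style accepted by the underlying FSDP API:
def size_based_policy(module: nn.Module, recurse: bool, nonwrapped_numel: int) -> bool:
    if recurse:
        return True  # always keep traversing children
    return nonwrapped_numel >= 1_000_000  # wrap only sufficiently large submodules


# model = FSDP(model, auto_wrap_policy=set_policy)          # set-of-modules
# model = FSDP(model, auto_wrap_policy=size_based_policy)   # arbitrary callable
```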
https://github.com/pytorch/torchtune/pull/779 is adding QLoRA-13B, but we need to add CI for this as well.
This will save memory for GQA / MQA, but will require a bit of refactoring of the attention forward pass.
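A rough sketch of where the memory goes, with assumed shapes and names (not the actual attention code); the `enable_gqa` path requires PyTorch >= 2.5 and is shown only as one possible way to avoid the expansion.

```python
# Illustrative sketch with assumed shapes/names. A naive GQA/MQA forward expands
# K/V from num_kv_heads to num_heads before attention; keeping K/V un-expanded is
# where the memory saving comes from.
import torch
import torch.nn.functional as F

bsz, seq_len, num_heads, num_kv_heads, head_dim = 2, 128, 32, 8, 64
q = torch.randn(bsz, num_heads, seq_len, head_dim)
k = torch.randn(bsz, num_kv_heads, seq_len, head_dim)
v = torch.randn(bsz, num_kv_heads, seq_len, head_dim)

# Naive path: materialize num_heads // num_kv_heads copies of K and V.
k_exp = k.repeat_interleave(num_heads // num_kv_heads, dim=1)
v_exp = v.repeat_interleave(num_heads // num_kv_heads, dim=1)
out_naive = F.scaled_dot_product_attention(q, k_exp, v_exp)

# Memory-saving path: K/V stay at num_kv_heads (PyTorch >= 2.5).
out_gqa = F.scaled_dot_product_attention(q, k, v, enable_gqa=True)
torch.testing.assert_close(out_naive, out_gqa)
```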
#### Context

- In this PR, we introduce `TunePerfMonitor`, a utility class for tracking metrics across training. This class is meant to be flexible in the actual metrics that users...
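As a rough illustration of that flexibility (a hypothetical stand-in, not the actual `TunePerfMonitor` API from the PR), a metrics container that stays agnostic to which metrics callers record:

```python
# Hypothetical sketch only; the real TunePerfMonitor API may differ.
from collections import defaultdict
from typing import Dict, List


class PerfMonitorSketch:
    """Accumulate arbitrary named metrics across training steps."""

    def __init__(self) -> None:
        self._metrics: Dict[str, List[float]] = defaultdict(list)

    def update(self, **metrics: float) -> None:
        # Callers choose the metrics, e.g. tokens_per_second, peak_memory_gb.
        for name, value in metrics.items():
            self._metrics[name].append(value)

    def averages(self) -> Dict[str, float]:
        return {name: sum(vals) / len(vals) for name, vals in self._metrics.items()}


# monitor = PerfMonitorSketch()
# monitor.update(tokens_per_second=1200.0, step_time_s=0.8)
# print(monitor.averages())
```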