Stephen Roller issues

Results 41 issues of


                                            Stephen Roller

Update config.yml

**Patch description** **Testing steps** **Other information**

CLA Signed

Doc improvement requests

Please comment with docs you think should be updated, added, for clarified.

never-stale

multiwoz v22 is very slow

# Bug description It takes an extremely long time to load multiwoz v22. With the data already downloaded, the train set takes >200 seconds to get to display_data on my...

Help Wanted

never-stale

In #3740, we added support for FullyShardedDataParallel, but limited implementation to that of Zero2, not Zero3. Zero3 results in substantial decreases of memory usage compared with Zero2 while bringing speed...

never-stale

Flaky Tests

Please comment or +1 list your flaky tests. Don't just say "gpu tests", name specifics failing

never-stale

New Metric.from_mask helper method

We have quite a few instances where we have some per-token losses/metrics along with a corresponding mask ```python metric_per_token # torch.Tensor of shape (batchsize, num_tokens) mask # torch.BoolTensor of shape...

Enhancement

Help Wanted

Medium

never-stale

Update colab tutorial to use distilled blenderbot

We have a newer model. Let's use it! Current tutorial here: https://colab.research.google.com/drive/1bRMvN0lGXaTF5fuTidgvlAl-Lb41F7AD#scrollTo=KtVz5dCUmFkN

Help Wanted

Small

never-stale

Retire projects/image_chat/interactive.py

It contains too much copy pasta with the regular interactive web. We should find a way to improve this.

Code Quality

Medium

donotreap

Ability to observe per-example metrics during parley

**Description** We need to be able to see metrics (like F1, etc) for individual examples when available.

never-stale

TorchAgent/TGA/TRA/... should log when initializing optimizer or not

It's pretty useful to know whether the initializer is being created and using extra memory. We should add a log saying whether it's being hit. It might help with #2942.

Help Wanted

Minor

never-stale