Binh Tang

39 comments by Binh Tang

While we're still waiting for a code change, is there a temporary workaround? I tried adding a CSS rule as suggested in https://github.com/bstriner/keras-tqdm/issues/21 and all...
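
For reference, here is a minimal sketch of applying such a CSS rule from inside the notebook itself; the selector is paraphrased from the discussion in that issue and is an assumption that may need adjusting for your Jupyter/JupyterLab version:

```python
from IPython.display import HTML, display

# Sketch of the CSS-based workaround discussed in bstriner/keras-tqdm#21,
# injected from the notebook itself. The selector below is an assumption
# and may differ depending on your Jupyter/JupyterLab version.
display(HTML("""
<style>
.p-Widget.jp-OutputPrompt.jp-OutputArea-prompt:empty {
    padding: 0;
    border: 0;
}
</style>
"""))
```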

@byshiue Would you mind confirming whether weight-only quantization works with the GPT 175B model without mixed precision? I have been able to get reasonable outputs using an OPT 175B checkpoint,...

@mady143 The RuntimeError indicates you're out of GPU memory. You can try to reduce the batch size.
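
For example, something along these lines (a hypothetical helper, not part of the codebase; on recent PyTorch versions, CUDA OOM raises `torch.cuda.OutOfMemoryError`):

```python
import torch

# Hypothetical helper: retry a forward pass with progressively smaller
# batches when CUDA runs out of memory. `model` and `batch` are
# illustrative stand-ins for your own objects.
def forward_with_smaller_batches(model, batch, min_batch_size=1):
    batch_size = batch.size(0)
    while batch_size >= min_batch_size:
        try:
            return model(batch[:batch_size])
        except torch.cuda.OutOfMemoryError:
            torch.cuda.empty_cache()  # release cached blocks before retrying
            batch_size //= 2
    raise RuntimeError("Out of GPU memory even at the minimum batch size")
```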

@gulzainali98 It appears that the weights you're using are not compatible with `ModelParallelTransformerLanguageModel`, which expects the K/Q/V projection weights to be combined into a single matrix. If you still have issues, I recommend trying one of...
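
For illustration, merging separate projections into a combined one could look roughly like the sketch below. The key names and layout are assumptions, not the exact metaseq checkpoint format; the real layer may additionally interleave weights per attention head:

```python
import torch

# Minimal sketch of combining separate q/k/v projection weights into the
# single matrix a merged-QKV layer expects. Key names are illustrative,
# and the actual layout (e.g., per-head interleaving) may differ.
def merge_qkv_weights(state_dict, prefix):
    q = state_dict.pop(f"{prefix}.q_proj.weight")
    k = state_dict.pop(f"{prefix}.k_proj.weight")
    v = state_dict.pop(f"{prefix}.v_proj.weight")
    # Linear weights are (out_features, in_features), so stack along dim 0.
    state_dict[f"{prefix}.qkv_proj.weight"] = torch.cat([q, k, v], dim=0)
    return state_dict
```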

I think the first issue can be fixed by a one-line change (see this [OmegaConf documentation](https://omegaconf.readthedocs.io/en/2.1_branch/usage.html#struct-flag)):

```python
import omegaconf

with omegaconf.open_dict(cfg):
    setattr(cfg["model"], "inference", True)
```

@jxmsML The `qkv_proj` can be found in [ModelParallelMultiheadAttention](https://github.com/facebookresearch/metaseq/blob/f2cd36798793604cf51ab8b8a2cb167c964f9667/metaseq/model_parallel/modules/multihead_attention.py#L185), which is [enabled by default](https://github.com/facebookresearch/metaseq/blob/f2cd36798793604cf51ab8b8a2cb167c964f9667/metaseq/model_parallel/modules/multihead_attention.py#L91) when used with Megatron. This is in contrast with the individual `q_proj`, `k_proj`, `v_proj` in [MultiheadAttention](https://github.com/facebookresearch/metaseq/blob/f2cd36798793604cf51ab8b8a2cb167c964f9667/metaseq/modules/multihead_attention.py#L21). The...
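
As a quick way to tell which layout a given checkpoint uses, something like this sketch works (the path and the top-level `"model"` key are placeholders for your own file):

```python
import torch

# Quick check (sketch): does a checkpoint store attention weights as a
# combined `qkv_proj` or as separate `q_proj`/`k_proj`/`v_proj`?
# "checkpoint.pt" and the top-level "model" key are placeholders.
state_dict = torch.load("checkpoint.pt", map_location="cpu")["model"]
combined = sorted(k for k in state_dict if ".qkv_proj." in k)
separate = sorted(k for k in state_dict
                  if any(p in k for p in (".q_proj.", ".k_proj.", ".v_proj.")))
print(f"{len(combined)} combined keys, {len(separate)} separate keys")
```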

> So far I can't find such logic of concatenating the `q_proj`, `v_proj`, `k_proj` into `qkv_proj` in metaseq.

I think the weights are concatenated by default for `ModelParallelTransformerLanguageModel`, and we...

It seems to me that distributed process groups weren't initialized properly. In addition to Punit's suggestion, can you also quickly check if Slurm environment variables have been inherited correctly (e.g....
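
Something like this quick check, run inside each worker process, would confirm it (these are the usual Slurm and torch.distributed variable names):

```python
import os

# Sanity check (sketch): print the environment variables a Slurm-launched
# distributed job typically relies on, to confirm that the worker
# processes inherited them.
for var in ("SLURM_PROCID", "SLURM_NTASKS", "SLURM_NODEID",
            "SLURM_LOCALID", "MASTER_ADDR", "MASTER_PORT"):
    print(f"{var}={os.environ.get(var)}")
```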

> I don't think I have Slurm installed but I don't think that's the issue. It seems that the suffix `-model_part-0` is missing.

You're right, Slurm isn't required and might...

Please see the updated PR for a script to benchmark the generator interface in terms of latency and peak GPU memory usage. We also add a function to collect GPU...
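
For context, the measurement boils down to something like this sketch; `generator.generate(prompts)` is a stand-in for the actual interface in the PR:

```python
import time
import torch

# Minimal sketch of the kind of measurement the benchmark script makes:
# wall-clock latency and peak GPU memory for one generation call.
def benchmark(generator, prompts):
    torch.cuda.reset_peak_memory_stats()
    torch.cuda.synchronize()  # make sure pending kernels don't skew timing
    start = time.perf_counter()
    outputs = generator.generate(prompts)
    torch.cuda.synchronize()
    latency_s = time.perf_counter() - start
    peak_mem_gib = torch.cuda.max_memory_allocated() / 1024 ** 3
    return outputs, latency_s, peak_mem_gib
```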