Simran Arora
Awesome, thanks for putting in the request!
Hi, I think it's because this RMSNorm is being set to None: https://github.com/HazyResearch/based/blob/e8de5648f7e84248be8ebc1499e817641b0f577b/based/models/gpt.py#L371

due to the import structure here: https://github.com/HazyResearch/based/blob/e8de5648f7e84248be8ebc1499e817641b0f577b/based/models/gpt.py#L52

The options are to:
- install the norm from flash...
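For context, a minimal sketch of the guarded-import pattern that leads to this (the names here are illustrative, not copied verbatim from based/models/gpt.py):

```python
# Sketch of a guarded import: if flash-attention's fused kernels are not
# installed, the import fails and the norm falls back to None, which later
# breaks any code that tries to instantiate it.
try:
    from flash_attn.ops.rms_norm import RMSNorm  # fused CUDA kernel
except ImportError:
    RMSNorm = None
```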
That line you pointed out requires this to be installed: https://github.com/Dao-AILab/flash-attention/tree/main/csrc/fused_dense_lib. I would recommend cloning flash-attention and running `python setup.py install` within this directory. An alternative workaround, without the install, is to...
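One possible no-install fallback (a sketch on my part, not the repo's exact code) is to swap in a plain-PyTorch RMSNorm where the fused one would have been used:

```python
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    """Plain-PyTorch RMSNorm: slower than the fused flash-attn kernel,
    but removes the extra build dependency."""
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Normalize by the root-mean-square over the last dimension,
        # then apply the learned per-channel scale.
        rms = x.pow(2).mean(dim=-1, keepdim=True).add(self.eps).rsqrt()
        return x * rms * self.weight
```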
Yes, that was due to the changes. Please try again and let me know if you run into any issues.
The implementation already exists. You can learn more about it here: https://hazyresearch.stanford.edu/blog/2024-11-27-tk-fp8
That would be great!!