butterluo

Results 17 issues of butterluo

Did ls compare with marian(https://github.com/marian-nmt/marian) ? What is the result?

is there any example for multi-gpu training (with fairseq and huggingface)?

Can lightseq export the weights in normal pytorch stat_dict after training using lightseq? It can be used for hugging face transformers or other pytorch lib to load and later use.

summary: When using MMFTransformer with MLM head, an warning will occur which will be an error in future: "UserWarning: optimizer contains a parameter group with duplicate parameters; in future, this...

CLA Signed

### Question When the code and dataset of LLaVA1.6 will be released?

Is there any plan to release the ScreenAgent weights' huggingface version? I found that the sat is hard to use, lack of ecosystem and only support few quantization method.

### Feature request / 功能建议 The script which can transform CogAgent weights' sat version to huggingface version. ### Motivation / 动机 Many company use the huggingace transformers library, and sat...

### Feature Request Could you support microsoft's GraphRAG ### Motivation The current RAG still has numerous deficiencies. Microsoft's Graph RAG could substantially improve RAG capabilities, and I hope it will...

enhancement
stale

When the distributed GRPO training will be supported?

feature request