Automodel

PyTorch Distributed native training library for LLMs/VLMs with out-of-the-box (OOTB) Hugging Face support

Results 100 Automodel issues

**Is your feature request related to a problem? Please describe.** A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] **Describe the solution you'd...

enhancement

Hi Team, do you have any plans to support DoRA? Paper: https://arxiv.org/abs/2402.09353 peft: https://github.com/huggingface/peft/pull/1474 Thank you.

enhancement
community-request

Can give up to 10% speedup. Closes #606. Qwen - https://wandb.ai/Nemo-automodel/qwen3_moe-cp-pp-grid, Moonlight - https://wandb.ai/Nemo-automodel/moonlight-automodel-cp-pp-grid

Lower-bounding the Liger-Kernel version to ensure TiledMLP support is included (https://github.com/linkedin/Liger-Kernel/pull/935). TODO: confirm convergence with FSDP2.

**Is your feature request related to a problem? Please describe.** Leverage compiler techniques to support automatic model sharding. **Describe the solution you'd like** A clear and concise description of what...

enhancement

**Is your feature request related to a problem? Please describe.** In multi-hop QA, a query can have multiple positive docs. We should use all of them for biencoder training. Currently...

enhancement
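For readers unfamiliar with the multi-positive request above, here is a minimal sketch of one common way to use every positive doc for a query: a softmax contrastive loss averaged over all positives. It is not Automodel's implementation, and the function name `multi_positive_info_nce`, its arguments, and the `temperature` default are all hypothetical, chosen only for illustration.

```python
import math


def multi_positive_info_nce(scores, positive_mask, temperature=1.0):
    """Contrastive loss for one query, averaged over all of its positive docs.

    scores: query-to-document similarity scores, one per candidate document.
    positive_mask: booleans, True where the candidate is a positive document.
    (Hypothetical helper for illustration; not part of Automodel.)
    """
    if len(scores) != len(positive_mask):
        raise ValueError("scores and positive_mask must have the same length")
    if not any(positive_mask):
        raise ValueError("at least one positive document is required")

    scaled = [s / temperature for s in scores]

    # Log-sum-exp over all candidates, subtracting the max for stability.
    peak = max(scaled)
    log_denominator = peak + math.log(sum(math.exp(s - peak) for s in scaled))

    # Each positive contributes its own -log softmax term; average them.
    losses = [
        log_denominator - s
        for s, is_positive in zip(scaled, positive_mask)
        if is_positive
    ]
    return sum(losses) / len(losses)


# Usage: four candidates, the first two are positives for the query.
loss = multi_positive_info_nce([2.0, 1.5, 0.1, -0.3], [True, True, False, False])
```

With a single positive this reduces to the usual in-batch softmax loss, so a change along these lines would not alter single-positive training.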

**Describe the bug** Phi-3-mini-4k-instruct does not run with Automodel. I've tried two cases: 1. using `nemo_automodel.NeMoAutoModelForCausalLM.from_pretrained` as the model `_target_` in the config. In this case I get the error...

bug

Related to https://github.com/NVIDIA-NeMo/Automodel/pull/798, while attempting to run the `examples/llm_pretrain/pretrain.py` example with the `examples/llm_pretrain/megatron_pretrain_gpt2.yaml` configuration file, I ran into a compilation error when building C++ helpers. I was running on Databricks,...

bug