SkanderBS2024

Results 5 issues of SkanderBS2024

**Describe the bug** Hello, I tried to build an image from the Dockerfile in 'examples/mamba', and after pulling the image there was an error while installing packages. I tried to...

Hello, after pre-processing the dataset with a BPE tokenizer, I get this error when I launch the 'train.sh' script for Mamba. In the script it's mentioned that I have...

**How to customise train.sh for distributed Mamba training?** Hello, as far as I've seen in the Megatron modules, there isn't a predefined bash script to pre-train a Mamba model...

**Is your feature request related to a problem? Please describe.** When fine-tuning a mamba2 hybrid model, we can convert it to the .nemo format but we cannot convert back...

**Describe the bug** As described in the title, after finishing all of the installs and building NeMo and Megatron-LM from source, assuming that the model has been trained with Megatron-LM....

bug