SkanderBS2024
SkanderBS2024
**Describe the bug** Hello, i tried to build an mage from the docker file 'examples/mamba' and after pulling the image there was an error while installing packages. I tried to...
Hello, after pre-processing the dataset with a BPE tokenizer, when i launch the 'train.sh' script for mamba i do get this error. In the script it's mentionned that i have...
**How to customise the train.sh for a distributed Mamba Training ?** Hello, As i've seen in the megatron modules, there isn't a pre-defined bash script to pre-train a mamba model...
**Is your feature request related to a problem? Please describe.** When fine-tuning for a mamba2 hybrid model we can convert it to a .nemo format but we cannot convert back...
**Describe the bug** As described in the title, after finishing all of the installs and building nemo and megatron-lm from source, assuming that the model has been trained with megatron-lm....