Mehmet Hamza Erol
Mehmet Hamza Erol
Hello, Thank you very much for the insightful work detailed in your paper! The training recipe (e.g. learning rate, scheduler, #of epochs) and results for the Vim-T and Vim-S models...
Hello, I'm reaching out to inquire whether the issue described at this link: https://github.com/state-spaces/mamba/issues/84#issuecomment-1933483360 could potentially be related to Triton's implementation and if it may require a bugfix in the...
Hello, First of all, thank you very much for this work and your efforts! The repository and guidelines are succinct and pretty effective! I've encountered a recurring issue while training...