Niko Maroulis
I will test it tomorrow with my H200 to be sure that everything is working. With my mbr the answers seem OK, but the generation is slow. My end goal...
Generation looking good! ```elixir iex(16)> prompt = """ ...(16)> system ...(16)> You are a helpful assistant. ...(16)> user ...(16)> What is the capital of France? ...(16)> assistant ...(16)> """ "system\nYou...
@jonatanklosko I used a light model with the Qwen3 architecture to write some basic tests similar to the other PR. Let me know if this is enough.
Sorry, work caught up with me. I will continue the PR this weekend.
@jonatanklosko I managed to get some time. I compared the Elixir implementation with Python transformers. ## 🧪 Test Environment - **Python**: transformers 4.57.1, torch 2.9.0 (bf16) - **Elixir**: Bumblebee (local),...
@jonatanklosko I finally found some time again! I feel I have addressed all the comments. Sorry for this large PR.
I started a draft PR! Looking promising.