prof-schacht
prof-schacht
The python implementation should be here: https://github.com/salesforce/CodeGen/blob/main/jaxformer/hf/codegen/modeling_codegen.py Could you explain how to convert these models first to ggml and then to cpp. (P.S.: @ggerganov I contacted you also on twitter)
Great I will try it out. Thanks for your hint.
Is there still work to do where I and my team can help?
Did somebody train a Gemma-2-9b-it Tuned_lens model since the commit? I tried it but it failed with the following error: Traceback (most recent call last): File "/root/miniconda/envs/tunedlens/bin/tuned-lens", line 8, in...