prof-schacht


The Python implementation should be here: https://github.com/salesforce/CodeGen/blob/main/jaxformer/hf/codegen/modeling_codegen.py Could you explain how to convert these models first to ggml and then to cpp? (P.S.: @ggerganov, I also contacted you on Twitter.)

Great, I will try it out. Thanks for the hint.

Has anybody trained a Gemma-2-9b-it tuned-lens model since the commit? I tried, but it failed with the following error: Traceback (most recent call last): File "/root/miniconda/envs/tunedlens/bin/tuned-lens", line 8, in...