How to Load OpenELM Pre-training Checkpoints Using Hugging Face AutoModelForCausalLM?
Hi there,
First, really admire the work on OpenELM! Thank you for making your models and code available.
Question regarding the pre-training checkpoints linked here: how can we convert these checkpoints into the format expected by `AutoModelForCausalLM.from_pretrained`?
I presume there's a script that was used to convert the final model weights into HF format, but I couldn't find it in the repo.
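For context, here's roughly what I was hoping would work. This is just a minimal sketch: it assumes the checkpoint is a plain torch state dict whose key names already match the HF OpenELM implementation (which may well not be the case), and the checkpoint filename is a placeholder.

```python
import torch
from transformers import AutoModelForCausalLM

# Load the released HF model as a scaffold for the architecture.
model = AutoModelForCausalLM.from_pretrained(
    "apple/OpenELM-270M", trust_remote_code=True
)

# Placeholder path; the real pre-training checkpoint may also nest the
# weights under a key such as "model" rather than being a flat state dict.
state_dict = torch.load("openelm_pretrain_checkpoint.pt", map_location="cpu")

# strict=False so we can inspect how far off the key names are.
missing, unexpected = model.load_state_dict(state_dict, strict=False)
print("missing keys:", missing)
print("unexpected keys:", unexpected)
```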
Would very much appreciate any help on this!
Best, Jason
I have also encountered the same problem. Do you have a solution?
I didn't wind up solving this, but here's a reference that might be helpful: https://github.com/foundation-model-stack/foundation-model-stack/blob/4349dacef63e86b6c1acdccb69b48fe562365bb2/fms/models/llama.py#L592
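In case it helps anyone, the code at that link remaps checkpoint key names from one format into another. A rough sketch of that general approach for OpenELM might look like the following; to be clear, the rename rules below are guesses, and you'd need to inspect a real checkpoint to fill in the actual key names on both sides.

```python
import torch

# Hypothetical (old_prefix, new_prefix) rename rules -- check these against
# the keys in an actual OpenELM pre-training checkpoint before using.
RENAME_RULES = [
    ("token_embeddings.", "transformer.token_embeddings."),
    ("layers.", "transformer.layers."),
    ("norm.", "transformer.norm."),
]

def convert_state_dict(raw_sd: dict) -> dict:
    """Remap checkpoint key names into the target (HF-style) format."""
    converted = {}
    for old_key, tensor in raw_sd.items():
        new_key = old_key
        for src, dst in RENAME_RULES:
            if new_key.startswith(src):
                new_key = dst + new_key[len(src):]
                break
        converted[new_key] = tensor
    return converted

ckpt = torch.load("pretrain_checkpoint.pt", map_location="cpu")
# Some training frameworks nest the weights under a "model" key.
raw_sd = ckpt.get("model", ckpt) if isinstance(ckpt, dict) else ckpt
torch.save(convert_state_dict(raw_sd), "pytorch_model.bin")
```

You'd also need a matching `config.json` next to the converted weights for `from_pretrained` to pick them up.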
As a follow-up question: are there any plans to push the model checkpoints to the Hugging Face Hub for ease of access (like the Pythia suite of models)? It would really help the NLP community! Thanks in advance!
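For reference, the Pythia suite exposes its intermediate pre-training checkpoints as branch revisions on the Hub, so loading any training step is a single call. Something like this for OpenELM would be great:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Pythia publishes each intermediate checkpoint as a Hub branch revision,
# e.g. "step3000"; the same pattern would make OpenELM checkpoints easy to use.
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/pythia-160m", revision="step3000"
)
tokenizer = AutoTokenizer.from_pretrained(
    "EleutherAI/pythia-160m", revision="step3000"
)
```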