John B Olan
John B Olan
I forgot to tune the `num_workers` so the `train_bert()` ran for an extended time. I simply tuned the `num_worker` to `0` and completed training in less than 1 minute on...
I downloaded `nvidia/Llama3-ChatQA-1.5-8B` manually from HF into local. I ran `scripts/convert_hf_checkpoint.py` Then I wanted to run generate.py using the local checkpoint dir: ` raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(...
 I was stepping through the guide, including the requirements.txt. I got the attached error. I have `python 3.10.6`, `pydantic 2.5.0`, and `pydantic_core of 2.14.1`, `ray 2.8.0`
Any thoughts on why I am not able to reproduce the same resolution reported out there: Weights: I downloaded weights from Huggingface to local and loaded it from local directory...