guozejia
I encountered a similar issue as well. Checkpoints tree: Model: https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct Dataset: https://huggingface.co/datasets/HuggingFaceTB/everyday-conversations-llama3.1-2k The code I used is as follows: ``` import argparse from datasets import load_dataset from...
@xylian86 Thank you for your correction. I have successfully completed the conversion, and I will test later whether the converted data can be reloaded.
I used the Meta-Llama-3.1-8B-Instruct model to generate a DeepSpeed checkpoint, then tried using the script, but encountered an error.  So, I printed the output shown below and found that...
@xylian86 I have tried various model sizes, such as 1B, 7B, and 8B. However, only the 8B model encounters issues after training, converting, and then being processed by...
I retried the above experiment using the Llama-8B model, but it encountered an error: `TypeError: 'NoneType' object is not subscriptable`. The directories for generated data and checkpoints...
I was using the older version 0.14.4 of DeepSpeed as mentioned above. After updating to 0.15.2, I encountered the following error: `[rank2]: ValueError: loaded state dict contains a parameter group...
> Shouldn't this be a complete set of RBAC features? Better to look forward to the big plans promised for the v2 rewrite.
1. Should we add a new table to store the series data? 2. Clicking a series ID should support querying both recent and upcoming matches, right?