guozejia

Results: 8 comments by guozejia

I encountered a similar issue as well. ![Image](https://github.com/user-attachments/assets/634fc8e4-66e2-46a7-9f29-4dda8f2122ee) Checkpoints tree: ![Image](https://github.com/user-attachments/assets/44537697-2ac6-4f0b-a7c8-5efa696b527c) Model: https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct Dataset: https://huggingface.co/datasets/HuggingFaceTB/everyday-conversations-llama3.1-2k The code I used is as follows: ``` import argparse from datasets import load_dataset from...

@xylian86 Thank you for your correction. I have successfully completed the conversion, and I will test later whether the converted data can be reloaded.

I used the Meta-Llama-3.1-8B-Instruct model to generate a DeepSpeed checkpoint, then tried using the script, but encountered an error. ![Image](https://github.com/user-attachments/assets/1d2bd38e-62e9-4228-b2aa-4f046e0a1255) So, I printed the output shown below and found that...

@xylian86 I have tried models of various sizes, such as 1B, 7B, and 8B. However, only the 8B model encounters issues after training, converting, and then being processed by...

I retried the above experiment with the Llama-8B model, but it encountered an error: `TypeError: 'NoneType' object is not subscriptable`. ![Image](https://github.com/user-attachments/assets/9538bb17-b392-4c22-b778-3026a6f8da34) The directories for generated data and checkpoints...

I was using the older version 0.14.4 of DeepSpeed as mentioned above. After updating to 0.15.2, I encountered the following error: `[rank2]: ValueError: loaded state dict contains a parameter group...

> Shouldn't this be a complete set of RABC features? Might as well look forward to the rough plan promised for the v2 refactor.

1. Should we add a new table to store the series data? 2. Clicking a series ID should support querying both recent and upcoming matches, right?