Results 7 comments of 费政聪

Actually, toy_data.json is the first 100 items in the train.json, which is used to debug code quickly. You can use train.json instead, off course, we will fix it later :)

Hi, i have tried to fine-tune LLaMA model for better Chinese text generation. Please refer to: https://github.com/feizc/MLE-LLaMA

Hi, i have tried for instruct tuning on LLaMa, and provides scripts and alignment datasets : ) . You can reference for: https://github.com/feizc/MLE-LLaMA

> @feizc how are you approaching the problem of generating starting from a length that is less than the prefix? Actually, I use a fixed length of conditional context, i.e.,...

Hi, @a-r-r-o-w I try to convert the model ckpts with scripts: https://github.com/huggingface/diffusers/blob/mochi/scripts/convert_mochi_to_diffusers.py And create generation with pipeline in mochi branch: https://github.com/huggingface/diffusers/tree/mochi with code: import torch from diffusers import MochiPipeline from...

Hi, sorry for the missing file upload. We have uploaded the file in [huggingface](https://huggingface.co/Skywork/SkyReels-A2/blob/main/model_index.json).