费政聪
Actually, toy_data.json is just the first 100 items of train.json and is meant for debugging the code quickly. You can use train.json instead. Of course, we will fix it later :)
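For anyone who wants to rebuild such a debug subset, here is a minimal sketch. It assumes train.json holds a top-level JSON array, which is not confirmed in this thread, and `make_toy_subset` is a hypothetical helper name, not something from the repo:

```python
import json
from pathlib import Path


def make_toy_subset(train_path, toy_path, n=100):
    """Write the first n items of train_path to toy_path and return them.

    Assumption: the training file is a single JSON array of examples.
    """
    items = json.loads(Path(train_path).read_text(encoding="utf-8"))
    if not isinstance(items, list):
        raise ValueError("expected a top-level JSON array")
    subset = items[:n]  # slicing is safe even if the file has fewer than n items
    Path(toy_path).write_text(
        json.dumps(subset, ensure_ascii=False, indent=2),
        encoding="utf-8",
    )
    return subset
```

Calling `make_toy_subset("train.json", "toy_data.json")` would write the first 100 items. `ensure_ascii=False` keeps any Chinese text readable in the output file.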
Hi, I have tried fine-tuning the LLaMA model for better Chinese text generation. Please refer to: https://github.com/feizc/MLE-LLaMA
Hi, I have tried instruction tuning on LLaMA and provide scripts and alignment datasets :) You can refer to: https://github.com/feizc/MLE-LLaMA
> @feizc how are you approaching the problem of generating starting from a length that is less than the prefix? Actually, I use a fixed length of conditional context, i.e.,...
Hi @a-r-r-o-w, I tried converting the model checkpoints with this script: https://github.com/huggingface/diffusers/blob/mochi/scripts/convert_mochi_to_diffusers.py and then ran generation with the pipeline from the mochi branch: https://github.com/huggingface/diffusers/tree/mochi using this code: import torch from diffusers import MochiPipeline from...
Hi, sorry for the missing file. We have uploaded it to [Hugging Face](https://huggingface.co/Skywork/SkyReels-A2/blob/main/model_index.json).
Hi, the hyper-parameters here need to be set to the path of the downloaded model: https://github.com/SkyworkAI/SkyReels-A2/blob/main/app.py#L35