Zhenzhong
Zhenzhong
你好! 我将您的代码中预训练模型做了替换,能否进行英文训练呢(还需修改其他地方吗)?跑出来的效果很差,是数据集太少的缘故吗? 感谢解答。 Hello! I have replaced the pre-trained model in your code, can I do English training (do I need to modify other places)? The effect of running out...
模型原理
你好,如果有时间的话可以更新一下各个模块的原理图或者原理描述吗,谢谢啦!
关于标签生成
感谢你的项目! 有些疑问,请问生成标签的数量是怎么控制的?
**Describe the bug** 再进行多机lora微调时出错: failed (exitcode: -11) local_rank: 5 (pid: 11514) of binary: /home/jovyan/data-ws-enr/zconda/envs/swift_ft/bin/python Traceback (most recent call last): File "/home/jovyan/data-ws-enr/zconda/envs/swift_ft/bin/torchrun", line 8, in sys.exit(main()) File "/home/jovyan/data-ws-enr/zconda/envs/swift_ft/lib/python3.10/site-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 348, in...
Could there be an issue with the parameter settings in my training script? export WANDB_MODE=disabled train_data="\ /home/jovyan/dataws1/bgeft/train_table_data " # set large epochs and small batch size for testing num_train_epochs=1 per_device_train_batch_size=1...