Zhenzhong issues

Results: 5 issues of Zhenzhong

Is swapping the pretrained model suitable for English multi-class classification?

Hello! I replaced the pretrained model in your code. Can it be trained on English data (do I need to modify anything else)? The results are very poor; is that because the dataset is too small? Thanks for your answer.

Model principles

Hello, if you have time, could you add a schematic diagram or a description of how each module works? Thanks!

About label generation

Thanks for your project! I have a question: how is the number of generated labels controlled?

failed (exitcode: -11) local_rank: 5 (pid: 11514) of binary: /home/jovyan/data-ws-enr/zconda/envs/swift_ft/bin/python

**Describe the bug** An error occurs during multi-node LoRA fine-tuning: failed (exitcode: -11) local_rank: 5 (pid: 11514) of binary: /home/jovyan/data-ws-enr/zconda/envs/swift_ft/bin/python Traceback (most recent call last): File "/home/jovyan/data-ws-enr/zconda/envs/swift_ft/bin/torchrun", line 8, in sys.exit(main()) File "/home/jovyan/data-ws-enr/zconda/envs/swift_ft/lib/python3.10/site-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 348, in...
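For readers decoding the `exitcode: -11` in the report above: a negative exit code from a Python-launched worker conventionally means the process was killed by the signal with that number, and signal 11 is SIGSEGV (a segmentation fault). The sketch below only illustrates that convention with the standard library. `describe_exit_code` is a hypothetical helper name and is not part of torchrun or swift, and it says nothing about the root cause of this particular crash.

```python
import signal


def describe_exit_code(exitcode: int) -> str:
    """Describe a subprocess-style exit code (hypothetical helper for illustration)."""
    if exitcode < 0:
        # A negative code means the process was terminated by signal number -exitcode.
        signum = -exitcode
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # The number does not map to a signal known on this platform.
            name = "unknown signal"
        return f"killed by signal {signum} ({name})"
    return f"exited with status {exitcode}"


print(describe_exit_code(-11))
```

A segmentation fault of this kind originates in native code rather than in the Python script itself, so the Python traceback printed by the launcher usually shows only where the failure was reported, not where it occurred.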

Why do the embeddings from my trained BGE-M3 model match those from the original model?

Could there be an issue with the parameter settings in my training script? export WANDB_MODE=disabled train_data="\ /home/jovyan/dataws1/bgeft/train_table_data " # set large epochs and small batch size for testing num_train_epochs=1 per_device_train_batch_size=1...