Roy
### Description I deployed this part of the code on the web side, but concurrent inference gets stuck; that is, when two users run inference at the same time, the inference graph gets stuck,...
Hi, the paper mentions that you pretrain the model on two 16-GPU nodes. Are these 16 A100 GPUs? I'm not sure what these nodes...
Hello, it's great work! Can you tell me the number of trainable parameters when fine-tuning the model on the retrieval task using Uniter_base?
I currently have a picture and text dataset (the pictures are ordinary pictures, while the text is long, more like a composition or a...
Hello! When I train with the config "resume_from_checkpoint: latest", I load the LoRA checkpoint but get the error "No inf checks were recorded for this optimizer." However, if I train without a checkpoint, it...
Hello, I want to train on a single machine with 6 GPUs, but I keep getting errors and can't tell what is wrong. [0] NVIDIA GeForce RTX 3090 | 64°C, 100 % | 9619 / 24268 MB | root:bzminer/89389(7677M) yuxiang:python/22067(329M) yuxiang:python/22068(321M) yuxiang:python/22069(329M) yuxiang:python/22066(327M) yuxiang:python/22071(319M) yuxiang:python/22064(327M) All 6 processes keep running on a single GPU, and I don't know why. if true; then nohup...