puppet101

Results 12 issues of puppet101

Hi, I tried to train my model based on the imagenet validation set which contains 50k images. At the beginning, i.e. the iteration step is smaller than some small numbers...

I find that the provided mirrored-human dataset is incompleted, the some of the videos and frame annotations are missing, how can I get them?

Hi, I used the training script in the readme, and didn't change the data and parameters, but my gpu memory still run out. Have you test it on 4xA100 40Gb?...

@s-JoL 非常感谢分享这么好的工作,中文预训练模型真的好稀缺。 我想了解一下,目前的预训练模型,有在一些评测集上测试过指标吗?

hi, I am training yi-34b model using HybridParallel and got the following errors, my pytorch version is 2.0, cuda is 11.8. Could you please give me some help? Thanks! Gradient...

我只用github上提供的配置文件yi_34b_200k_full_alpaca_enzh_32k_sp8,运行时的deepspeed选项是zero3_offload 但是出现如下错误,请问现在序列并行是不支持offload吗,还是有别的原因? 谢谢。 Traceback (most recent call last): Traceback (most recent call last): Traceback (most recent call last): File "/opt/ml/job/xtuner/tools/train.py", line 342, in File "/opt/ml/job/xtuner/tools/train.py", line 342, in Traceback (most...

deepspeed
sequence parallel

@nkolot Hi, thanks for sharing the code, it is great! Could you please provide a demo of multiview refinement?

Hi, @HugoBA92 Thanks for sharing your great work. But could you please provide the testing script and pretrained model?

Hi, I meet some errors when serving Qwen1.5-32B using fp6 quantization, could you please give me a help? Thank you. My code is below: import mii pipe = mii.pipeline('/mymodel/Qwen1.5-32B-fp16', quantization_mode='wf6af16',...

Thank you for sharing this code! Could you please give some detailed description on training the FFHQ dataset? For example, how to set the default hyperparameters?