hzhang57
Results: 3 comments by hzhang57
Hi, I trained a model with the provided code on ImageNet-1k only, using 4x 2080 Ti GPUs (batch size 100), and it finally reached around 82.0. I uploaded this temporary alternative to Google Drive to facilitate others'...
Yes, I trained Visformer-small with 224 input resolution: visformer_small
I tried Qwen3-VL-4B-Instruct. It can be GRPO-tuned with vLLM version 0.11.2.