RemoteCLIP

Fine tuning training time?

Open BradNeuberg opened this issue 1 year ago • 2 comments

I'd like to know more about how you all fine-tuned your model from the base OpenCLIP weights. How long did it take, and what GPUs did you end up using? We are thinking about fine-tuning RemoteCLIP itself with some more domain-specific imagery and want to get a general sense of the cost and time it took you all to do that yourselves. Thanks :)

BradNeuberg avatar Mar 12 '24 18:03 BradNeuberg

I believe you used ITRA to fine-tune your model from the OpenCLIP LAION weights. It looks like ITRA is also from your research group? Did you end up using the OpenCLIP tooling to fine-tune, or your own ITRA?

BradNeuberg avatar Mar 12 '24 18:03 BradNeuberg

Thank you very much for your interest! We performed full-parameter fine-tuning starting from the OpenCLIP weights; training the ViT-L-14 model took 5 hours on four RTX 3090 GPUs. ITRA is indeed from our team, and we used it for both model training and evaluation. :)

gzqy1026 avatar Mar 18 '24 07:03 gzqy1026
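
For anyone budgeting a similar run, the figures reported above (5 hours of wall-clock time on 4 GPUs for ViT-L-14) work out to 20 GPU-hours. The sketch below just does that arithmetic. The function names and the hourly rate are hypothetical placeholders, not anything stated in this thread, so substitute your own provider's price.

```python
def gpu_hours(wall_clock_hours: float, num_gpus: int) -> float:
    """Total GPU-hours consumed by a run that keeps all GPUs busy."""
    return wall_clock_hours * num_gpus


def estimated_cost(wall_clock_hours: float, num_gpus: int,
                   usd_per_gpu_hour: float) -> float:
    """Rough cost estimate: GPU-hours times an hourly rate you supply."""
    return gpu_hours(wall_clock_hours, num_gpus) * usd_per_gpu_hour


# Figures reported in the thread: ViT-L-14, 5 hours, 4 GPUs.
reported = gpu_hours(wall_clock_hours=5, num_gpus=4)
print(f"Reported run: {reported:g} GPU-hours")

# ASSUMPTION: 0.50 USD per GPU-hour is a made-up placeholder rate.
placeholder_rate = 0.50
cost = estimated_cost(5, 4, placeholder_rate)
print(f"Cost at the placeholder rate: ${cost:.2f}")
```

Note that this only reproduces the reported run. A fine-tune on a different dataset size, batch size, or GPU type will not necessarily scale linearly from these numbers.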