big_vision
big_vision copied to clipboard
PlaiGemma finetuned model
Thanks for the great work! I've checked out some finetuned model released on HF, like docVQA, just wanna know if you have any plans to provide finetune example for different downstream tasks?
You can check Skalski's work with PaliGemma here: https://github.com/roboflow/notebooks/blob/main/notebooks/how-to-finetune-paligemma-on-detection-dataset.ipynb
Also google published Jax tutorial for paligemma finetuned captioning: https://ai.google.dev/gemma/docs/paligemma/fine-tuning-paligemma
I played some with the captioning tutorial and would happily help if I can