Claudiu Daniel Hromei
Claudiu Daniel Hromei
Hello everyone, thank you very much for your contribution. I appreciate the effort and consistency in uploading the code for such many models and maintaining this repository. I saw Kosmos-2...
### Describe the issue Issue: I want to fine-tune a multi-modal LLM on a downstream task that uses both images and text. This is what I've done: 1. I tried...
Hello everyone, thank you for the great job! I am trying to further fine-tune the LLaVA architecture using your implementation with LLaMA 3 Instruct 8B. I can already fine-tune the...