Yuhui Zhang
Yuhui Zhang
VetTag
Official Code Release for VetTag: improving automated veterinary diagnosis coding via large-scale language modeling
TransSeg
Official Code Release for "Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation" (ML4H 2022)
drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
C3
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
AutoConverter
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 2025)
VLMClassifier
Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)