MMTrustEval
A toolbox for benchmarking the trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Datasets and Benchmarks Track)
Hi! Thanks for sharing your work. Could you please provide the script for generating adversarial examples for Tasks R.4 and R.5?

Dear authors, does your method support testing models that have been fine-tuned and saved locally? If so, how should I proceed with this?
Hi, Niels here from the open-source team at Hugging Face. It's great to see you're releasing models + data on HF, I discovered your work through the paper page: https://huggingface.co/papers/2406.07057...