Geisa Faustino

2 issues by Geisa Faustino

Hi Jose, thanks for making this work available. 😊 I am trying to download the NYU Depth V2 dataset you mention in your README, but the link seems to be broken....

Include a paragraph on best practices for evaluating RAG solutions. This emphasizes the importance of assessing the stability of LLM-based responses and validating LLM-based evaluators/metrics.