Evaluators can access all columns

Open aakrem opened this issue 1 year ago • 2 comments

  • Evaluators now have access to any correct-answer column from the test set.

aakrem avatar May 02 '24 17:05 aakrem

The latest updates on your projects.

| Name | Status | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| agenta | ✅ Ready (Inspect) | Visit Preview | 💬 Add feedback | May 31, 2024 7:46am |

vercel[bot] avatar May 02 '24 17:05 vercel[bot]

@aakrem backend tests are failing

mmabrouk avatar May 17 '24 08:05 mmabrouk

Thanks for the PR, @aakrem! Great job. I have a few concerns (unrelated to the code changes).

I think the title of the PR and its description are a bit misleading. I have two questions:

* If an evaluator has access to all columns, why can we only access a single column in the test set?

This has been fixed. You now configure in the evaluator which columns it can access, and it can then read them.

* If the evaluators have access to any correct answer column from the test set, why are similarity match, JSON field match, webhook test, and Levenshtein distance the only evaluators that do?

It depends on the evaluator. Some of them, like the JSON check, which only verifies that the LLM output is valid JSON, obviously do not need access to any ground truth.
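
To make the two answers above concrete, here is a rough sketch of the distinction. It is not the code from this PR: the function names, the `settings` dict, and the `correct_answer_key` setting are all hypothetical and only illustrate the idea that one evaluator reads a configured test set column while another needs no ground truth at all.

```python
import json

# Hypothetical names throughout; this is an illustration, not agenta's API.

def exact_match(output: str, row: dict, settings: dict) -> float:
    """Compare the output to the test set column named in the evaluator settings."""
    column = settings["correct_answer_key"]  # hypothetical setting name
    return 1.0 if output.strip() == str(row[column]).strip() else 0.0


def json_check(output: str, row: dict, settings: dict) -> float:
    """Needs no ground truth: only checks that the output parses as JSON."""
    try:
        json.loads(output)
    except json.JSONDecodeError:
        return 0.0
    return 1.0


# A test set row with more than one candidate ground-truth column.
row = {"question": "Capital of France?", "expected": "Paris", "alt_answer": "Paris, France"}
```

With a row like this, pointing `correct_answer_key` at `expected` or at `alt_answer` changes what `exact_match` compares against, while `json_check` ignores the row entirely.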

mmabrouk avatar May 31 '24 08:05 mmabrouk