gemini-cli icon indicating copy to clipboard operation
gemini-cli copied to clipboard

Feature: Evaluate and improve Reasoning

Open anj-s opened this issue 6 months ago • 5 comments

What problem does this solve?

This feature will evaluate model reasoning to identify areas of improvement.

How will it work?

By implementing a systematic evaluation process, we'll measure and improve model reasoning performance.

anj-s avatar Jul 14 '25 01:07 anj-s

If it be of any help, we may integrate some MCPs for it. See my fix of a fix (the former untested yet): https://github.com/modelcontextprotocol/servers/issues/2332

Manamama avatar Jul 14 '25 10:07 Manamama

Found possible duplicate issues:

  • #4082: (0.9038019449651825)
  • #4084: (0.9015102655145609)

gsquared94 avatar Jul 30 '25 01:07 gsquared94

Proposal: Use SWE bench verified as a scoring framework to evaluate the performance of Gemini CLI.

Or do we already have a different plan of action?

akhil29 avatar Aug 02 '25 07:08 akhil29

Hello! As part of our effort to keep our backlog manageable and focus on the most active issues, we are tidying up older reports.

It looks like this issue hasn't been active for a while, so we are closing it for now. However, if you are still experiencing this bug on the latest stable build, please feel free to comment on this issue or create a new one with updated details.

Thank you for your contribution!

gemini-cli[bot] avatar Dec 03 '25 21:12 gemini-cli[bot]

Found possible duplicate issues:

  • #8773
  • #11692

If you believe this is not a duplicate, please remove the status/possible-duplicate label.

gemini-cli[bot] avatar Dec 04 '25 19:12 gemini-cli[bot]

all subissues are closed, closing this

Adib234 avatar Jan 26 '26 16:01 Adib234