Mo comments

Results 10 comments of

Mo

Is there any plan to support FinGPT

I will try to look into that..

May I ask where can I download the generated results from Claude and GPTs?

Are these the ones shared on www.swebench.com or different?

May I ask where can I download the generated results from Claude and GPTs?

"This repository contains the predictions, execution logs, trajectories, and results for model inference + evaluation runs on the [SWE-bench](https://swe-bench.github.io/) task." https://github.com/swe-bench/experiments

Update graph_utils.py

Thanks for the tip @joshkyh

simplify readme

Maybe a fixed web link of the benchmark image so that it updates everywhere whenever there is a new agent.

Upper bound score by skilled human?

> multiple tries, Range?

Upper bound score by skilled human?

> In our experiments, the best models can achieve rather high scores when given multiple tries, and I believe that the practical upper bound is much higher than the current...

How to inference without docker?

> I do not think this is possible anymore. And that is a good thing indeed! (unless you are on openBSD or something of that sort)

Feature Request: Implement Tool Calling

Any updates on MCP?

LLM Agent for KataGo

@qcgm1978 looks amazing! can't wait to try it myself. Thank you for the hardwork.