Tommaso Cerruti
Results
2
comments of
Tommaso Cerruti
Hi @baberabb @jannalulu, I’d like to help by adding InfiniteBench to the evaluation tasks. I see it’s mentioned in this issue and partially covered in #3256, while BabiLong and LongBench...
I can take this. I would go for approach 1 (pass the LLM tool call id directly) as an optional tool_call_id kwarg, injected only if the tool’s signature accepts it....