LLF-Bench icon indicating copy to clipboard operation
LLF-Bench copied to clipboard

A benchmark for evaluating learning agents based on just language feedback

Results 6 LLF-Bench issues
Sort by recently updated
recently updated
newest added

You can test the code by running ```bash python llfbench/agents/ag_agent.py ``` The code only supports DISCRETE action space now, and I will keep testing other action space. The main difficulty...

Thanks for the interesting work. I try to use this repo, but for alfworld and meta world, there seems many packages are missing. For example, in the screenshot, there's no...

The file path would not work if users use LLF-Bench in other folders (aka, not from the repo's root).

Bumps [requests](https://github.com/psf/requests) from 2.31.0 to 2.32.0. Release notes Sourced from requests's releases. v2.32.0 2.32.0 (2024-05-20) 🐍 PYCON US 2024 EDITION 🐍 Security Fixed an issue where setting verify=False on the...

dependencies

Hey there! This is AG2 👋 First of all, thank you for using pyautogen! We've seen you're using pyautogen, and we're here to help you migrate to ag2. This pull...

Bumps [requests](https://github.com/psf/requests) from 2.32.0 to 2.32.4. Release notes Sourced from requests's releases. v2.32.4 2.32.4 (2025-06-10) Security CVE-2024-47081 Fixed an issue where a maliciously crafted URL and trusted environment will retrieve...

dependencies
python