llm-colosseum
llm-colosseum copied to clipboard
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
How to generate this ranking? If I added new model, how to reproduce this benchmark?
I replace the second character with Gouki and define it as white Gouki_WHITE = [255, 255, 255]. If the two characters do not switch positions, the game proceeds normally, but...
Using the LLM abstraction of llamaindex i've added support for more models. I tried to keep everything else the same as is for simplicity. I've already tested "anthropic:claude-3-haiku-20240307" vs groq:gemma-7b-it...
Add Google Gemini model
How to use Google gemini model to fighting?
Thank you so much for this project, it was great! But I don't want to have two AI's at the same time, I want to have AI's and computer characters.
Is there a way to get it to set the matches to a best of... 3 of 5 4 of 7, etc Thanks,
it is yi:6b-fp16 model for two players. why does it not generate any actions?
INFO:diambra.arena.arena_gym:EnvironmentSettingsMultiAgent(game_id='sfiii3n', frame_shape=[0, 0, 0], step_ratio=6, disable_keyboard=True, disable_joystick=True, render_mode='human', splash_screen=False, rank=0, env_address='127.0.0.1:55006', grpc_timeout=600, seed=1712025299, difficulty=None, continue_game=0.0, show_final=False, tower=3, _last_seed=1712025299, pb_model=game_id: "sfiii3n" how to set show_final=true??? I try to look for in...