Xinyuan Tong

Results 27 comments of Xinyuan Tong

There are still some bugs: And don't forget to Format your code according to the [Code Formatting with Pre-Commit](https://docs.sglang.ai/references/contribution_guide.html#code-formatting-with-pre-commit).

cc @mickqian Could you help review this PR? Thanks!

We should later delete `docs/supported_models/vision_language_models.md` and move mistral part to `docs/supported_models/multimodal_language_models.md`, but it's OK to merge now.

MMMU bench result: ``` {'Accounting': {'acc': 0.4, 'num': 30}, 'Agriculture': {'acc': 0.562, 'num': 16}, 'Architecture_and_Engineering': {'acc': 0.333, 'num': 30}, 'Art': {'acc': 0.667, 'num': 30}, 'Art_Theory': {'acc': 0.7, 'num': 30}, 'Basic_Medical_Science':...

Now whether the `--stream-output` is set or not, the responses API streaming functions properly.

So we decide to remove other commands like replay, fine-tune and encode?