Jason (Siyu) Zhu issues

Results 8 issues of


                                            Jason (Siyu) Zhu

Add GPTQ quantization kernels for 2, 3, 8-bit use cases

Earlier, there was an awesome PR https://github.com/vllm-project/vllm/pull/916 on supporting the GPTQ Exllama kernel in a 4-bit quantization setup. This PR introduces additional kernels for use cases with different quantization bits,...

quantization

Add openfunctionv2 model inference script and fix minor bug

# Summary This PR introduces a new model handler [openfunctions_handler.py](https://github.com/ShishirPatil/gorilla/compare/main...JasonZhu1313:gorilla:jaszhu/add_openfunctions_handler?expand=1#diff-3af430d47eb913aec657f3bad6dcbae4e39ee152dcb8b1699e65614fdd87e10d) to run inference on OS model gorilla-llm/gorilla-openfunctions-v2 and reproduce the results on leaderboard Issue: https://github.com/ShishirPatil/gorilla/issues/352 # Changes * Merge the...

[Reproducibility] OpenFunctions-v2: <Issue> Unable to reproduce the AST scores reported in leaderboard with OS checkpoint

**Describe the bug** A clear and concise description of what the bug is. Great work on gorilla! I have used the OS model checkpoint https://huggingface.co/gorilla-llm/gorilla-openfunctions-v2 with vLLM to try reproducing...

hosted-openfunctions-v2

BFCL-General

Add liger HF blog

Agentic RL Support in GPT-OSS

### Feature request Agentic RL Support in GPT-OSS ### Motivation Hey Community, @HJSang and I are from the LinkedIn Core AI team. Over the past few weeks, we’ve been working...

Jason (Siyu) Zhu

Add GPTQ quantization kernels for 2, 3, 8-bit use cases

Add openfunctionv2 model inference script and fix minor bug

[Reproducibility] OpenFunctions-v2: <Issue> Unable to reproduce the AST scores reported in leaderboard with OS checkpoint

Question about evaluation datasets

Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to Trainer

[feature] Add gpt-4o-2024-08-06 with strict: true in response_format

Add liger HF blog

Agentic RL Support in GPT-OSS