TR-3B
> Hey, looking at the merged PR, should this open issue be closed?

Yeah, I had the same question. Is this still an open issue?
This is my implementation with the Llama 3.1 8B model, which pretty much solves issue #1400. @danielhanchen, this is the reference notebook for that issue: [llama 3.1 tool calling](https://colab.research.google.com/gist/MagellaX/2dc7c6b4faf7ae49f17eac9945bacc7c/tool-calling.ipynb)
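For readers who haven't opened the notebook: the core of Llama 3.1-style tool calling is that the instruct model emits a JSON object naming a tool and its arguments, which your code parses and dispatches. A minimal, hypothetical sketch (not the notebook's actual code; `get_weather` and the registry are illustrative stand-ins):

```python
import json

# Hypothetical tool registry; get_weather is an illustrative stub,
# not a function from the linked notebook.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

TOOLS = {"get_weather": get_weather}

def dispatch_tool_call(model_output: str) -> str:
    """Parse a Llama 3.1-style JSON tool call and run the matching function.

    Llama 3.1 instruct models typically emit a JSON object of the form
    {"name": "...", "parameters": {...}} when they decide to call a tool.
    """
    call = json.loads(model_output)
    fn = TOOLS[call["name"]]
    return fn(**call.get("parameters", {}))

# Example: the kind of string a model might produce for a weather query.
result = dispatch_tool_call('{"name": "get_weather", "parameters": {"city": "Paris"}}')
print(result)  # Sunny in Paris
```

The tool result would then be fed back to the model as a `tool` role message so it can compose the final answer; the notebook handles that round trip end to end.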
[save.zip](https://github.com/user-attachments/files/20012996/save.zip) This might work; it's an enhanced, more bullet-proof implementation.
@alckasoc any thoughts? You can merge this.
> Hi @MagellaX, sorry for the late response! Thank you for the contribution!
>
> Some questions:
>
> * From what I understand, this PR is to enforce...
Reminder that this is foundational LoRA support, meaning that from here we can bring more features to MLC-LLM, such as multi-LoRA batching (pending upstream TVM/Relax changes), dynamic LoRA switching...
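To make the "dynamic LoRA switching" point concrete: a LoRA adapter is just a low-rank delta `B @ A` added to a frozen base weight, so switching adapters at runtime means picking a different `(B, A)` pair without touching the base model. A minimal NumPy sketch of the idea (shapes, names, and the `apply_lora` helper are illustrative, not MLC-LLM's actual API):

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen base weight and two hypothetical LoRA adapters of rank r = 4.
d, r = 16, 4
W = rng.standard_normal((d, d))
adapters = {
    "adapter_a": (rng.standard_normal((d, r)), rng.standard_normal((r, d))),
    "adapter_b": (rng.standard_normal((d, r)), rng.standard_normal((r, d))),
}

def apply_lora(x, name, alpha=1.0):
    """Compute y = x @ (W + alpha * B @ A).

    The adapter delta is added on the fly, so "switching" an adapter is
    just selecting a different (B, A) pair; the base W never changes.
    """
    B, A = adapters[name]
    return x @ (W + alpha * (B @ A))

x = rng.standard_normal((1, d))
y_a = apply_lora(x, "adapter_a")
y_b = apply_lora(x, "adapter_b")
# Different adapters yield different outputs from the same base weight.
assert not np.allclose(y_a, y_b)
```

Multi-LoRA batching extends this: each request in a batch can index its own `(B, A)` pair, which is why it needs the pending upstream TVM/Relax kernel changes.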
@junrushao @MasterJH5574 any takes?
@gemini-code-assist re-run the review
@hijkzzz any thoughts?
> This is a great MR. I’ll need some time to go through it carefully. The current implementation may have some issues with vLLM weight synchronization and HF weight saving...