TurboMa issues

Repositories
Issues
Comments

Results 1 issues of


                                            TurboMa

Usage of remote:vllm

What I understand about this is actually deploy a model (e.g Llama3.1-70B-Instruct) by using 'vllm serve Llama3.1-70B-Instruct ... ' and then config the url and model name to llama-stack for...