llmaz
llmaz copied to clipboard
βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!
Results
82
llmaz issues
Sort by
recently updated
recently updated
newest added
a readme or later in website. - KubeCon China Keynote https://sched.co/1x5jP - Higress AI Integration: https://mp.weixin.qq.com/s/DsJ4aY1K6mEnwR_Ms8QvMA - Already native integrated - Envoy AI Gateway - Karpenter - ...
documentation
needs-priority
needs-triage
**What would you like to be added**: See the whole list: https://github.com/InftyAI/llmaz/milestone/3 ### We'll focus on three main things: - [ ] xPyD serving with heterogeneous devices, we need a...
feature
needs-priority
needs-triage