tiny-llm
feat: Distributed inference (TP/PP)
It would be great to add support for single-node, multi-process inference to Tiny LLM.