InferenceMAX
InferenceMAX copied to clipboard
AMD needs to use upstream SGLang images instead of fork
Fix issues in https://github.com/InferenceMAX/InferenceMAX/pull/247 Test Have inference engineer verify performance