mlx
mlx copied to clipboard
Make MAX models work on kfserving and model mesh serve
Is your feature request related to a problem? Please describe. Currently, we only can access MAX models if we deployed them using Kubernetes deployment. Both kfserving and model mesh won't work for MAX because MAX models are mapped with non-configurable endpoints and needs complex pre/post-processing methods.
Describe the solution you'd like A clear and concise description of what you want to happen.
Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.
Additional context Add any other context or screenshots about the feature request here.