Michele Zanotti issues

Results 19 issues of


                                            Michele Zanotti

cluster-autoscaler cannot scale to Azure VMSS when running k8s on-prem

Hi everyone, I am trying to use the cluster-autoscaler for provisioning new nodes on Azure Virtual Machine Scale Sets, starting from a cluster run locally on my PC using [kind](https://kind.sigs.k8s.io/)....

cluster-autoscaler

kind/bug

lifecycle/rotten

[CapacityScheduling] ElasticQuota default value for "max" is zero

Hello everyone, When creating an `ElasticQuota` resource, if the "max" field is not provided then its default value is zero. Is this the intended behavior? Quoting [KEP 9 - Capacity...

[Trimaran] Support GPU resource

Hello everyone, It would be nice if Trimaran supported load-aware scheduling based on GPU utilization. The [load-watcher](https://github.com/paypal/load-watcher) service supports Prometheus sources, so the metrics exposed by the NVIDIA DCGM Exporter...

Fix CI pipelines triggers

Speedster optimize_model should raise error if input data is missing

## Description Invoking `optimize_model` with either `input_data=None` or `input_data=[]` generates the following output: ``` 2023-03-09 18:09:34 | INFO | Running Speedster on CPU ``` However, the model is not optimized...

[Speedster] Make Speedster optimize_model() return InferenceLearner also for StableDiffusion models

Speedster `optimize_model` function should return an object of type [BaseInferenceLearner](https://github.com/nebuly-ai/nebullvm/blob/main/nebullvm/operations/inference_learners/base.py#L41). However when optimizing Stable Diffusion models, the function returns an object of type `StableDiffusionPipeline`. It would be useful if the...

speedster

Michele Zanotti

cluster-autoscaler cannot scale to Azure VMSS when running k8s on-prem

[CapacityScheduling] ElasticQuota default value for "max" is zero

[Trimaran] Support GPU resource

Fix CI pipelines triggers

Speedster optimize_model should raise error if input data is missing

[Speedster] Make Speedster optimize_model() return InferenceLearner also for StableDiffusion models

[Nebullvm] Add option to Nebullvm auto-installer for installing all libraries

MPS server not serving any request after connecting with wrong user ID

Support mixed MIG+MPS dynamic partititioning

Handle GPU partitioning mode changes on the same Node (MIG<>MPS)