André Bauer
+1 for this feature. Is there any way to get the desired behavior already?
> In the config json, set "stage3_prefetch_bucket_size": 0, that should work

While this might "work", it still does not solve the problem, for example with `mixtral`, since this kind of MoE...
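For anyone trying the suggestion above, here is a minimal sketch of where that key sits in a ZeRO stage 3 config, written as the Python dict form of the config json; everything except the bucket-size line is an illustrative placeholder, not a recommendation:

```python
# Minimal ZeRO stage 3 config sketch: only stage3_prefetch_bucket_size
# comes from the suggestion above, the rest are illustrative placeholders.
ds_config = {
    "zero_optimization": {
        "stage": 3,
        "stage3_prefetch_bucket_size": 0,  # disable parameter prefetching
    },
    "fp16": {"enabled": True},            # placeholder precision setting
    "train_micro_batch_size_per_gpu": 1,  # DeepSpeed expects a batch size even
                                          # when the engine is used for inference
}
```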
> I had some success loading the model this way:
>
> ```
> with deepspeed.OnDevice(dtype=dtype, device="meta"):
>     model = AutoModelForCausalLM.from_pretrained(model_name, low_cpu_mem_usage=True)
> model = deepspeed.init_inference(
>     model,
>     tensor_parallel...
> ```
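Since the quoted snippet is cut off, here is one way the full pattern could plausibly continue; the model name, `tp_size`, dtype, and the commented-out checkpoint argument are my assumptions, not the original poster's exact code:

```python
import torch
import deepspeed
from transformers import AutoModelForCausalLM

model_name = "mistralai/Mixtral-8x7B-v0.1"  # assumed model for illustration
dtype = torch.float16

# Build the module structure on the meta device so from_pretrained()
# allocates no real weights in host RAM.
with deepspeed.OnDevice(dtype=dtype, device="meta"):
    model = AutoModelForCausalLM.from_pretrained(model_name, low_cpu_mem_usage=True)

# init_inference shards the model and loads the real weights; the kwargs
# below are a guess at how the truncated call continued.
model = deepspeed.init_inference(
    model,
    tensor_parallel={"tp_size": 1},
    dtype=dtype,
    replace_with_kernel_inject=False,
    # checkpoint="checkpoints.json",  # with meta tensors DeepSpeed usually needs
    #                                 # a checkpoint description to fill in weights
)
```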
That means if I have 300 GB of RAM for a 301 GB model, there is no way to offload only the 1 GB of params to NVMe in inference 🤔? I...
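For reference, the offload being asked about lives in the `offload_param` section of the ZeRO config; the sketch below shows the existing knobs (the path and buffer numbers are placeholders). `max_in_cpu` is possibly the closest documented setting to a partial offload, since it bounds how many parameter elements stay in CPU RAM when the rest go to NVMe:

```python
# ZeRO-Infinity parameter offload section the question refers to; nvme_path
# and the buffer/max values are placeholders, not tuned recommendations.
ds_config["zero_optimization"]["offload_param"] = {
    "device": "nvme",            # offload ZeRO-3 parameters to NVMe
    "nvme_path": "/local_nvme",  # placeholder mount point for the NVMe drive
    "pin_memory": True,
    "buffer_count": 5,
    "buffer_size": 1e8,
    "max_in_cpu": 1e9,           # parameter elements allowed to remain in CPU RAM
}
```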