kiran

Results 6 comments of kiran

Flexgen only supports opt models

clean up disk and keep it under 95% usage, that should fix the issue

@HarrywillDr v100 gpus does not support TF32, remove that tag

maybe try `full_shard offload auto_wrap`

is there any progress on this?