kiran
kiran
Flexgen only supports opt models
clean up disk and keep it under 95% usage, that should fix the issue
@HarrywillDr v100 gpus does not support TF32, remove that tag
maybe try `full_shard offload auto_wrap`
is there any progress on this?
seeing the same issue