Ali Naeimi
Results
3
comments of
Ali Naeimi
for anyone wondering, this is how you can get NVFP4BlockScaling to work on SM120: recipe = NVFP4BlockScaling(disable_rht= True ,disable_stochastic_rounding= True ) @ptrendx any progress on this?
@vgoklani yeah I also thought it's mandatory but turns out it isn't and the model actually does converge although I haven't tried with model_init set to nvfp4. you're welcome to...