Ali Naeimi

Results: 3 comments by Ali Naeimi

For anyone wondering, this is how you can get NVFP4BlockScaling to work on SM120: recipe = NVFP4BlockScaling(disable_rht=True, disable_stochastic_rounding=True). @ptrendx, any progress on this?
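For context, here is a rough sketch of how that recipe might be plugged into a forward pass. Only the NVFP4BlockScaling call and its two keyword arguments come from the comment above. The import path, the use of te.fp8_autocast and te.Linear, the helper names, and the layer sizes are my assumptions about a typical Transformer Engine setup, so check them against your installed version.

```python
# Keyword arguments taken verbatim from the comment above.
SM120_NVFP4_KWARGS = {"disable_rht": True, "disable_stochastic_rounding": True}


def make_sm120_recipe():
    """Hypothetical helper: build the recipe, or return None if TE is unavailable."""
    try:
        # Assumed import path; it may differ between Transformer Engine versions.
        from transformer_engine.common.recipe import NVFP4BlockScaling
    except ImportError:
        return None
    return NVFP4BlockScaling(**SM120_NVFP4_KWARGS)


def forward_with_recipe(recipe, features=128, batch=128):
    """Hypothetical helper: run one linear layer under the recipe (needs a GPU)."""
    import torch
    import transformer_engine.pytorch as te

    # Sizes are arbitrary; they are kept at multiples of 128 as a guess at
    # what the NVFP4 kernels accept.
    layer = te.Linear(features, features, params_dtype=torch.bfloat16)
    x = torch.randn(batch, features, device="cuda", dtype=torch.bfloat16)
    with te.fp8_autocast(enabled=True, fp8_recipe=recipe):
        return layer(x)


if __name__ == "__main__":
    recipe = make_sm120_recipe()
    if recipe is None:
        print("Transformer Engine with NVFP4BlockScaling is not installed")
    else:
        print(forward_with_recipe(recipe).shape)
```

The recipe construction is kept separate from the forward pass so it can be checked on a machine without a GPU.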

@vgoklani yeah, I also thought it was mandatory, but it turns out it isn't, and the model does actually converge, although I haven't tried with model_init set to nvfp4. You're welcome to...