DiffSynth-Studio
DiffSynth-Studio copied to clipboard
fp8 vs bf16
float8 sadly not even properly working on image to video models yet https://github.com/modelscope/DiffSynth-Studio/issues/466
@xizi Native FP8 computation requires support from Hopper architecture GPUs. To ensure compatibility, we have to temporarily convert to bfloat16 precision during computation. Therefore, FP8 quantization does not provide a speed improvement.