TransformerEngine
TransformerEngine copied to clipboard
FP8 attention with current scaling
Is your feature request related to a problem? Please describe. To be added
Describe the solution you'd like Work on improving performance for FP8 current scaling
Describe alternatives you've considered N/A