nvMelissa issues

Results 13 issues of nvMelissa

Support DeepSeek FP8 recipe in JAX

**Is your feature request related to a problem? Please describe.** N/A **Describe the solution you'd like** Support DeepSeek FP8 recipe in JAX. Already supported in PyTorch. **Describe alternatives you've considered**...

FP8

Priority = P1

Support multi-GPU DeepSeek recipe in TE/JAX

Support single-GPU DeepSeek recipe in TE/JAX

Restructure attention C API

**Is your feature request related to a problem? Please describe.** At the moment, we enumerate the parameters in C APIs like this: https://github.com/NVIDIA/TransformerEngine/blob/5e4e0b2c378d2b1ec2ee65dfa85124e1dd805389/transformer_engine/common/fused_attn/fused_attn.cpp#L835 As we add more features to attention,...

refactor

attention

MuonClip for Kimi-K2 model

**Is your feature request related to a problem? Please describe.** This is not related to a problem; it is a feature request to expand model coverage. **Describe the solution you'd...

attention

FP8 attention with current scaling

**Is your feature request related to a problem? Please describe.** To be added **Describe the solution you'd like** Work on improving performance for FP8 current scaling **Describe alternatives you've considered**...

performance

attention

developer efficiency

pattern matching

Replace TE check_support with FE check_support

Dynamic Shapes

MoE Model Optimizations

Pattern Matching