Bissmella Bahaduri
# What does this PR do?

This is a draft implementation of the Unified SP attention approach.

- Implements `_all_to_all_dim_exchange` with custom scatter and gather indices
- Implements `TemplatedUnifiedAttention` Core...
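
To illustrate what an all-to-all exchange with separate scatter and gather dimensions can look like, here is a minimal single-process sketch. It is not the PR's code: it assumes "SP" refers to sequence parallelism, that each rank starts with a slice of the sequence and all attention heads, and that the exchange scatters along the head dimension while gathering along the sequence dimension. The function name `simulate_dim_exchange` and the toy shapes are hypothetical, and plain Python lists stand in for tensors and for the collective call.

```python
def simulate_dim_exchange(shards, world_size):
    """Simulate an all-to-all that trades a sequence shard for a head shard.

    Hypothetical helper for illustration only (not the PR's implementation).
    shards[r] is rank r's local block: a list of rows (sequence positions),
    each row holding one entry per attention head.
    Returns out[r]: every sequence position, but only rank r's slice of heads.
    """
    num_heads = len(shards[0][0])
    assert num_heads % world_size == 0, "heads must divide evenly across ranks"
    heads_per_rank = num_heads // world_size

    out = []
    for dst in range(world_size):
        lo = dst * heads_per_rank
        hi = lo + heads_per_rank
        rows = []
        # Gather along the sequence dimension: visit source ranks in order.
        for src in range(world_size):
            for row in shards[src]:
                # Scatter along the head dimension: keep only dst's heads.
                rows.append(row[lo:hi])
        out.append(rows)
    return out


# Toy setup: 2 ranks, 4 sequence positions, 4 heads.
# Each entry is labelled (position, head) so the movement is easy to follow.
world_size = 2
seq_len, num_heads = 4, 4
full = [[(s, h) for h in range(num_heads)] for s in range(seq_len)]

local_len = seq_len // world_size
shards = [full[r * local_len:(r + 1) * local_len] for r in range(world_size)]

exchanged = simulate_dim_exchange(shards, world_size)
```

In this sketch, rank 0 ends up with all four positions for heads 0 and 1, and rank 1 with all four positions for heads 2 and 3. Swapping the roles of the two dimensions would give the reverse exchange, which restores the sequence-sharded layout after attention.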