FasterTransformer
bugfix: qk_scale is not used by swin FMHA.
Fixes issue #322. Swin FMHA now uses qk_scale.
Signed-off-by: liangtao07 [email protected]
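For readers unfamiliar with the parameter, here is a minimal sketch of what honoring qk_scale means in an attention layer. It assumes the usual Swin convention that the scale falls back to head_dim ** -0.5 when qk_scale is not given. The function names are hypothetical and this is plain Python for illustration, not the FMHA kernel code touched by this PR.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_probs(q, k, qk_scale=None):
    """Return softmax(scale * q @ k^T) for one head.

    q, k: lists of token vectors, each of length head_dim.
    qk_scale: optional override; assumed to default to head_dim ** -0.5.
    """
    head_dim = len(q[0])
    scale = qk_scale if qk_scale is not None else head_dim ** -0.5
    scores = [
        [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        for qi in q
    ]
    return [softmax(row) for row in scores]


q = [[1.0, 0.0, 0.0, 0.0]]
k = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
default_probs = attention_probs(q, k)               # scale = 4 ** -0.5
custom_probs = attention_probs(q, k, qk_scale=1.0)  # user-supplied scale
```

If a fused path ignores qk_scale and always applies the default, `custom_probs` would come out identical to `default_probs`, which is the kind of mismatch described in the issue.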
There are some other bugs, so I created another PR, https://github.com/NVIDIA/FasterTransformer/pull/324. You can try it.
OK, Swin FMHA works correctly now, thanks!