composable_kernel
composable_kernel copied to clipboard
[Ck tile] Support layernorm one pass
- Add one pass pipeline, switch one pass and two pass pipeline according to problem size
- Fix compile error
- Support padding
Since OnePassLayernorm2dFwd() and TwoPassLayernorm2dFwd() share lots of codes. I think maybe we can merge them into one.
Since
OnePassLayernorm2dFwd()andTwoPassLayernorm2dFwd()share lots of codes. I think maybe we can merge them into one.
1c644ec523223823a6980172399d549d420df8d0