bukejiyu

Results 12 issues of bukejiyu

### PR Category Inference ### PR Types Others ### Description 1.增加add_group_norm_silu kernel 支持传入残差和激活 2.增加add_group_norm pattern 3.增加对应单测 pcard-71500

### PR Category Inference ### PR Types Others ### Description pcard-71500 1.pass支持调入cutlass kernel

### PR types Others ### PR changes Others ### Description

stale

![image](https://github.com/intel/xFasterTransformer/assets/52310069/cd1493de-8a6c-4a9f-83a4-c0ca83166f11)

build

### PR types Others ### PR changes Others ### Description 增加xft使用readme

stale

### PR types Others ### PR changes Others ### Description 增加flash_attn_2 自定义算子

### PR types Others ### PR changes Others ### Description add flash2 和 mqa

#### Before submitting - [ ] Lint code. If there are lint issues, please format the code first. ```shell # Install and register `pre-commit` in the project folder pip install...

#### Before submitting - [ ] Lint code. If there are lint issues, please format the code first. ```shell # Install and register `pre-commit` in the project folder pip install...

stale