fix: disable KV cache reuse if using attention sink
/bot run
PR_Github #284 [ run ] triggered by Bot
I wonder whether this should be more invasive, i.e., remove the logic that factors the sink bubble length into reuse, like
maxTokenNum? Or are you waiting for me to do that in my VSWA PR?
I would suggest prohibiting the configuration first. Then everyone is free to refactor under this assumption.
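To make the suggestion concrete, here is a minimal sketch of what rejecting the combination up front could look like. The names `KvCacheSettings`, `enable_block_reuse`, `sink_token_length`, and `validate_kv_cache_settings` are hypothetical stand-ins for illustration, not the actual config class or the code in this PR.

```python
from dataclasses import dataclass


@dataclass
class KvCacheSettings:
    """Hypothetical stand-in for a KV cache config; not the real API."""
    enable_block_reuse: bool = False
    sink_token_length: int = 0  # assumption: 0 means no attention sink


def validate_kv_cache_settings(settings: KvCacheSettings) -> None:
    """Raise ValueError if block reuse is requested together with an attention sink."""
    if settings.sink_token_length < 0:
        raise ValueError("sink_token_length must be non-negative")
    if settings.enable_block_reuse and settings.sink_token_length > 0:
        raise ValueError(
            "KV cache block reuse is not supported together with an attention sink "
            f"(sink_token_length={settings.sink_token_length})"
        )
```

With a check like this at config time, the reuse path could assume a sink length of zero, so any later refactor of the sink bubble handling would not need to keep the two features compatible.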
PR_Github #284 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #273 completed with status: 'FAILURE'
/bot run
PR_Github #299 [ run ] triggered by Bot
PR_Github #299 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #286 completed with status: 'FAILURE'
/bot run
PR_Github #318 [ run ] triggered by Bot
PR_Github #318 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #303 completed with status: 'FAILURE'
/bot run
PR_Github #395 [ run ] triggered by Bot
PR_Github #395 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #351 completed with status: 'FAILURE'
/bot run --disable-fail-fast
PR_Github #423 [ run ] triggered by Bot
PR_Github #423 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #364 completed with status: 'FAILURE'
/bot run --disable-fail-fast
PR_Github #463 [ run ] triggered by Bot
PR_Github #463 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #396 completed with status: 'SUCCESS'
/bot reuse-pipeline
PR_Github #538 [ reuse-pipeline ] triggered by Bot
PR_Github #538 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #463 for commit e57a93b
/bot reuse-pipeline
PR_Github #539 [ reuse-pipeline ] triggered by Bot
PR_Github #539 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #463 for commit 812d7dc
/bot reuse-pipeline
PR_Github #2337 [ reuse-pipeline ] triggered by Bot
PR_Github #2337 [ reuse-pipeline ] completed with state SUCCESS
Can't reuse PR_Github #0 with status: UNKNOWN
/bot reuse-pipeline
PR_Github #2358 [ reuse-pipeline ] triggered by Bot