refactor: Replace DecoderFinishedEvent with CudaEvent in decoder classes
- Updated the
forwardAsyncmethod inGptDecoderBatchedandiGptDecoderBatchedto returnCudaEventinstead ofDecoderFinishedEventPtr, simplifying event handling. - Removed the
DecoderFinishedEventclass and its associated usage across various files, streamlining the codebase. - Adjusted related methods and Python bindings to accommodate the new event structure, ensuring compatibility and maintaining functionality.
These changes enhance the clarity and efficiency of the decoding process in the batch manager.
/bot run
PR_Github #471 [ run ] triggered by Bot
PR_Github #471 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #404 completed with status: 'FAILURE'
/bot run
PR_Github #537 [ run ] triggered by Bot
PR_Github #537 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #458 completed with status: 'SUCCESS'
Looks good to me (can't approve, I would if I could).
/bot reuse-pipeline
PR_Github #674 [ reuse-pipeline ] triggered by Bot
PR_Github #674 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #537 for commit 8408e3b