pcastonguay

Results 3 issues of pcastonguay

To simplify disaggregated serving deployment and reduce duplicated code, the disaggregated workers and server can now be launched with: ``` python3 ${EXAMPLE_DIR}/launch_disaggregated_workers.py -c ${CONFIG_FILE} trtllm-serve disaggregated -c ${CONFIG_FILE} ``` respectively,...

# PR title ## Description Adds support for TRT backend when running disaggregated serving with trtllm-serve ## Test Coverage ## GitHub Bot Help `/bot [-h] ['run', 'kill', 'skip', 'reuse-pipeline'] ...`...

## Summary by CodeRabbit * **Improvements** * Enhanced timeout messaging for KV cache transfer operations. * **Tests** * Updated KV cache transfer backend configuration in test cases. * Re-enabled previously...