TensorRT-LLM
TensorRT-LLM copied to clipboard
Does tensorrt-llm support blip2 with fp8 quantization??
I wonder if tensorrt-llm supports blip2 with fp8 quantization? Thanks!
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."
This issue was closed because it has been stalled for 15 days with no activity.