superbenchmark icon indicating copy to clipboard operation
superbenchmark copied to clipboard

V0.11.0 Release Plan

Open cp5555 opened this issue 1 year ago • 0 comments

Release Manager

@cp5555

Endgame

  • [ ] Code freeze: TBD
  • [ ] Bug Bash date: TBD
  • [ ] Release date: TBD

Main Features

SuperBench Improvement

    • [x] Add CUDA 12.4 dockerfile (#619)

Micro-benchmark Improvement

    • [x] Add hipblasLt tuning to dist-inference cpp implementation (#616)
    • [x] Upgrade mlc to v3.11 (#620)
    • [ ] Support cuDNN Backend API in cudnn-function.

Model Benchmark Improvement

  1. Support VGG, LSTM, and GPT-2 small in TensorRT Inference Backend
  2. Support VGG, LSTM, and GPT-2 small in ORT Inference Backend
  3. Support more TensorRT parameters (Related to #366)

Result Analysis

cp5555 avatar Mar 27 '24 07:03 cp5555