Venky

Results 4 issues of Venky

# Description - Add some `Llama-3_3-Nemotron-Super-49B-v1` integration-perf-tests (cpp backend, trtllm-bench). - This also exposes a `--trust_remote_code` flag in the `trtllm-bench-build` subcommand, that is required for `transformers` library to use Autoclasses...

# Expand PyT `llama_v3.1_nemotron_nano_8b` perf tests coverage ## Description This PR adds end-to-end performance results for the **llama_v3.1_nemotron_nano_8b** bfloat16 engine on 1 H100. Two broad load patterns were evaluated on...

## Description - [x] Clean #9272 schema and rebase this PR on top - [x] Include config db packaging - [ ] Reflect #9104 in docs - [x] Add script...

## Description ### Context for reviewers: * Currently to get optimal deployments, we expect user to specify an optimal `extra_llm_api_options` yaml file as well as the right env vars (like...