TensorRT-LLM icon indicating copy to clipboard operation
TensorRT-LLM copied to clipboard

[TRTLLM-4932] Add QA accuracy tests for NIM-prioritized models

Open moraxu opened this issue 8 months ago • 12 comments

New tests added:

  • Llama-3.2-1B: added mmlu benchmark
  • Llama-3.1-Nemotron-Nano-8B-v1: added GSM8K, GPQADiamond benchmarks
  • Llama-3_1-Nemotron-Ultra-253B-v1: added the entire model (FP8 variant is being added to ftp/llm-models)
  • Phi-4-mini-instruct: added the model to the tests; skipped the test as the model likely has to be added to Torch models first (given the current error)

moraxu avatar May 12 '25 20:05 moraxu

/bot run

moraxu avatar May 13 '25 06:05 moraxu

@syuoni @crazydemo @LarryXFly - can you review this PR? Feel free to unassign yourself and tag someone else instead

moraxu avatar May 13 '25 06:05 moraxu

/bot run

moraxu avatar May 13 '25 18:05 moraxu

/bot run

moraxu avatar May 14 '25 06:05 moraxu

PR_Github #5121 [ run ] triggered by Bot

tensorrt-cicd avatar May 14 '25 07:05 tensorrt-cicd

PR_Github #5121 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #3735 completed with status: 'FAILURE'

tensorrt-cicd avatar May 14 '25 09:05 tensorrt-cicd

/bot run

moraxu avatar May 14 '25 19:05 moraxu

PR_Github #5205 [ run ] triggered by Bot

tensorrt-cicd avatar May 14 '25 19:05 tensorrt-cicd

PR_Github #5205 [ run ] completed with state FAILURE /LLM/main/L0_MergeRequest_PR pipeline #3798 completed with status: 'FAILURE'

tensorrt-cicd avatar May 14 '25 20:05 tensorrt-cicd

/bot run

moraxu avatar May 14 '25 21:05 moraxu

PR_Github #5211 [ run ] triggered by Bot

tensorrt-cicd avatar May 14 '25 21:05 tensorrt-cicd

PR_Github #5211 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #3803 completed with status: 'SUCCESS'

tensorrt-cicd avatar May 15 '25 01:05 tensorrt-cicd

/bot run

moraxu avatar May 18 '25 20:05 moraxu

PR_Github #5627 [ run ] triggered by Bot

tensorrt-cicd avatar May 18 '25 20:05 tensorrt-cicd

PR_Github #5627 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #4111 completed with status: 'FAILURE'

tensorrt-cicd avatar May 19 '25 04:05 tensorrt-cicd

/bot run

moraxu avatar May 19 '25 15:05 moraxu

PR_Github #5747 [ run ] triggered by Bot

tensorrt-cicd avatar May 19 '25 16:05 tensorrt-cicd

PR_Github #5747 [ run ] completed with state FAILURE /LLM/main/L0_MergeRequest_PR pipeline #4203 completed with status: 'FAILURE'

tensorrt-cicd avatar May 19 '25 17:05 tensorrt-cicd

/bot run

moraxu avatar May 19 '25 18:05 moraxu

PR_Github #5764 [ run ] triggered by Bot

tensorrt-cicd avatar May 19 '25 18:05 tensorrt-cicd

PR_Github #5764 [ run ] completed with state SUCCESS /LLM/main/L0_MergeRequest_PR pipeline #4218 completed with status: 'FAILURE'

tensorrt-cicd avatar May 19 '25 22:05 tensorrt-cicd

/bot run

moraxu avatar May 19 '25 22:05 moraxu

PR_Github #5780 [ run ] triggered by Bot

tensorrt-cicd avatar May 19 '25 22:05 tensorrt-cicd

@syuoni @Tracin @chang-l @tijyojwad Could you review a stacked PR (can't tag you there directly) that fills the remaining gaps? https://github.com/moraxu/TensorRT-LLM/pull/1

I figured it would be easier to merge it to this branch, given the existing acc references here.

moraxu avatar May 19 '25 22:05 moraxu

/bot kill

moraxu avatar May 19 '25 22:05 moraxu

PR_Github #5781 [ kill ] triggered by Bot

tensorrt-cicd avatar May 19 '25 22:05 tensorrt-cicd

PR_Github #5780 [ run ] completed with state ABORTED

tensorrt-cicd avatar May 19 '25 22:05 tensorrt-cicd

PR_Github #5781 [ kill ] completed with state SUCCESS Successfully killed previous jobs for commit 2b33c83

tensorrt-cicd avatar May 19 '25 22:05 tensorrt-cicd

/bot run

moraxu avatar May 21 '25 04:05 moraxu

/bot run

moraxu avatar May 21 '25 06:05 moraxu