ljk3210
Results
1
issues of
ljk3210
Really exciting to see progress on LLM benchmarking in the `loadgen` codebase. I do wonder that: a) Is `First Token latency` going to be the only metric? Sometimes we might...