Yuting Jiang

Results 6 issues of Yuting Jiang

**Description** Make baseline check optional in data diagnosis and fix bugs. **Major Revision** - make baseline file optional in data diagnosis - fix bugs of output in md and excel...

bug
tool

# V0.6.0 Test Plan ## Test Cases ### single-node test | Machine Type | #Node * #GPU * GPU Type | PyTorch Version | Accelerated Computing Toolkit | Status |...

test

**Description** Add CUDA 12.4 dockerfile. **Major Revision** - upgrade nvidia docker into 23.04 **Minor Revision** - upgrade mlc into v3.11 - upgrade hpcx into 2.18

containers

**Description** Add support of megatron lm deepseek v2 lite model training for cuda gpus. **Major Revision** - Upgrade Megatron-lm submodule to support mock-data and later models - Add support for...

benchmarks
model-benchmarks

**Description** gpu burn: collect per-snapshot per-GPU flops/temp and add summary metrics **Major Revision** - Parse all performance snapshot lines containing "Gflop/s" and record per-snapshot, per-GPU metrics: gpu_gflops: and gpu_temp: -...

benchmarks
micro-benchmarks

# Release Manager @cp5555 # Endgame - [ ] Code freeze: Oct, 2025 - [ ] Bug Bash date: TBD - [ ] Release date: TBD # Main Features ##...