ZHANG Ao
Commenting out `args=args` on line 429 of `megatron/training.py` will solve this problem.

```python
model, optimizer, _, lr_scheduler = deepspeed.initialize(
    model=model[0],
    optimizer=optimizer,
    lr_scheduler=lr_scheduler,
    config=config,
    # args=args,  # commented out to work around the problem
)
```