Chinthaka
dump_tensorflow_weights.py works well with the downloaded ssdlite_mobilenet_v2_coco_2018_05_09 model, but I am stuck at converting it to a Caffe model using the load_caffe_weights.py script. Traceback (most recent call last): File "load_caffe_weights.py", line 82, in...
To train much larger model variants (2B, 7B, etc.), we need larger GPU memory allocations for parameters, optimizer states, and gradients. The [Zero Redundancy Optimizer](https://www.deepspeed.ai/tutorials/zero/) introduces a methodology to shard these...
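A minimal sketch of the ZeRO stage-1 idea (each data-parallel rank owns only a shard of the optimizer state); all names here are illustrative, not llm.c's or DeepSpeed's actual implementation:

```python
# ZeRO-1 sketch: every rank holds the full parameters but only a
# 1/world_size shard of the Adam moments. Names are illustrative.
import numpy as np

def shard_range(num_params, rank, world_size):
    """Contiguous slice of the flat parameter vector owned by this rank."""
    per_rank = (num_params + world_size - 1) // world_size  # ceil division
    lo = min(rank * per_rank, num_params)
    hi = min(lo + per_rank, num_params)
    return lo, hi

class AdamShard:
    """Adam moments kept only for this rank's shard of the parameters."""
    def __init__(self, num_params, rank, world_size):
        self.lo, self.hi = shard_range(num_params, rank, world_size)
        n = self.hi - self.lo
        self.m = np.zeros(n, dtype=np.float32)  # first-moment shard
        self.v = np.zeros(n, dtype=np.float32)  # second-moment shard

    def step(self, params, grads, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
        # Bias correction omitted for brevity; this updates only the
        # slice of params that this rank owns.
        g = grads[self.lo:self.hi]
        self.m = beta1 * self.m + (1 - beta1) * g
        self.v = beta2 * self.v + (1 - beta2) * g * g
        params[self.lo:self.hi] -= lr * self.m / (np.sqrt(self.v) + eps)
        # In a real run an all-gather would follow, so every rank sees
        # the fully updated parameters for the next forward pass.
```

The memory win is that the O(2x) Adam state is divided by the world size, while parameters and activations stay replicated.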
mpirun with multiple GPUs hangs after `allocated 474 MiB for master copy of params`, most probably due to the introduction of CUDA streams. @karpathy @PeterZhizhin
Fix for #369
Scheduling jobs using Slurm seems much easier in a multi-node training setup than setting up MPI for the cluster. This draft contains the changes to use mpirun for single-node...
Additional feature to checkpoint optimizer state and model parameters using a non-blocking background thread. Memcpy the device buffers to a pinned host buffer in one shot and let the background thread...
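A minimal Python sketch of the scheme, with a plain numpy array standing in for the pinned CUDA host buffer and a hypothetical file path; the actual change is in llm.c's C/CUDA code:

```python
# Sketch of non-blocking checkpointing: copy device buffers into one host
# staging buffer, then let a background thread write it to disk while
# training continues. numpy stands in for a pinned (cudaMallocHost)
# buffer here; the helpers and file name are hypothetical.
import threading
import numpy as np

class AsyncCheckpointer:
    def __init__(self, num_bytes):
        # One host staging buffer, reused across checkpoints. Pinned
        # memory would make the device-to-host memcpy a single fast copy.
        self.staging = np.empty(num_bytes, dtype=np.uint8)
        self.thread = None

    def save(self, device_buffers, path):
        self.wait()  # don't overwrite staging while a previous write runs
        offset = 0
        for buf in device_buffers:  # one-shot copy into the staging buffer
            raw = buf.view(np.uint8).ravel()
            self.staging[offset:offset + raw.size] = raw
            offset += raw.size
        # Background thread does the slow disk write; training resumes now.
        self.thread = threading.Thread(
            target=lambda: self.staging[:offset].tofile(path))
        self.thread.start()

    def wait(self):
        if self.thread is not None:
            self.thread.join()
            self.thread = None
```

The key design choice is reusing one staging buffer and joining the previous writer before the next checkpoint, so the main loop only pays for the memcpy, never for the disk I/O.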
When the number of processes is high, the eval dataloader goes out of bounds when processing the 10042 HellaSwag samples. Recreated the issue with debug printfs using 200 processes. Samples per proc become...
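A small reproduction of the arithmetic, assuming ceil-division sharding (the dataloader's exact split may differ):

```python
# 10042 HellaSwag samples over 200 processes with ceil division gives
# 51 samples per process, but 51 * 200 = 10200 > 10042, so the last
# few ranks index past the end of the dataset.
total, nproc = 10042, 200
per_proc = (total + nproc - 1) // nproc  # ceil(10042 / 200) = 51
for rank in range(nproc):
    start = rank * per_proc
    end = start + per_proc
    if end > total:
        print(f"rank {rank}: [{start}, {end}) overruns {total} samples")
# A safe split clamps the end of each shard: min(start + per_proc, total)
```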
Just an additional script to visualize and track metrics in realtime using wandb. This will be useful for longer training runs and multi-node training lasting many hours. **Metric graphs...
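A minimal sketch of such a script, tailing the training log and pushing metrics to wandb; the log path, line format, and project name below are assumptions, not llm.c's exact output:

```python
# Tail the training log and push metrics to wandb in realtime.
# "train.log" and the "step N ... loss X" line format are assumptions.
import re
import time
import wandb

wandb.init(project="llmc-training")  # hypothetical project name
pattern = re.compile(r"step (\d+) .* loss ([\d.]+)")

with open("train.log") as f:
    while True:
        line = f.readline()
        if not line:
            time.sleep(1.0)  # wait for the training job to append more
            continue
        m = pattern.search(line)
        if m:
            wandb.log({"train/loss": float(m.group(2))},
                      step=int(m.group(1)))
```

Keeping this as a separate tailer process means the training binary needs no wandb dependency and a logger crash cannot take down a multi-hour run.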