TextBoxes_plusplus icon indicating copy to clipboard operation
TextBoxes_plusplus copied to clipboard

Training is very slow

Open 425183525 opened this issue 7 years ago • 3 comments

Hi, my training speed is very slow. In most cases, 100 iterations will take nearly two hours.I used a single GPU(NVIDIA 1080), CUDA8.0, cudn 6.0。Tell me, what can I do to improve my training speed。 The image size is 1280x720(1600 images)

image

425183525 avatar Dec 05 '18 01:12 425183525

@MhLiao

425183525 avatar Dec 06 '18 06:12 425183525

You can check the utilization of your GPU. There is a bottleneck in the data loader when the original images are large. But 1280x720 should be OK.

MhLiao avatar Dec 25 '18 07:12 MhLiao

The original SSD repo also suffers such problems. It seems that the training speed is sensitive to GPU and drivers. See https://github.com/weiliu89/caffe/issues/691 I guess the low gpu-util is caused by the data layer. You can open the debug mode and see the time consuming of each layer.

MhLiao avatar Jan 22 '19 08:01 MhLiao