Dongwan Kim comments

Results: 15 comments by Dongwan Kim

[Bug] Synchronization issue on GPU

I've updated the code.

```python
from copy import deepcopy

import numpy as np
import torch
from ffcv import Loader
from ffcv.fields.basics import IntDecoder
from ffcv.fields.rgb_image import SimpleRGBImageDecoder
from ffcv.loader import...
```

[Bug] Synchronization issue on GPU

@andrewilyas I am not aware of the cuDNN simultaneous batch norm bug - could you link me to a reference? If it is a BN related bug, what I don't...

[Bug] Synchronization issue on GPU

@GuillaumeLeclerc, thanks for looking into this. It does seem to be a very weird bug. Please let me know if there are any ways I can help

[Bug] Synchronization issue on GPU

@ByungKwanLee the tests above were conducted on a single GPU, so I don't think SyncBN applies in this case.

[Bug] Synchronization issue on GPU

@andrewilyas @GuillaumeLeclerc Just tested the code - this seems to be fixed in the `fix-198` branch. Doesn't seem to be merged in the `v1.0.0` branch yet. Thanks for the fix!...

Bug with transforms.Resize when used with transforms.ConvertImageDtype

@vfdev-5 Thanks for the reply! As you mentioned, upsampling from 64 --> 224 is not a usual case for training, but it was part of the baseline method (not sure...

Bug with transforms.Resize when used with transforms.ConvertImageDtype

@vfdev-5 thanks for the explanation and the updates. The updated version seems to be a great performance boost (for upsampling)! Also, did not know `torchvision.io.read_image == cv2.imread != PIL read`....

Bug with transforms.Resize when used with transforms.ConvertImageDtype

I see - I'll have to keep an eye out for updates in the future 😄

Incorrect accuracy calculation with DistributedDataParallel - examples/imagenet/main.py

The issue seems to persist. You could implement your own distributed average meter or use something like [TorchMetrics](https://torchmetrics.readthedocs.io/en/stable/pages/lightning.html) which supports metrics across GPUs - I haven't used this myself but...

Improving Documentation

Willing to contribute (examples and perhaps installation?), but I have a lot of things going on rn so will need a few weeks to start working