KexinFeng comments

Results 68 comments of


                                            KexinFeng

A helping issue on how to debug my model

If NAN happens only in the middle of the training not the beginning, then it is a little troublesome to debug. In this case, I would print the gradient of...

pytorch doesn't clear out gradients between `backward()` calls in the gradient collector

Can we simply add a zero_grad() step in the PyTorch implementation of backward call @siddvenk ? This solution assumes that repetitive calling of zero_grad() is ok, considering in Trainer there...

BinaryImageClassification

Could you be more specific about what binary image classification task you are looking for? You mentioned "detect if a Image is similiar to another one", it sounds a lot...

BinaryImageClassification

@DxsSucuk You can consider using transfer learning; use pre-trained model to get vector embedding of pictures. Then it can be used for the downstream tasks like one-class classification. We are...

Bulk Batch creation

@patins1 It looks like `BulkDataIterable` is not covered by any unit test. You mentioned that using `BulkDataIterable` is more efficient. Could you add a unit test that covers this class...

Bulk Batch creation

> The problems with MxNet is fixed by using "take" operation instead of "pick". > TrainMnist executes now 4 times faster for MxNet . Could you integrate it with the...

@patins1 Thanks for dealing with this issue in such a timely manner! But it looks like the unittest that covers the new file api/src/main/java/ai/djl/training/dataset/BulkDataIterable.java is still missing. You mentioned you...

Bulk Batch creation

@siddvenk Hi Siddarth, in this pr, we have changed the definition of get(NDArray index) from pick to take, and have given the warning. This will affect the result you mentioned...

Will DJL support time series preprocess ?

@r1ckypp We have just added the support for gluonTS model inference. Check out https://github.com/deepjavalibrary/djl/pull/1903

[FEATURE] Add feature of attach_grad to nonleaf variables in HybridizedBlock.

@mxnet-bot run ci [unix-cpu, unix-gpu, windows-gpu]