Long Luong

Results 7 issues of Long Luong

* Added PReLU backward operation and kernels. * Added driver test and gtest for PReLU backward operation. * New API is guarded by MIOPEN_BETA_API macro. * Compared to ROCm pytorch:...

enhancement
external_collaborator

* Added cumulative reduction forward operation and kernel with solver, supporting binary operators (max, min, sum, prod). This operation is equivalent to cummax, cummin, cumsum, cumprod in PyTorch. * Added driver...

enhancement
external_collaborator
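
For readers unfamiliar with cumulative reductions, the semantics described in the entry above can be sketched in plain Python. This is only an illustration of the running-reduction idea over a 1-D sequence; the helper name `cumulative_reduce` and the `OPS` table are hypothetical and are not part of the MIOpen API. It also returns values only, whereas PyTorch's cummax and cummin additionally return indices.

```python
from itertools import accumulate
import operator

# Hypothetical operator table for illustration; not MIOpen's API.
OPS = {
    "max": max,
    "min": min,
    "sum": operator.add,
    "prod": operator.mul,
}


def cumulative_reduce(values, op):
    """Return the running reduction of `values` under the named binary operator."""
    return list(accumulate(values, OPS[op]))


data = [3, 1, 4, 1, 5]
for name in ("max", "min", "sum", "prod"):
    # Each output element i is the reduction of data[0..i].
    print(name, cumulative_reduce(data, name))
```

Each output element at position i is the reduction of all inputs up to and including i, which is what distinguishes a cumulative reduction from an ordinary reduction that yields a single value.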

* Added [TripletMarginLoss](https://pytorch.org/docs/stable/generated/torch.nn.TripletMarginLoss.html) forward and backward operations and kernels. * Added driver test and gtest for TripletMarginLoss forward and backward operations. * New API is guarded by MIOPEN_BETA_API macro. *...

enhancement
external_collaborator

This PR is a continuation of PR #3182. I accidentally closed the older PR and then made changes to the working branch, which means I cannot reopen the older...

enhancement
external_collaborator

* Added SmoothL1Loss forward and backward. * Added driver test and gtest for both directions of SmoothL1Loss. * New API is guarded by MIOPEN_BETA_API macro. * Compared to ROCm pytorch:...

enhancement
TESTING_CI_PASSED
external_collaborator

* Added [MatrixDiag](https://www.tensorflow.org/api_docs/python/tf/raw_ops/MatrixDiagV3), [MatrixSetDiag](https://www.tensorflow.org/api_docs/python/tf/raw_ops/MatrixSetDiagV3), [MatrixDiagPart](https://www.tensorflow.org/api_docs/python/tf/raw_ops/MatrixDiagPartV3) forward and backward. * Added driver test and gtest for both directions. * New APIs are guarded by MIOPEN_BETA_API macro. * Compared to ROCm pytorch:...

enhancement
external_collaborator

- Added basic [LogSumExp](https://pytorch.org/docs/stable/generated/torch.logsumexp.html) operation and kernel. - Added driver test and gtest for LogSumExp. - New API is guarded by MIOPEN_BETA_API macro. When comparing the newly developed MIOpen LogSumExp...

enhancement
external_collaborator