nikitaved
@pytorchbot merge
I think it used to break autograd, but that might have changed, cc @albanD.
> ```python
> csr = torch.sparse_csr_tensor((0, 1, 2), (0, 1), (1, 1), dtype=torch.float32, requires_grad=True)
> csr2 = csr.to_sparse(layout=torch.sparse_csr).detach().requires_grad_(True)
> x = torch.ones((2, 1), dtype=torch.float32)
> y = torch.matmul(csr2, x)
> ...
> ```
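For reference, here is a minimal runnable version of that snippet. The quoted code is truncated, so the `backward()` call and the gradient check below are my assumption about how it continues, not part of the original comment:

```python
import torch

# Sketch, assuming the truncated snippet went on to call backward():
csr = torch.sparse_csr_tensor((0, 1, 2), (0, 1), (1, 1),
                              dtype=torch.float32, requires_grad=True)
x = torch.ones((2, 1), dtype=torch.float32)
y = torch.matmul(csr, x)  # sparse CSR @ dense
y.sum().backward()        # raises here if autograd through CSR matmul is unsupported
print(csr.grad)           # gradient w.r.t. the CSR leaf
```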
@pytorchbot merge -g
Closing in favor of stack https://github.com/pytorch/pytorch/pull/94823.
@rgommers, if there is something available for the GPU that is missing for the CPU in PyTorch, that is most likely because GPU performance/functionality is valued much more highly... Especially...
> For example, it does not exist for a matrix `A` with `A[0,0] = 0`. With this limitation, there are very few families of matrices for which this property holds....
> I don't think the diagonal case you mentioned is correct, but it's true that there is at least one family of matrices that I know of for which LU...
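To make the existence point concrete, here is a small illustration (my own example, not from the quoted comments): a matrix with `A[0, 0] = 0` admits no LU factorization without row pivoting, which is why `torch.linalg.lu_factor` computes a pivoted factorization `P A = L U`.

```python
import torch

# A[0, 0] == 0: no factorization A = L @ U exists without pivoting, since
# the first elimination step would divide by the zero pivot.
A = torch.tensor([[0.0, 1.0],
                  [1.0, 0.0]])

LU, pivots = torch.linalg.lu_factor(A)  # succeeds because it pivots: P A = L U
print(pivots)  # non-trivial pivots show that rows were swapped
```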
I am also curious about the dtype. Do you know whether it is float, or half/bfloat16?
@t-vi , I have updated this PR with tests.