TensorKit.jl About the warning for svd AD

For the SVD AD, there is a warning:

Warning: "svd cotangents sensitive to gauge choice: (|Δgauge| = 1.2888961545189787e-16)"

Is this warning related to the autodifferentiation of the SVD for complex numbers? If we are only concerned with gauge-invariant results, this warning should not pose any issues. Is that correct?

Oct 31 '24 03:10 Confusio

Yes it comes from the AD applied to the SVD; it tells you that in the backward process, there is component entering the adjoint Jacobian of the SVD process that depends on the gauge. However, that component is only 1.28e-16 in size, so it effectively zero. Therefore, I am surprised that the warning is even printed. What did you set for the tol keyword in the primal algorithm?

Oct 31 '24 09:10 Jutho

I used the default settings without setting any tol. I am also surprised about the "warning" with the tiny value.

Oct 31 '24 09:10 Confusio

I was a bit confused; this is coming from the full svd in TensorKit, right? I had KrylovKit in mind. In TensorKit, the current cutoff for printing this warning if no defaults are specified is: eps(eltype(S))^(3 / 4) * S[1, 1] with S[1,1] the largest singular value. I guess that is not a really good default: when S[1,1] is for some reason small, it should still not be expected that the tolerance gets of the order or smaller than machine precision. I will change the default to something like eps(eltype(S))^(3 / 4) * max(S[1,1], one(S[1,1]).

Oct 31 '24 09:10 Jutho

Yes, it is from full tsvd in TensorKit.

Oct 31 '24 10:10 Confusio

Ok, this is now changed on master, hopefully this fixes it. I will tag a new release this weekend, but there are some breaking changes.

Nov 01 '24 20:11 Jutho

The new tol may cause a problem in safe_inv.(Sp' .- Sp, tol)'. Here we need a relative tol. Otherwise, it causes an incorrect result when Sp` is small.

Nov 16 '24 07:11 Confusio

I agree that tol is perhaps a bit overused in svd_rrule!, although it did seem consistent. However, I am open to better suggestions. Currently, tol is used to decide on a number of different things:

if two singular values S[i] and S[j] are closer in distance than tol, they are considered the same; in safe_inv the inverse inv(S[i]-S[j]) will be replaced with zero.
if a singular value S[i] is smaller than tol, it is assumed equal to zero and is truncated away (note that an external truncation by an other criterion may have happened also; that is compatible with the implementation; the current truncation just happens on top of that). I would argue that this is consistent with 1).
The rule for ignoring inv(S[i]-S[j]) in 1) if they are the same comes is motivated by the assumption that the associated numerator (namely the (ij) entry in the antisymmetric part of U'*ΔU and V'*ΔV') is zero. This is tested for with Δgauge. Testing for zero currently again amounts to imposing that these entries are smaller than tol` (in absolute value). I think there is also some rationale for this choice, in that if the numerator is just slightly larger than tol, but the denominator also, then an order 1 contribution is obtained.

I am happy to consider other choices or implement suggestions to improve this.

Dec 08 '24 23:12 Jutho