xmax1
xmax1
**Current Problem** Optimising intra-chain couplings for performance currently needs to be bespoke. **Proposed Solution** Implement standardised way of optimising coupling strength in chains for performance.
hey and thanks for the great work, Just a question on the grad computations (though this could be my lack of understanding on autograd complex analysis). For the forward pass...
Hi, thanks for the great work! There is an assertion error when checking the dataset, which is confusing because as far as I understand it should fail for anyone. Possibly...
What is the best way to center the moving averages? If analytically our the activation kronecker factor is given by (a - \bar{a})^T(a - \bar{a}) where a are the instantaneous...
If we are reusing weights in a linear layer can we use the same approximation to compute the covariances, or are there some subtleties? for example if weights w are...