James Martens comments

Results 15 comments of


                                            James Martens

How to center a moving average

K-FAC shouldn't use centered activation statistics. However if you wanted to do something like this in some general context, the correct thing would probably be to take a decayed average...

how to customize loss function?

Without more information I can't really help you. But if you want to add a custom loss function to K-FAC you would have to write your own class in loss_functions.py...

Is reusing linear layers the same as a convolutional layer

The type of approximation they use is the same one, essentially. However, you can't use them interchangeably in the code since convs involve special ops and also "reuse" the different...

Using K-FAC with physics-based losses

Hi, Sorry for the late reply. The K-FAC method is based on the generalized Gauss-Newton matrix (GGN), which required a convex loss (as a function of the network's output). If...

Add Support for KFAC Optimization in LSTM and GRU Layers

Yeah support for recurrent networks is something we have partially implemented internally. If there's interest I guess we could try to get this out sooner.

Add Support for KFAC Optimization in LSTM and GRU Layers

Sorry, no. Myself and others have been very busy and haven't had time. If you're interested in using a Kronecker-factored method compatible with RNNs out of the box, you could...

Does K-FAC support only float32 operations?

Float64 should work. Although maybe a casting to float32 crept in somewhere. What was the error?

KFAC in early stages query

Hi. That "14x" figure applies only to a certain architecture, and isn't meant to be universal. However, I can see from the README that the phrasing suggests otherwise, and so...

Wrong incompatible versions + worse than Adam performance.

Thanks for your persistence trying to get the code to work. Unfortunately I don't really know much about Python library versions and this code isn't really being maintained anymore. Regarding...

Wrong incompatible versions + worse than Adam performance.

I've updated the package documentation and hopefully fixed the issues with the installation.