eweine
Could you provide a reproducible example of this issue? Thanks.
Thanks! I'll try to fix the bug.
Thanks Zach! That feature would be great. My interest here has less to do with achieving the smallest loss and more to do with understanding / manipulating how different initializations...
I'd be happy to contribute to this change. It seems like initializing w myself is easy. Where does the initialization of h happen?