Max Zuo

Results 2 issues of Max Zuo

Faster implementation for calculating q_target for training the DDDQN - in your video you mention the slow speed at which it runs. With this small change, it should run significantly...

Hey! I love your work and I am tightly following your titans reproduction! I'm a grad student working on MoE/MoA and this came across my attention – do you have...