Max Zuo
Results
2
issues of
Max Zuo
Faster implementation for calculating q_target for training the DDDQN - in your video you mention the slow speed at which it runs. With this small change, it should run significantly...
Hey! I love your work and I am tightly following your titans reproduction! I'm a grad student working on MoE/MoA and this came across my attention – do you have...