Results 25 issues of Sam Lerman

First, what are EPS_START, EPS_STOP, and EPS_STEPS? If I want episodes to last until the game naturally terminates an episode, how would I modify these? Could I just set EPS_STEPS...

I'm trying to test whether the A2C code for Sonic could be used to train an agent on another environment. I replaced the Sonic environments with 8 copies of Pong,...

Do these algorithms compute n-step returns for the reward propagation? The Sonic A2C code looks like it just does 1 step returns V(S) = R(S) + V(S_next), except it's hard...

I am on a tight deadline. Could you tell me about how long ImageNet takes to train?

I'm still getting the memory error... wondering if it's because of conv layers or batch norm...

This is the last thing the system admin said: `yeah I saw seg faults as well. I think at this point we might have to throw up our hands. this...

question
rendering

It's been a while since I checked in here. From the looks of the readme, it seems like there are a lot fewer steps tp getting this running. Is the...

It looks like the blue color-coded part of Equations (1) and (2) in the main paper are not included in this implementation https://arxiv.org/pdf/2007.14430.pdf

I've seen examples such as pyBittle of sending pre-programmed gaits to the Bittle via WiFi or Bluetooth. I'm an AI researcher looking to instead send precise velocities and torques to...