Yuhua Chen

Results 2 comments of Yuhua Chen

Any news for the Graph Mode models? I tried to use the code from @pidajay. Still, as long as I passed any keywords like `variables` to the recomputed grad function,...

> If you're looking to do gradient checkpointing in graph mode I suggest the implementation tf-slim here, which I've extracted and successfully tested on tf-nightly in graph mode on TPU:...