Yuhua Chen
Results
2
comments of
Yuhua Chen
Any news for the Graph Mode models? I tried to use the code from @pidajay. Still, as long as I passed any keywords like `variables` to the recomputed grad function,...
> If you're looking to do gradient checkpointing in graph mode I suggest the implementation tf-slim here, which I've extracted and successfully tested on tf-nightly in graph mode on TPU:...