Martin Damgaard Nielsen
Results
2
comments of
Martin Damgaard Nielsen
@hannw Thx for your response. The problem is that the default backpropagation code in TensorFlow will save a copy of the concatenated weight tensor (Kernel) for each timestep (In the...
I seem to be experiencing this again even though I'm running with accelerate 0.31.0