Martin Damgaard Nielsen

2 comments by Martin Damgaard Nielsen

@hannw Thanks for your response. The problem is that the default backpropagation code in TensorFlow saves a copy of the concatenated weight tensor (the kernel) for each timestep (In the...

I seem to be experiencing this again, even though I'm running accelerate 0.31.0.