transfuser
transfuser copied to clipboard
Accessing the gradient of the transfuser model in evaluation mode
Hi,
Would please tell me which part of the code allows me to access the gradient of the transfuser model in evaluation mode ?
Thank you bunches in advance :)
If you mean gradients during deployment, than you need to look at the run_step function. Gradient computation is disabled by default to improve efficiency, you need to remove the inference_mode() and no_grad() decorators.
Thank you so much for your quick reply. Your help allowed me to resolve an issue in which I've been blocked for more than one month. I acknowledge your highly developed professional qualities :)