makcbe
makcbe
On what kind of hardware are you running this on?
The model is at about 15k with values of eval and train at 3.4 & 4.02 and is still on. Lets say the train and eval are terminated, checkpoints and...
@abisee: thank you and this is a great piece of work. @fishermanff, are you able to tell what a high loss is like? Going with instructions, I am running train...
Both, thank you for the support and that's definitely useful.