leao1995
Yes, calling `self()` invokes `forward()`, and the state argument is left at its default of `None` during training, so the state is always set to zero when training,...
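To make the behaviour described above concrete, here is a minimal sketch in plain Python. It assumes a PyTorch-style module where calling the instance dispatches to `forward()`. `TinyModule`, `size`, and the additive update are hypothetical stand-ins, not the code from this repository.

```python
class TinyModule:
    """Hypothetical stand-in for a PyTorch-style module (not the repo's class)."""

    def __init__(self, size):
        self.size = size

    def __call__(self, *args, **kwargs):
        # As with a PyTorch-style module, calling the instance runs forward().
        return self.forward(*args, **kwargs)

    def forward(self, x, state=None):
        # If the caller passes no state, fall back to an all-zero state.
        if state is None:
            state = [0.0] * self.size
        # Toy update (an assumption for illustration): add the input to the state.
        return [s + xi for s, xi in zip(state, x)]


model = TinyModule(size=3)

# Training-style calls that never pass a state: each call starts from zeros.
first = model([1.0, 2.0, 3.0])
second = model([1.0, 2.0, 3.0])

# Passing the previous state explicitly is what carries it over.
carried = model([1.0, 2.0, 3.0], state=first)
```

In this sketch `first` and `second` are identical because nothing is carried between calls; only the explicit `state=first` call builds on the previous result.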
When I run the CIFAR example, it always selects the same fixed subset of the embedding, so the issue does exist.
@nbei These are very important details to reproduce the results. Could you please share them? Thanks
Have you figured it out? I got the same error.
Have any of you figured out the reason for the low accuracy?