MetaDistil icon indicating copy to clipboard operation
MetaDistil copied to clipboard

Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".

Results 3 MetaDistil issues
Sort by recently updated
recently updated
newest added

Thanks for your excellent work. I tried to do grid search on the settings that you described in your paper and codes, but it is still hard for me to...

According to the article, in the second step, we calculate CrossEntropy loss of S' on samples from a separate quiz set, and then calculate the gradients of CE loss with...

Hi there, Could you share the teacher model on MRPC dataset? I have tried many hyper-parameters to reproduce the results but all of them failed.