Wangchunshu Zhou comments

Results 15 comments of


                                            Wangchunshu Zhou

Hyper-params settings for MNLI fine-tuning using Albert-v2

I've followed the hyper parameters presented in the paper but only got accuracy of ~83.0. Have you figured out how to get the reported result? Thanks!

About generator in adversarial training

I think it is one batch, which is a common practice in GANs.

strange behavior of reward signal

interesting. Did you figured out why?

Hard to reproduce the results of GLUE benchmark

Hi @Harry-zzh First, could you please share on which dataset you conduct your experiments? If it is some small datasets, 3% variation may indeed come from different random seeds. Otherwise,...

Hard to reproduce the results of GLUE benchmark

@Harry-zzh Can you share the exact command for your best result on the task? Also, can you share the results on the dev set of the GLUE benchmark? You can...

Hard to reproduce the results of GLUE benchmark

Hi, that was my mistake. The teacher is initialized by pretrained BERT (well-read student). But using fine-tuned teacher should be able to achieve similar performance. First I think you should...

Hard to reproduce the results of GLUE benchmark

For further questions, maybe you can send me an email with your wechat ID to [email protected] so that I can offer further guidance and help more promptly and conveniently.

News content only dataset

> Hi @KaiDMML! Thank you for making FakeNewsNet avalable. > > I'm trying to apply the dataset using only the news content, but I'm finding some problems when reproducing the...

Question about self-BLEU implementation

I also found this problem, when the test data and reference data is the same, self-bleu is always 1. However many papers in this domain use it as a diversity...

Is there anyway to bypass the proxy rate limit?

Hey I'm not getting "too many requests in one hour" message even with many access tokens and my own proxy server. I think it's an IP-based rate limit. But it...