Imposingapple
Oh, I see. Thank you very much! My results are now much closer to yours (0.1236, 0.293), but there is still an offset (about 10% larger). Have you used any other tricks...
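Thanks for the answer! What I meant is a single machine with four GPUs; it runs fine on one card. GPUs in our lab have been in short supply lately, so I haven't had a chance to run on multiple cards again. I read the usage instructions: previously my device indices did not start from 0, and I had not set the cuda_is_visible setting. I'll try it when I get the chance and see whether it works!
(Also, it seems I sent you a friend request and you haven't accepted it, haha)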
I ran into the same problem, and I have printed out all the feature map sizes. I think the problem is that the input image is not being resized. I also don't know...
The problem is caused by your fairseq version: you are using 1.0+, but this project uses 0.9.0.
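In case it helps anyone hitting the same mismatch, here is a minimal sketch for checking which fairseq version is installed before training. It only uses the standard library; the helper names `version_matches` and `installed_fairseq_version` are my own (hypothetical, not part of fairseq or this project), and the expected version string is taken from the comment above.

```python
from importlib.metadata import PackageNotFoundError, version

# Version the project is said to use (taken from the comment above).
EXPECTED_VERSION = "0.9.0"


def version_matches(installed: str, expected: str = EXPECTED_VERSION) -> bool:
    """Return True if `installed` is exactly `expected` or a sub-release of it."""
    return installed == expected or installed.startswith(expected + ".")


def installed_fairseq_version():
    """Return the installed fairseq version string, or None if it is not installed."""
    try:
        return version("fairseq")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_fairseq_version()
    if found is None:
        print("fairseq is not installed in this environment")
    elif version_matches(found):
        print(f"fairseq {found} matches the expected {EXPECTED_VERSION}")
    else:
        print(f"fairseq {found} found, but {EXPECTED_VERSION} is expected")
```

If the check reports a different version, installing the pinned release in a fresh virtual environment is probably the simplest way to avoid conflicts with other projects.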
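I saw the call in s2s_model.py, and the model has indeed already been initialized with the downloaded weights. What other reasons could explain why my results are not as good as those in your paper? Please advise, thanks!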
I downloaded the MASS parameters from your link, and at the beginning of training the perplexity is on the order of 10^6. Does it mean that the initialization function...
Here is all the terminal output after running the script 'CUDA_VISIBLE_DEVICES=0 ./train_mix_CNN_NYT_X.sh --style humor'. I haven't seen any error messages: args.distributed_init_method: None args: activation_dropout = 0.1 activation_fn = gelu adam_betas = (0.9,...
Yes, of course I did. The first screenshot in this issue is the result of running 'evaluate_mix_CNN_NYT_X.sh' for humor. The hypothesis file being evaluated is already detokenized (sentences with English...
Dear author, I'm sorry to bother you again. I still cannot figure out why there is a discrepancy between my results and the paper's. I'm sure I ran the exact...