syoya


> @kyquang97 Luong takes the last context vector and concatenates it with the last output vector as the input to the RNN. The output from the RNN will then be passed to Attention...
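For readers skimming this thread, here is a minimal PyTorch sketch of the decoder step described above; the module and tensor names are my own placeholders, not code from the repo in question.

```python
import torch
import torch.nn as nn

class LuongDecoderStep(nn.Module):
    """Sketch of one decoder step: the previous context vector is concatenated
    with the previous output embedding, fed through the RNN, and the RNN output
    is then used to attend over the encoder states."""

    def __init__(self, embed_dim, hidden_dim):
        super().__init__()
        self.rnn = nn.GRU(embed_dim + hidden_dim, hidden_dim, batch_first=True)
        self.attn_combine = nn.Linear(2 * hidden_dim, hidden_dim)

    def forward(self, prev_embed, prev_context, hidden, encoder_outputs):
        # prev_embed: (B, 1, E), prev_context: (B, 1, H), encoder_outputs: (B, T, H)
        rnn_input = torch.cat([prev_embed, prev_context], dim=-1)
        rnn_out, hidden = self.rnn(rnn_input, hidden)                  # (B, 1, H)
        scores = torch.bmm(rnn_out, encoder_outputs.transpose(1, 2))  # (B, 1, T), dot-product scores
        weights = torch.softmax(scores, dim=-1)
        context = torch.bmm(weights, encoder_outputs)                  # (B, 1, H)
        attn_out = torch.tanh(self.attn_combine(torch.cat([rnn_out, context], dim=-1)))
        return attn_out, context, hidden
```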

> > > @kyquang97 Luong takes the last context vector and concatenates it with the last output vector as the input to the RNN. The output from the RNN will then be passed...

@stgzr Thanks! I'll give it a try. Still, this seems strange to me. What difference between PyTorch 1.1.0 and 1.0.1 could lead to this NaN loss?
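Independent of the version question, one thing I could do is locate the first op that produces a non-finite value. A generic debugging sketch (my own, not specific to this repo):

```python
import torch

# Anomaly detection makes the backward pass report the operation that first
# produced a NaN/Inf, which helps narrow down whether the PyTorch version
# change is really the cause.
torch.autograd.set_detect_anomaly(True)

def assert_finite(name, tensor):
    # Call this on intermediate tensors (logits, loss, ...) during training.
    if not torch.isfinite(tensor).all():
        raise RuntimeError(f"{name} contains NaN or Inf")
```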

Sorry for the late reply. I didn't see the notifications. I couldn't find the original paper where this kind of analysis was raised, but I hope this one ([Why Regularized...

Interesting. I thought sparsity on the representation meant the same thing as sparsity on the parameters. I'll try to figure it out. Sorry, I'm quite busy these days.
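In case it helps others with the same confusion, here is a toy PyTorch illustration of the difference as I understand it (my own sketch, not taken from the linked paper):

```python
import torch
import torch.nn as nn

# Sparsity on the representation penalizes the activations, while sparsity on
# the parameters penalizes the weights themselves.
model = nn.Linear(16, 8)
x = torch.randn(4, 16)
hidden = torch.relu(model(x))

repr_sparsity = hidden.abs().mean()                              # L1 penalty on activations
param_sparsity = sum(p.abs().sum() for p in model.parameters())  # L1 penalty on weights
```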

> Added. According to issue #4, the code refactor changed the randomness of the code. Please try different seeds if needed. [config](https://github.com/TencentYoutuResearch/Classification-SemiCLS/blob/main/configs/ccssl/fixmatchccssl_exp512_cifar10_wres_x2_b1x64_l4000_soft.py) I've tried several different seeds and finally...
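For completeness, this is the kind of seeding I mean when I say I tried different seeds; the helper below is a generic sketch, not the repo's own seeding code:

```python
import random
import numpy as np
import torch

def set_seed(seed: int) -> None:
    # Hypothetical helper: fix the common sources of randomness before
    # re-running with a different --seed value.
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
```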

Thanks for your reply! Here is what I use for training: **Training Command** `srun -p caif_dev --ntasks=1 --ntasks-per-node=1 --gres=gpu:1 --cpus-per-task=20 python train_semi.py --cfg configs/comatch/comatch_stl10_wres_r18_b1x64_l5.py --out workdirs/comatch_stl10_wres_r18_b1x64_l5 --seed 1 --gpu-id 0`...

Hi @KaiWU5, here is my training log under the torch1.6 environment, and it seems that the results differ a lot from those under torch1.1x. I didn't complete the whole training process. Would you...

Yeah, I did almost nothing but change the torch version and GPU type (I'm now using a 3090 Ti, which doesn't support torch1.6_cuda10.1, so I rented a TITAN XP for the experiment)....

I will keep this issue open to see whether any conclusions can be reached. Thanks as well for your cooperation in tracking down the cause!