Targeted-Dropout
Complementary code for the Targeted Dropout paper
The hparams for ResNet-32 seem to define an input layer with 16 filters followed by stacks of layers with 32, 64, and then 128 filters: https://github.com/for-ai/TD/blob/master/hparams/resnet.py#L12 This doesn't match the...
Thank you for your work. What's the difference between unit dropout and weight dropout? How are they implemented? Looking forward to your reply.
Hello, thank you for your work. I want to know why you consider unit pruning and weight pruning under the L1 norm and L2 norm. What do the L1 norm and L2 norm do in your...