Haram Choi
I found that the #params reported in the paper excludes the "relative_position_bias_table" parameters. These parameters should be included in #params. However, this code, `print(f"{sum(map(lambda x: x.numel(), model.parameters())):,d}")`, produces the correct #params, which includes "relative_position_bias_table". I...
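To make the discrepancy concrete, here is a minimal sketch of counting parameters with and without entries whose name contains "relative_position_bias_table". The helper `count_params`, the stand-in `fake_param`, and the toy sizes are all hypothetical and chosen only for illustration; with a real PyTorch model you would presumably pass `model.named_parameters()` instead of the toy list.

```python
from types import SimpleNamespace


def count_params(named_params, exclude=()):
    """Sum numel() over (name, param) pairs, skipping any name
    that contains one of the substrings in `exclude`."""
    return sum(
        p.numel()
        for name, p in named_params
        if not any(s in name for s in exclude)
    )


def fake_param(n):
    # Hypothetical stand-in for a tensor: only provides numel().
    return SimpleNamespace(numel=lambda: n)


# Toy (name, param) pairs; the sizes are illustrative, not from any real model.
toy_named_params = [
    ("attn.qkv.weight", fake_param(108)),
    ("attn.relative_position_bias_table", fake_param(45)),
]

total = count_params(toy_named_params)
without_bias_table = count_params(
    toy_named_params, exclude=("relative_position_bias_table",)
)
print(f"{total:,d} total, {without_bias_table:,d} without the bias table")
```

The gap between the two numbers is exactly the size of the excluded table, which is the kind of difference described above between the paper's figure and the code's output.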
> I also noticed this problem and it brought me some trouble. Strictly speaking, MACs, i.e., Multi_Adds, should be equal to FLOPs/2. So it actually calculates MACs, and the method name...
> > > I also noticed this problem and it brought me some trouble. Strictly speaking, MACs, i.e., Multi_Adds, should be equal to FLOPs/2. So it actually calculates MACs, and the...
> > > > > I also noticed this problem and it brought me some trouble. Strictly speaking, MACs, i.e., Multi_Adds, should be equal to FLOPs/2. So it actually calculates MACs,...
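As a small worked example of the MACs = FLOPs/2 relationship mentioned in the quoted comment, the sketch below counts multiply-accumulates for a single bias-free linear layer and converts them to FLOPs. It assumes the common convention that one MAC is one multiply plus one add (two FLOPs); the function names and layer sizes are hypothetical.

```python
def linear_macs(in_features, out_features, tokens=1):
    """MACs for a bias-free linear layer applied to `tokens` input vectors:
    each output element needs `in_features` multiply-accumulates."""
    return tokens * in_features * out_features


def macs_to_flops(macs):
    # Assumed convention: 1 MAC = 1 multiply + 1 add = 2 FLOPs.
    return 2 * macs


# Illustrative sizes only.
macs = linear_macs(in_features=64, out_features=64, tokens=16)
flops = macs_to_flops(macs)
print(f"MACs: {macs:,d}  FLOPs: {flops:,d}")
```

Under this convention, a tool that reports the first number under the name "FLOPs" is off by a factor of two, which matches the naming concern raised in the comment.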