kossi
Results
1
issues of
kossi
# Exponential activation of the forget gate f_t = torch.sigmoid(f_tilda) # (batch_size, hidden_size) Why was the exponential function not chosen as the activation function for the forget gate?