Daniel Watson
Results
1
comments of
Daniel Watson
+1 because users may have use cases where the decoder's outputs are needed without being passed through the final feedforward layer. Example: the decoder uses scheduled sampling, so the dense...