Daniel Watson

Results 1 comments of Daniel Watson

+1 because users may have use cases where the decoder's outputs are needed without being passed through the final feedforward layer. Example: the decoder uses scheduled sampling, so the dense...