pytorch-seq2seq
An open source framework for seq2seq models in PyTorch.
Hi, I don't understand why teacher forcing is applied per whole sequence. The definition of teacher forcing states that at each timestep, a predicted or the...
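For reference, a minimal sketch of what per-timestep teacher forcing could look like (this is not the repo's DecoderRNN; `decoder_cell`, `embedding`, and `out_proj` are hypothetical stand-ins for a single decoder step):

```
import random
import torch

# Hypothetical per-timestep teacher forcing; decoder_cell, embedding and
# out_proj are stand-ins for one decoder step, not this repo's classes.
def decode(decoder_cell, embedding, out_proj, hidden, targets, sos_idx, tf_ratio=0.5):
    batch_size, max_len = targets.size()
    inp = targets.new_full((batch_size,), sos_idx)      # start every sequence with <sos>
    all_logits = []
    for t in range(max_len):
        hidden = decoder_cell(embedding(inp), hidden)   # one RNN step
        logits = out_proj(hidden)                       # (batch, vocab)
        all_logits.append(logits)
        if random.random() < tf_ratio:
            inp = targets[:, t]                         # teacher forcing: feed ground truth
        else:
            inp = logits.argmax(dim=-1)                 # free running: feed own prediction
    return torch.stack(all_logits, dim=1), hidden       # (batch, max_len, vocab)
```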
```
/pytorch-seq2seq/seq2seq/models/EncoderRNN.py", line 68, in forward
    embedded = self.embedding(input_var)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py", line 479, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/modules/sparse.py", line 113, in forward
    self.norm_type, self.scale_grad_by_freq, self.sparse)
  File "/usr/local/lib/python3.6/dist-packages/torch/nn/functional.py",...
```
Compared to OpenNMT, why do we need this [block](https://github.com/IBM/pytorch-seq2seq/blob/master/seq2seq/models/TopKDecoder.py#L257), which handles sequences that are dropped after seeing EOS early? (There is no equivalent in OpenNMT's beam search implementation.) They are also...
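For context, a common way a beam search deals with hypotheses that emit EOS early is simply to freeze their scores so they stop competing for new tokens; a hedged sketch (not TopKDecoder's actual code):

```
import torch

# Once a beam has emitted EOS, force it to keep "predicting" EOS with
# probability 1 so its accumulated score stays frozen.
def mask_finished(step_log_probs, finished, eos_idx):
    # step_log_probs: (batch * k, vocab) log-probabilities for the current step
    # finished:       (batch * k,) bool mask of beams that already emitted EOS
    masked = step_log_probs.clone()
    masked[finished] = float('-inf')     # no ordinary token can extend a finished beam
    masked[finished, eos_idx] = 0.0      # log(1): repeating EOS adds nothing to the score
    return masked
```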
Hi, it seems that Perplexity is normalized twice, and the norm_term of NLLLoss should be masked out as well.
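To illustrate the claim, a minimal sketch of a masked perplexity where padding is excluded from both the summed loss and the normalization term, with the exponential applied exactly once (this is not the library's Perplexity class):

```
import torch
import torch.nn.functional as F

def perplexity(logits, targets, pad_idx):
    # logits: (batch, seq_len, vocab), targets: (batch, seq_len)
    log_probs = F.log_softmax(logits, dim=-1)
    nll_sum = F.nll_loss(log_probs.view(-1, log_probs.size(-1)),
                         targets.view(-1),
                         ignore_index=pad_idx,   # padded positions contribute no loss
                         reduction='sum')
    num_tokens = (targets != pad_idx).sum()      # normalize by real tokens only
    return torch.exp(nll_sum / num_tokens)
```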
Can somebody tell me what type of attention is used in this library? I checked it against the Bahdanau and Luong attentions and it doesn't look like either, or maybe...
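For comparison, the two standard scoring functions look roughly like this (an illustrative sketch to check the library's Attention module against, with assumed shapes; not the module itself):

```
import torch
import torch.nn as nn

# Shapes assumed: decoder output (batch, 1, dim), encoder outputs (batch, src_len, dim).
def luong_dot_score(dec_out, enc_outs):
    # Luong "dot": a plain batched dot product between decoder and encoder states.
    return torch.bmm(dec_out, enc_outs.transpose(1, 2))          # (batch, 1, src_len)

class BahdanauScore(nn.Module):
    # Bahdanau "additive": project both states, add, tanh, then a learned vector.
    def __init__(self, dim):
        super().__init__()
        self.W = nn.Linear(dim, dim, bias=False)
        self.U = nn.Linear(dim, dim, bias=False)
        self.v = nn.Linear(dim, 1, bias=False)

    def forward(self, dec_out, enc_outs):
        # dec_out broadcasts over the source-length dimension of enc_outs
        return self.v(torch.tanh(self.W(dec_out) + self.U(enc_outs))).transpose(1, 2)
```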
Hi, I'm using this framework on my dataset. Everything works fine on CPU, but when I moved it to GPU I got the following error: `File "/home/ibm_decoder/DecoderRNN.py", line 107,...
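The usual cause of this class of error is a model whose parameters live on the GPU receiving an index tensor that is still on the CPU; a minimal sketch of the typical fix, assuming that is what is happening here:

```
import torch
import torch.nn as nn

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

embedding = nn.Embedding(100, 16).to(device)
input_var = torch.randint(0, 100, (4, 10))     # created on the CPU by default

# embedding(input_var) would fail on CUDA with a CPU/CUDA backend mismatch;
# move the indices to the model's device first.
output = embedding(input_var.to(device))
```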
Hi, I wonder whether rnn.forward_step changes the order of the (batch_size * self.k) dimension? With the code that initializes sequence_scores:  and at each step:  it seems that sequence_scores is updated...
fix bug #185 #161 Expected object of backend CUDA but got backend CPU for argument #3 'index' when running `example.py`
https://github.com/IBM/pytorch-seq2seq/blob/f146087a9a271e9b50f46561e090324764b081fb/seq2seq/models/DecoderRNN.py#L105 I think .view(batch_size, output_size, -1) should be .view(batch_size, -1, output_size); otherwise this line makes no sense.
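Whether the existing line is correct depends on what output_size holds at that point, but the underlying issue is that view() only reinterprets the flat buffer and never permutes it, so the argument order decides which axis ends up holding the vocabulary scores. A small self-contained sketch:

```
import torch

batch_size, seq_len, vocab = 2, 3, 5
flat = torch.arange(batch_size * seq_len * vocab).view(batch_size * seq_len, vocab)

as_vocab_last  = flat.view(batch_size, -1, vocab)     # (batch, seq_len, vocab)
as_vocab_first = flat.view(batch_size, vocab, -1)     # (batch, vocab, seq_len)

print(torch.equal(as_vocab_last[0, 0], flat[0]))      # True: each row of scores stays intact
print(torch.equal(as_vocab_first[0, :, 0], flat[0]))  # False: scores are scrambled across steps
```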
https://github.com/IBM/pytorch-seq2seq/blob/f146087a9a271e9b50f46561e090324764b081fb/seq2seq/models/TopKDecoder.py#L83 I think teacher_forcing should not be present in beam decoding, since ground-truth tokens are not known during inference.
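For reference, one step of a plain beam search only ever feeds back the decoder's own top-k predictions; a minimal sketch (not TopKDecoder itself):

```
import torch

def beam_step(step_log_probs, beam_scores, k):
    # step_log_probs: (batch, k, vocab); beam_scores: (batch, k)
    vocab = step_log_probs.size(-1)
    total = beam_scores.unsqueeze(-1) + step_log_probs      # accumulate hypothesis scores
    total = total.view(total.size(0), -1)                   # (batch, k * vocab)
    beam_scores, flat_idx = total.topk(k, dim=-1)
    prev_beam = flat_idx // vocab                            # which hypothesis to extend
    next_tokens = flat_idx % vocab                           # tokens fed to the next step
    return beam_scores, prev_beam, next_tokens
```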