BiDAF issues

Running on multiple GPUs

Thanks for the code! I am trying to run the training on 8 GPUs but it seems that the code uses only one of them. Is it required to make...

Khaghshenas

https://github.com/jojonki/BiDAF/blob/3e5ac9c76d02de2d8f75b1eda6632f8a9432eba6/layers/char_embedding.py#L28 I feel strange to see this code. Why do you sum over `word_len` dimension? Why don't you apply 1D filter over `word_len` dimension? Thank you.

KazutoshiShinoda

Very small gradients causing no weight update from your model

31

Thanks for your code. It helps me understand the BiDAF in details. However, I found the model had no performance increasing. Every epoch, the metric is always the same. And...

oneTaken

cuda out of memory

6

Thanks for your code, it helps me a lot. And I try to write on my own but I meet some questions. When I rewrite the loss function as follows:...

wlhgtc

Core dumped during training

I followed the steps in the guide, downloaded and unzipped the glove as guided, and run prepro, that is no problem. But when training, I got "core dumped" immediately when...

xz-keg

BiDAF
BiDAF copied to clipboard

Metadata

Running on multiple GPUs

Character Level Embedding

Very small gradients causing no weight update from your model

cuda out of memory

Core dumped during training

← Metadata

Owner

Metadata

BiDAF BiDAF copied to clipboard

Metadata

Running on multiple GPUs

Character Level Embedding

Very small gradients causing no weight update from your model

cuda out of memory

Core dumped during training

← Metadata

Owner

Metadata

BiDAF
BiDAF copied to clipboard