BiDAF
BiDAF copied to clipboard
Bidirectional Attention Flow for Machine Comprehension, Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi https://arxiv.org/abs/1611.01603
Thanks for the code! I am trying to run the training on 8 GPUs but it seems that the code uses only one of them. Is it required to make...
https://github.com/jojonki/BiDAF/blob/3e5ac9c76d02de2d8f75b1eda6632f8a9432eba6/layers/char_embedding.py#L28 I feel strange to see this code. Why do you sum over `word_len` dimension? Why don't you apply 1D filter over `word_len` dimension? Thank you.
Thanks for your code. It helps me understand the BiDAF in details. However, I found the model had no performance increasing. Every epoch, the metric is always the same. And...
Thanks for your code, it helps me a lot. And I try to write on my own but I meet some questions. When I rewrite the loss function as follows:...
I followed the steps in the guide, downloaded and unzipped the glove as guided, and run prepro, that is no problem. But when training, I got "core dumped" immediately when...