Language-Modeling-GatedCNN
Language-Modeling-GatedCNN copied to clipboard
Tensorflow implementation of "Language Modeling with Gated Convolutional Networks"
I have used the code to run in Python notebook. Tensorflow 1.12. I love this piece of implementation - straightforward step-by-step logic of CNN. It is so much more transparent...
Lets say a sentence in the data set is (1,2,3,4) Then prepare_data function will create: X = (1,2,3) Y = (2,3,4) While predicting 2 and 3 your model can copy...
I train the model with the provided dataset and configuration. And the initial loss is huge, e.g. 8~9e+10. I think there is something wrong with my training. Could you please...
Hi, anantzoid, Could you please provide some more information about convergence of the model? e.g.: How long will the training procedure take? What device do you use? How about the...
It takes 50 seconds to handle one batch (do forward / backwards pass) for the model with default settings. I use Titan X. Is it ok, if not, what may...
can you paste the result. tks very much.
I noted that `h, res_input = embed, embed` and `fanin_depth = h.get_shape()[-1] ` is the dimension of convolution output same with the embeddings size ?? why ?