OOV solution
Hi, I am a beginner in NMT. I have a question: how do I solve the OOV (out-of-vocabulary) problem in translation? Can I add a large dictionary? If so, how do I add a dictionary in NMT? Looking forward to your advice or answers. Best regards, and thank you very much!
Reduce the batch size, the number of hidden layers, the hidden state size, etc.
@yapingzhao First, you should use a better word segmentation tool. Then use the output summary to find the alignment, and replace each unk word based on that alignment.
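To illustrate the alignment-based replacement described above, here is a minimal sketch in plain Python. It assumes you can get an attention matrix out of your model (one row per target token, one weight per source token). The names `replace_unk`, `attention`, and `dictionary` are hypothetical and do not come from any particular NMT toolkit. Each unk in the output is replaced by the dictionary translation of its most-attended source word, or by the source word itself if the dictionary has no entry.

```python
from typing import Dict, List, Optional

UNK = "<unk>"  # assumed unknown-word symbol; adjust to your vocabulary


def replace_unk(
    source_tokens: List[str],
    target_tokens: List[str],
    attention: List[List[float]],
    dictionary: Optional[Dict[str, str]] = None,
) -> List[str]:
    """Replace each UNK in the output using the attention alignment.

    attention[t][s] is the weight that target position t puts on
    source position s (hypothetical layout, assumed for this sketch).
    """
    dictionary = dictionary or {}
    result = []
    for t, token in enumerate(target_tokens):
        if token != UNK:
            result.append(token)
            continue
        weights = attention[t]
        # Pick the source position with the highest attention weight.
        src_idx = max(range(len(source_tokens)), key=lambda s: weights[s])
        src_word = source_tokens[src_idx]
        # Use the dictionary translation if there is one, else copy the word.
        result.append(dictionary.get(src_word, src_word))
    return result


source = ["ich", "wohne", "in", "Quedlinburg"]
target = ["i", "<unk>", "in", "<unk>"]
attention = [
    [0.9, 0.05, 0.03, 0.02],
    [0.1, 0.8, 0.05, 0.05],
    [0.05, 0.05, 0.8, 0.1],
    [0.02, 0.03, 0.05, 0.9],
]
fixed = replace_unk(source, target, attention, {"wohne": "live"})
```

This is also one place where a large external dictionary can be plugged in: it is consulted only at post-processing time, so the model's vocabulary stays unchanged.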
A precise word segmentation tool definitely helps reduce the vocabulary size. An alternative is to try CopyNet.
@hpulfc, thanks!
I found a solution to this question: CopyNet. The code is here.
But I wonder: its vocabulary size is fixed, and there is no array or list to store the OOV words, so how does the code solve the OOV problem?
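For what it's worth, many copy-style implementations handle this with a temporary, per-sentence "extended vocabulary" rather than a global OOV list. I have not checked whether the linked code does it this way, so treat the following as a sketch of the general idea only. The function names are hypothetical, and it assumes the fixed vocabulary uses contiguous ids starting at 0.

```python
from typing import Dict, List, Tuple


def encode_source_extended(
    source_tokens: List[str], vocab: Dict[str, int]
) -> Tuple[List[int], List[str]]:
    """Map source tokens to ids, giving each OOV word a temporary id.

    Temporary ids start at len(vocab) and are valid only for this
    one sentence; the fixed vocabulary itself is never modified.
    """
    ids: List[int] = []
    oov_words: List[str] = []
    for tok in source_tokens:
        if tok in vocab:
            ids.append(vocab[tok])
        else:
            if tok not in oov_words:
                oov_words.append(tok)
            ids.append(len(vocab) + oov_words.index(tok))
    return ids, oov_words


def decode_extended(
    output_ids: List[int], id_to_word: List[str], oov_words: List[str]
) -> List[str]:
    """Turn output ids back into words, resolving temporary OOV ids."""
    words = []
    for i in output_ids:
        if i < len(id_to_word):
            words.append(id_to_word[i])  # ordinary in-vocabulary word
        else:
            words.append(oov_words[i - len(id_to_word)])  # copied source word
    return words


id_to_word = ["<unk>", "i", "live", "in"]
vocab = {w: i for i, w in enumerate(id_to_word)}
src_ids, oovs = encode_source_extended(["i", "live", "in", "Quedlinburg"], vocab)
decoded = decode_extended([1, 2, 3, 4], id_to_word, oovs)
```

In this scheme the copy part of the model scores source positions, so an OOV word can be produced by pointing at where it appears in the input. The list of OOV words exists only per sentence (or per batch), which may be why no global list shows up in the code.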
Have you figured out how CopyNet works in NMT?