nmt icon indicating copy to clipboard operation
nmt copied to clipboard

OOV solution

Open yapingzhao opened this issue 7 years ago • 6 comments

Hi, I am a beginner of nmt.I have a question: How to solve the problem of OOV in translation translation? Can I add a large dictionary? How to add a dictionary in nmt? Looking forward to your advice or answers. Best regards, Thank you very much!

yapingzhao avatar Jul 06 '18 10:07 yapingzhao

reduce batch size, hidden layer number, hidden state number, etc.

thormacy avatar Jul 10 '18 02:07 thormacy

@yapingzhao first , you should have a better Word segmentation tools. then use the output summary to find the alignment , the replace unk word

hpulfc avatar Jul 24 '18 07:07 hpulfc

A precise Word segmentation tool is definitely helpful to reduce the vocabulary size. Alternative ways: try copy net

thormacy avatar Jul 24 '18 07:07 thormacy

@hpulfc ,thanks!

yapingzhao avatar Sep 06 '18 08:09 yapingzhao

I find the solution about this question ---- CopyNet. The code is here.

But I wonder that, its vocabulary size is fixed, and there's no array or no list to store the oov words, so how does the code solve the oov problem?

VieZhong avatar Nov 13 '18 06:11 VieZhong

I find the solution about this question ---- CopyNet. The code is here.

But I wonder that, its vocabulary size is fixed, and there's no array or no list to store the oov words, so how does the code solve the oov problem?

have you understand the copynet in nmt?

paulpaul91 avatar Mar 27 '19 01:03 paulpaul91