AyeshaSarwar
AyeshaSarwar
I am running the same code on python 2.7, Windows 10 I have no hidden files and all the encoding is in UTF-8 Still getting this error UnicodeDecodeError: 'utf8' codec...
What is the maximum sequence length BART can handle? Do you have information on this?
I am getting this error: "index out of range in self"
I have the same question.
Every @highlight means a separate summary? For our custom data, should we put @highlight for every line in our summary? I am a little confused. Please if you could help...
I have the same Questions. @nlpyang Can you kindly help us in this regard? If we want to summarize a document with 3000 token, we just need to set max-pos...
No, but the documentation says that if you have a longer input such as 800 so we can use max-pos = 800. And thus my Q is if I pass...
I am also facing the same issue. There are only two files in the results folder containing .candidate and .gold file. and candidate file contains the same text from the...
@nlpyang Kindly help in this regard. If we want to consider the whole document with its full length, do we just need to replace 512 everywhere with our desired number?...
Updates: For encoding a text longer than 512 tokens, for example 800. Set max_pos to 800 during both preprocessing and training. So, lets say I want to consider documents with...