SQLNet icon indicating copy to clipboard operation
SQLNet copied to clipboard

Issue in running python extract_vocab.py

Open DevalNaik opened this issue 6 years ago • 3 comments

Error while loading word embedding glove

Logs: Loading from original dataset Loading data from data/train_tok.jsonl Loading data from data/train_tok.tables.jsonl Loading data from data/dev_tok.jsonl Loading data from data/dev_tok.tables.jsonl Loading data from data/test_tok.jsonl Loading data from data/test_tok.tables.jsonl Loading word embedding from glove/glove.42B.300d.txt Traceback (most recent call last): File "extract_vocab.py", line 23, in use_small=USE_SMALL) File "C:\Users\SQLNet\sqlnet\utils.py ", line 274, in load_word_emb for idx, line in enumerate(inf): File "C:\Users\miniconda3\lib\encodings\cp1252.py", line 23, in dec ode return codecs.charmap_decode(input,self.errors,decoding_table)[0] UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 2438: cha racter maps to

DevalNaik avatar Mar 07 '19 08:03 DevalNaik

Error while loading word embedding glove

Logs: Loading from original dataset Loading data from data/train_tok.jsonl Loading data from data/train_tok.tables.jsonl Loading data from data/dev_tok.jsonl Loading data from data/dev_tok.tables.jsonl Loading data from data/test_tok.jsonl Loading data from data/test_tok.tables.jsonl Loading word embedding from glove/glove.42B.300d.txt Traceback (most recent call last): File "extract_vocab.py", line 23, in use_small=USE_SMALL) File "C:\Users\SQLNet\sqlnet\utils.py ", line 274, in load_word_emb for idx, line in enumerate(inf): File "C:\Users\miniconda3\lib\encodings\cp1252.py", line 23, in dec ode return codecs.charmap_decode(input,self.errors,decoding_table)[0] UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 2438: cha racter maps to

Execution is started with following changes in utils.py at row#273 with open(file_name,encoding="utf8") as inf:

DevalNaik avatar Mar 07 '19 09:03 DevalNaik

Check your folder structure for data, Is your train_tok.jsonl under data folder or data/data/train_tok.jsonl?

siddharthchauhan avatar Feb 23 '20 09:02 siddharthchauhan

thanks for editing @DevalNaik

shambhaviparashar avatar Jun 25 '20 17:06 shambhaviparashar