Josoope
Josoope
File "/root/autodl-tmp/wordseg/BERT-BiLSTM-CRF/data_loader.py", line 95, in collate_fn batch_labels[j][:cur_tags_len] = labels[j] ValueError: could not broadcast input array from shape (188,) into shape (177,)
Traceback (most recent call last): File "helpData.py", line 383, in dataHelper.process_data() File "helpData.py", line 370, in process_data self.get_sens_and_tags_and_entsRel(self.origin_train_data, case=0) File "helpData.py", line 348, in get_sens_and_tags_and_entsRel ent_rel = np.array(ent_rel) ValueError: setting...
求一份分词数据集
作者代码中分词数据集没有mid_data,谁能提供一个完整的数据集?如果时医疗领域的话就更好了,谢谢
model = KeyedVectors.load_word2vec_format('./dict/Medical.txt', binary=False) sim = model.wv.most_similar('海马', topn = 10) print(sim) 报错信息: return _load_word2vec_format( File "/root/miniconda3/envs/bert-ch/lib/python3.8/site-packages/gensim/models/keyedvectors.py", line 2069, in _load_word2vec_format _word2vec_read_text(fin, kv, counts, vocab_size, vector_size, datatype, unicode_errors, encoding) File "/root/miniconda3/envs/bert-ch/lib/python3.8/site-packages/gensim/models/keyedvectors.py",...
通过bert训练后字向量维度是768吗?
用textcnn,显示没有n_gram_vocab
网上介绍数据集训练集、验证集、测试集有50610、6337、6337,我统计的是正样本,只有50570、6321、6330。
运行了python train_bert_gcn.py --dataset [dataset] --pretrained_bert_ckpt [pretrained_bert_ckpt] -m [m],有很多轮的测试结果,看最后一轮的结果吗?运行后会保存checkpoint文件,这个文件貌似没用上
可以用自己的医疗数据集构建知识图谱吗?要做哪些处理呢