IT-coach
@ZhenYangIACAS @JiqiangLiu Example command line (for unsupervised en2zh training, you need to pass the command-line argument --unsupervised_vocab 8000 to get reasonably good results): python map_embeddings.py --unsupervised --unsupervised_vocab 8000 ./jy_data/model_en.vec ./jy_data/model_zh_j.vec ./jy_data/model_en_mapped2.vec ./jy_data/model_zh_j_mapped2.vec --cuda
> Hi, thanks for your interest in our work! We use the _whole_word_mask function to mask tokens, which will not mask the CLS token. You can refer to https://github.com/huggingface/transformers/blob/v4.34.1/src/transformers/data/data_collator.py#L845. Hello,...
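To illustrate the point in the quoted reply (whole-word masking does not mask the CLS token), here is a minimal pure-Python sketch. It is not the Hugging Face `_whole_word_mask` code; the function name `whole_word_mask`, the `SPECIAL_TOKENS` set, the `seed` parameter, and the budget rule are assumptions made for illustration. It assumes WordPiece-style tokens where continuation pieces start with `##`.

```python
import random

# Assumption: these are the special tokens that must never be masked.
SPECIAL_TOKENS = {"[CLS]", "[SEP]"}


def whole_word_mask(tokens, mlm_probability=0.15, seed=None):
    """Return a 0/1 list marking which token positions to mask.

    Illustrative sketch only: whole words are masked together, and
    special tokens are never candidates.
    """
    rng = random.Random(seed)

    # Group token indices into whole words, skipping special tokens.
    words = []
    for i, tok in enumerate(tokens):
        if tok in SPECIAL_TOKENS:
            continue
        if words and tok.startswith("##"):
            words[-1].append(i)  # continuation piece joins the previous word
        else:
            words.append([i])  # start of a new word

    rng.shuffle(words)

    # Hypothetical budget rule: mask roughly mlm_probability of all positions.
    budget = max(1, round(len(tokens) * mlm_probability))

    masked = set()
    for word in words:
        if len(masked) >= budget:
            break
        if len(masked) + len(word) > budget:
            continue  # this word would exceed the budget, try a shorter one
        masked.update(word)

    return [1 if i in masked else 0 for i in range(len(tokens))]


tokens = ["[CLS]", "un", "##believ", "##able", "results", "[SEP]"]
mask = whole_word_mask(tokens, mlm_probability=0.5, seed=0)
print(list(zip(tokens, mask)))
```

Because `[CLS]` and `[SEP]` are skipped when the candidate words are built, their positions can never end up in the mask, whatever the random order is.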