Ouyanmei
Results
4
comments of
Ouyanmei
> 论文里面说使用带错误的训练数据预训练phonetic Encoder, 但代码里面好像是用的纠正后的数据,不知道我有没有理解错,恳请解惑  我也有这个疑问,你好,请问解决了吗
Due to the vocabulary size of the GPT-2 124M model being 50257, resizing the model's embedding layer dimensions may result in new embeddings that exceed the original vocabulary range. This...