ALBERT-Pytorch
ALBERT-Pytorch copied to clipboard
how to train on custom dataset ?
hey there, sadly i dont clearly understand how the wiki.train.tokens and vocab.txt file is produced. followingly could you please show me the steps on how to reproduce the same train.tokens and vocab.txt file on a custom dataset ? (i am trying to implement this on a different language)