easytokenizer
easytokenizer copied to clipboard
Is the tokenzier's output exactly the same as BertTokenizer?
I have verified the tokenize result on 180000 Chinese sentences. It is exactly the same as BertTokenizer.