Omer Arshad
Getting this error while loading a language model:
```py
RuntimeError: Error(s) in loading state_dict for LanguageModelingTransformer:
    Missing key(s) in state_dict: "model.lm_head.weight".
```
Any leads?
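In case it helps narrow this down, here is a rough sketch of how the key mismatch could be inspected before loading. The helper name `diff_state_dict_keys` and the key lists are made up for illustration (only `model.lm_head.weight` comes from the error above), and it works on plain lists of names so it runs without any framework installed.

```python
def diff_state_dict_keys(model_keys, checkpoint_keys):
    """Return (missing, unexpected) key lists.

    missing:    keys the model expects but the checkpoint lacks
    unexpected: keys the checkpoint has but the model does not expect
    """
    model_keys = list(model_keys)
    checkpoint_keys = list(checkpoint_keys)
    model_set = set(model_keys)
    checkpoint_set = set(checkpoint_keys)
    missing = [k for k in model_keys if k not in checkpoint_set]
    unexpected = [k for k in checkpoint_keys if k not in model_set]
    return missing, unexpected


# Hypothetical key names, for illustration only.
model_keys = ["model.embeddings.weight", "model.lm_head.weight"]
checkpoint_keys = ["model.embeddings.weight"]

missing, unexpected = diff_state_dict_keys(model_keys, checkpoint_keys)
print("missing:", missing)
print("unexpected:", unexpected)
```

With a real PyTorch model, the two inputs would be `model.state_dict().keys()` and the keys of the loaded checkpoint dict. Calling `model.load_state_dict(state, strict=False)` skips missing keys instead of raising, but the skipped weights keep their initial values, so whether that is acceptable here depends on whether the checkpoint was ever meant to contain the LM head.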
Any workaround for this?
Well, in my experiments attention-only models achieve results comparable to LSTMs, and even outperformed them with much less training time.
Yes, my model's structure is attention + CRF only.
Did you find a solution to this? I am also facing the same issue.
Oops, I was talking about "https://tfhub.dev/google/universal-sentence-encoder-large/2", i.e. the transformer-based one.
Great. So what is the best way to fine-tune the trainable transformer-based USE?
Can you please guide me on where to use the above-mentioned code? Below is a snippet taken from the example; X is the first input and X2 is the second input: `with tf.Session(graph=graph)...`
Yes, this happens. Please fix it.
At that time, they were available on the fastText website.