Omer Arshad

Results 11 comments of Omer Arshad

getting this error while loading a language model ```py RuntimeError: Error(s) in loading state_dict for LanguageModelingTransformer: Missing key(s) in state_dict: "model.lm_head.weight". ``` any leads?

any workaround for this?

well in my experiments attention only models achieve comparable results to LSTM, even got better than LSTM with very less training time

yes, structure of my model is attention+crf only

Did you find a solution to this? I am also facing same issue

oops i was talking about "https://tfhub.dev/google/universal-sentence-encoder-large/2" i.e. transformer based

Great, So what is the best way to fine tune trainable transformer based USE?

Can you please guide me where to use above mentioned code ? Below is sniped taken from example , X is first input and X2 is second input with tf.Session(graph=graph)...

At that time, they were available on fasttext website