fast-bert
fast-bert copied to clipboard
Super easy library for BERT based NLP models
`learner.fit(epochs=1, r=6e-5, validate=True, # Evaluate the model after each epoch schedule_type="warmup_cosine", optimizer_type="lamb")`  Hi, following the official tutorial ("Language Model Fine-tuning) , i get the following error presented in screenshot,...
`args = Box({ "seed": 42, "task_name": 'Medical_language_modelling', "model_name": 'roberta-base', "model_type": 'roberta', "train_batch_size": 16, "learning_rate": 4e-5, "num_train_epochs": 20, "fp16": True, "fp16_opt_level": "O2", "warmup_steps": 1000, "logging_steps": 0, "max_seq_length": 512, "multi_gpu": True if...
Hi @kaushaltrivedi , I used: ``` learner.fit(epochs=6, lr=6e-5, validate=True. # Evaluate the model after each epoch schedule_type="warmup_cosine") ``` However, that code onlys checks after the whole training, not after each...
How can i use the confusion matrix for each class and the other metrics in this link https://github.com/kaushaltrivedi/fast-bert/issues/17 ??
Hello, I am the developer of python-box and see that it is a requirement in this repo and has not been version pinned. I suggest that you pin it to...
Hey, The error is that my 11 labels somehow get mapped to 16*11 values for one batch, which does not match the right size of 16. I have set up...
I am doing multi-label classification. Getting this error !
I couldn't get installation with the pinned versions of tokenizers and transformers in place, using the latest nightly rustc. Will my fast-bert installation be incorrect without those version pins? Is...
Hi , When using roberta-large-openai-detector for multiclass classification , I am getting below error: `RuntimeError: Error(s) in loading state_dict for RobertaForSequenceClassification: size mismatch for classifier.out_proj.weight: copying a param with shape...
I believe I found a bug in the `predict` code for the `BertClassificationPredictor` class. I am loading a trained model like this: ``` self.model = BertClassificationPredictor(model_path=model_dir, label_path=labels_dir, multi_label=True, model_type='distilbert', do_lower_case=False,...