Arthuro
Arthuro
Hello all! First of all, thank you for the awesome annotation tool! I was previously working with doccano < 1.5.0 and everything was working fine. More than a year later...
Hi all, I am trying to replicate the DeepSpeed Inference basic tutorial with the following script on a VM with 1x 40GB A100: ``` model = AutoModelForCausalLM.from_pretrained( "EleutherAI/gpt-neox-20b", torch_dtype=torch.float16, #load_in_8bit=True,...
Hi there! First of all, thank you for the amazing work! The readme says the models were trained on "the new dataset based on The Pile" which is 3x the...
As I understood figure 5 in your paper, you further fine-tuned GPT-SelfInstruct on the SuperNatural Instructions data and surprisingly the results got worse compared to the "vanilla" GPT-SelfInstruct. Is my...