Lily Erickson

Results: 37 comments by Lily Erickson

> Thanks! Do you have a small reproducer? Oh dear, thank you for asking. It appears I've made a very small mistake and jumped to conclusions early. Allow me to...

![image](https://user-images.githubusercontent.com/87243032/227738302-4bf6ce2d-bef9-4042-a83b-d87f02ec787c.png) Loss and learning rate get very small pretty quickly, and small values are usually expressed in e-x (scientific) notation. Are you sure your terminal isn't just truncating the output?
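To illustrate what display truncation can look like (a minimal Python sketch, not from the thread; the numbers are made up):

```
loss = 4e-05  # a small but nonzero loss value

print(f"{loss:.4f}")  # fixed-point: prints 0.0000, which looks like zero
print(f"{loss:.1e}")  # scientific notation: prints 4.0e-05
```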

![image](https://user-images.githubusercontent.com/87243032/227780494-b6e1b0ea-1487-49b6-9f96-6d66f1074d30.png) So I discovered an issue where, when switching to a new dataset, the attention mask actually just sets the dictionary key for the Output to be "", before calling...
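Roughly what I mean, as a hypothetical sketch (the key names here are placeholders, not the actual code path):

```
# Hypothetical: the new dataset uses a lowercase "output" key,
# so a lookup on "Output" silently falls back to an empty string.
example = {"instruction": "Say hi", "output": "Hello!"}

response = example.get("Output", "")  # wrong key -> ""
print(repr(response))  # '' -- the model then trains against an empty target
```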

Yes, the original cleaned version worked fine. After fixing the problem, the loss appears to stay steady for a single epoch.

7B works with the cleaned alpaca dataset, and another dataset of mine that uses a similar, yet not identical, format with different key names.

Your JSON dataset will have a list of dictionaries. If your output key is something other than "Reply" (maybe it's "Output" or "output"; they're case-sensitive), you should change the "Reply"...
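For example, a minimal sketch (the filename `data.json` and the key names are placeholders for whatever your dataset actually uses):

```
import json

# A JSON dataset is a list of dictionaries, one per training example.
with open("data.json") as f:
    data = json.load(f)

# Inspect the actual key names first; they're case-sensitive.
print(data[0].keys())

# If your output key is "output" rather than "Reply", rename it:
for example in data:
    if "Reply" not in example and "output" in example:
        example["Reply"] = example.pop("output")
```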

Try putting `with torch.autocast("cuda"):` at the start of your evaluate function:

```
def generate_response(
    instruction,
    inputs=None,
    temperature=0.7,
    top_p=0.75,
    top_k=40,
    num_beams=4,
    max_new_tokens=128,
    **kwargs,
):
    prompt = prompter.generate_prompt(instruction, inputs)
    with torch.autocast("cuda"):  # Useful...
```
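My understanding of why this helps (not confirmed in the thread): autocast runs the ops inside the block in mixed precision, so fp16 LoRA weights and fp32 activations don't hit a dtype mismatch during generation.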

Hmm, I'm not sure then. peft_model is just a wrapper for LoRA, though; it should use the same mechanism for both.

On CPU? I have no experience there, but llama.cpp is a CPU pipeline for llama; you could check out their repo (or maybe try autocasting to CPU? Idk, never done it)...
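If you do want to try the autocast route on CPU, here's a minimal untested sketch (CPU autocast only supports bfloat16 as far as I know; `model` and `inputs` are placeholders):

```
import torch

# Run generation under CPU autocast; ops inside the block use bfloat16.
with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    output = model.generate(**inputs)  # placeholders for your model/inputs
```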

> I ran into the same error message when launching. In my case, I commented out this section in `generate.py` since I'm purely on a CPU.
>
> ```
> ...