yxyOo

Results 4 comments of yxyOo

> This looks interesting! How accurate is it? We randomly selected several parallel configurations and conducted "Memory Requirement" tests on the 7B llama2 model using a single H800 machine with...

> Hi @yxyOo: I have a few questions about total_parameters computing. Since you mentioned your experiments on llama, but I find some inconsistency: > > 1. llama doesn't have bias...

> > > This looks interesting! How accurate is it? > > > > > > We randomly selected several parallel configurations and conducted "Memory Requirement" tests on the 7B...

> Hi, @yxyOo this is a great feature! While my suggestion might seem a bit much, I believe it would be beneficial to use the default argument parser from training....