SimonBenhamou
With either @shaunheilee's or @sayhellotoAI2's proposed solution, I get NaN loss when resuming training... Am I the only one?
@minhthuc2502 I want the distribution over the whole vocabulary (or at least the top K). With your solution I only get the log prob of the token generated by the model...
Hello, is there an update on this? Thanks, Simon
I did, and could reproduce the fact that, no matter how long the prefix, the generation time is the same when using the generate_token method and measuring the...