Yonatan Ilan

Does anyone have any new information on using FlexGen with LLaMA-based models? Maybe it can work without too many changes to the code?