Alex Cheema
The main thing I want to address and test is device support. We can make this the default inference engine if it works reliably across many devices. On that point,...
Hey @risingsunomi, I'm thinking of making this the default inference engine on Linux machines. Could you resolve the conflicts, please?
`torch` is not added as a dependency.
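For what it's worth, a minimal sketch of what declaring it could look like, assuming dependencies are managed via `setuptools` in `setup.py` (the extras name and version pins below are illustrative, not the project's actual layout):

```python
# setup.py (sketch) -- assumes exo declares dependencies via setuptools;
# the "torch" extras name and version pins are illustrative only.
from setuptools import setup, find_packages

setup(
    name="exo",
    packages=find_packages(),
    install_requires=[
        # ... existing dependencies ...
    ],
    extras_require={
        # Optional extra so the PyTorch engine pulls in its own dependencies.
        "torch": [
            "torch>=2.0",
            "accelerate",  # needed for device_map / low_cpu_mem_usage loading
        ],
    },
)
```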
```
error loading and splitting model: Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate: `pip install accelerate`
Error processing prompt: Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate: `pip install accelerate`
Traceback...
```
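For context, `transformers` raises this whenever `from_pretrained` is called with `device_map=...` or `low_cpu_mem_usage=True` but `accelerate` isn't installed. A rough sketch of the two ways around it (the model id and arguments are placeholders):

```python
from transformers import AutoModelForCausalLM

# Option 1: keep device_map / low_cpu_mem_usage, which requires `accelerate`
# to be installed (so it should be declared as a dependency).
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B-Instruct",  # placeholder model id
    device_map="auto",
    low_cpu_mem_usage=True,
)

# Option 2: skip the accelerate-backed loading path and place the model
# manually -- viable when exo already decides which device a shard lives on.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B-Instruct",  # placeholder model id
).to("cuda")
```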
It seems to use some other downloader (perhaps transformers?). It should use the exo downloader for integration with exo (also, these other downloads aren't necessarily async-friendly, but the exo one...
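Roughly the intended pattern, as a sketch only: the `ensure_shard_downloaded` helper and its import path below are hypothetical stand-ins for exo's actual downloader API.

```python
# Hypothetical sketch: `ensure_shard_downloaded` and its import path are
# stand-ins for exo's real downloader API, used here only to show the shape.
from exo.download import ensure_shard_downloaded  # hypothetical import


async def load_shard(shard) -> str:
    # Awaiting exo's downloader keeps the event loop responsive and lets exo
    # report progress the same way as the other inference engines, instead of
    # calling a blocking transformers/huggingface_hub download.
    local_path = await ensure_shard_downloaded(shard)
    return local_path
```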
> > It seems to use some other downloader (perhaps transformers?). It should use the exo downloader for integration with exo (also, these other downloads aren't necessarily async-friendly, but the...
It generates! Looks like some tokenizer issue. It never stops generating.
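One guess at the cause (not confirmed from the code): the generation loop never checks the tokenizer's EOS token(s), so it keeps sampling. A minimal sketch of the check, assuming a token-by-token loop; the model id and `max_new_tokens` default are placeholders:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "meta-llama/Llama-3.1-8B-Instruct"  # placeholder model id
)

# eos_token_id can be a single id or a list of ids (newer Llama configs list
# several stop tokens), so normalize it to a set up front.
eos_ids = tokenizer.eos_token_id
eos_ids = set(eos_ids) if isinstance(eos_ids, (list, tuple)) else {eos_ids}


def is_finished(next_token_id: int, num_generated: int, max_new_tokens: int = 512) -> bool:
    # Stop on any EOS id, or when the generation budget is exhausted.
    return next_token_id in eos_ids or num_generated >= max_new_tokens
```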
> > > > > > It seems to use some other downloader (perhaps transformers?). It should use the exo downloader for integration with exo (also, these other downloads aren't necessarily...
Another issue (this can be fixed last, as it's a tricky one): we need to ensure that the torch operations are not blocking. This means the blocking parts need...
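A minimal sketch of the usual fix, assuming the forward pass is the blocking call (the function and argument names are placeholders): offload it to a thread executor so the asyncio event loop stays free.

```python
import asyncio

import torch


async def infer_tensor_async(model, input_ids: torch.Tensor) -> torch.Tensor:
    """Run a blocking torch forward pass without stalling the event loop."""
    loop = asyncio.get_running_loop()

    def _blocking_forward() -> torch.Tensor:
        # Heavy, synchronous work: runs on a worker thread from the default
        # thread pool executor.
        with torch.no_grad():
            return model(input_ids)

    # run_in_executor (or asyncio.to_thread on Python 3.9+) offloads the call,
    # so other coroutines (networking, discovery, heartbeats) keep running
    # while the model computes.
    return await loop.run_in_executor(None, _blocking_forward)
```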
> > > > It generates! Looks like some tokenizer issue. It never stops generating.
> >
> > Which model is this tested with? Will test more

`llama-3.1-8b`

This command: `exo...