vmpuri issues

Results 11 issues of


                                            vmpuri

Do not fail job if subprocess called with ignore_status argument

Summary: Jobs should not be marked as failed if a subprocess has `kwargs["ignore_status"] == True` Or if "silent" is True Reviewed By: KarlSimonsen Differential Revision: D36103193

CLA Signed

fb-exported

OpenAI API JSON formatted

Implement JSON formatted responses using OpenAI API types for server completion requests. Rather than giving single tokens at a time, the server will respond with a JSON following the API...

CLA Signed

Implementation of the /models endpoint https://platform.openai.com/docs/api-reference/models Start the server: ``` python3 torchchat.py server stories15M ``` In another terminal: ``` curl http://127.0.0.1:5000/models {"data": [{"id": "stories15M", "created": 1722531822, "owner": "puri", "object": "model"}],...

CLA Signed

Merge Openai api version route to main

https://github.com/pytorch/torchchat/pull/1016 mistakenly got merged into this development branch instead of main.

CLA Signed

[v0.3.1] Release Tracker

The v0.3.1 release will be cut from the "[release/0.3](https://github.com/pytorch/executorch/tree/release/0.3)" branch for critical fixes to the [v0.3.0](https://github.com/pytorch/executorch/releases/tag/v0.3.0) release. Branch Cut Date: 9/6/2024 Intended Release Date: 9/16/2024 This issue is for tracking...

triaged

release tracker

Fix eval for .pte

**Issue** Inputs aren't set up correctly for .pte files. The input tensors must be static and cannot be reshaped. Currently, running eval will result in this error: ``` python3 torchchat.py...

CLA Signed

Add ability to select models and edit system/assistant prompts

**Goal: ** Users should be able to select the model from the chat interface and receive a response from that model. **Currently:** we just send the request and take the...

CLA Signed

Dataclass Type Enforcement

### 🚀 The feature, motivation and pitch Select a method for enforcing types in Python dataclasses (i.e. pydantic) ### Alternatives Don't enforce types - may cause type errors when decoding...

enhancement

actionable

Slow eval performance for .pte models

### 🐛 Describe the bug Eval is very slow for PTE models vs. non-exported models - the opposite should be true and can be observed in generate. I suspect this...

performance

actionable

ExecuTorch

Download Hugging Face models into Hugging Face cache

Currently, we download models to a local (~/.torchchat by default). For Hugging Face models, we should download to the Hugging Face cache instead. As per Hugging Face: ``` By default,...

enhancement

CLA Signed