Philipp Singer

Results 46 issues of Philipp Singer

I am currently working with longer documents compared to the shorter ones the current version is tailored towards. Currently, the code pads all sequences to the length of the longest...

#### Your system information * Steam client version (build number or date): steam-latest 24th Nov * Distribution (e.g. Ubuntu): Ubuntu 18.04 * Opted into Steam client beta?: Yes * Have...

Steam client
Proton

Took me some time to find the culprit, but after upgrading to sqlitedict `2.0.0` the writing is significantly slower. I am writing with: ``` with SqliteDict("tmp.db") as tmp: tmp["tmp"] =...

bug

Currently, we only push the model weights to Huggingface. We could improve this process by adding some of these additional artifacts: - [x] Tokenizer - [ ] LLM Studio CFG...

area/core

Would be great to have some FAQs and templates/notebooks for common questions. - [ ] How to generate outputs outside of LLM Studio with trained weights pushed to HF -...

### 🚀 Feature Support tree-like conversation data - i.e. chain of thoughts such as the OASST data provdes. ### Motivation Currently, we only support prompt/output data structures. While one can...

type/feature

### 🔧 Proposed code refactoring Add the separator tokens as special tokens. Potentially then also add a separate setting to use the separator tokens as stop tokens. We should at...

area/core

There are several potential things to improve experience of the chat window: - Block the chat window if other training runs are active - Make the actual model loading procedure...

area/core

While we describe steps to get and load OASST demo data, one useful improvement could be to directly load the data into the GUI by default.

type/feature

### 🔧 Proposed code refactoring We can check for available space for: - Starting an experiment - Pushing model to HF ### Motivation Out of space will make an experiment...

area/core