Philipp Singer issues

Results 46 issues of


                                            Philipp Singer

Best practise for long documents

I am currently working with longer documents compared to the shorter ones the current version is tailored towards. Currently, the code pads all sequences to the length of the longest...

Missing downloaded files with Proton Experimental

#### Your system information * Steam client version (build number or date): steam-latest 24th Nov * Distribution (e.g. Ubuntu): Ubuntu 18.04 * Opted into Steam client beta?: Yes * Have...

Steam client

Proton

Version 2.0.0 significantly slower

Took me some time to find the culprit, but after upgrading to sqlitedict `2.0.0` the writing is significantly slower. I am writing with: ``` with SqliteDict("tmp.db") as tmp: tmp["tmp"] =...

bug

[CODE IMPROVEMENT] Push to Huggingface improvements

Currently, we only push the model weights to Huggingface. We could improve this process by adding some of these additional artifacts: - [x] Tokenizer - [ ] LLM Studio CFG...

area/core

FAQ Section

Would be great to have some FAQs and templates/notebooks for common questions. - [ ] How to generate outputs outside of LLM Studio with trained weights pushed to HF -...

[FEATURE] Support nested tree conversation data

### 🚀 Feature Support tree-like conversation data - i.e. chain of thoughts such as the OASST data provdes. ### Motivation Currently, we only support prompt/output data structures. While one can...

type/feature

[CODE IMPROVEMENT] Improve functioanlity for separator and stop tokens

### 🔧 Proposed code refactoring Add the separator tokens as special tokens. Potentially then also add a separate setting to use the separator tokens as stop tokens. We should at...

area/core

[CODE IMPROVEMENT] Chat experience

There are several potential things to improve experience of the chat window: - Block the chat window if other training runs are active - Make the actual model loading procedure...

area/core

[FEATURE] Default dataset

While we describe steps to get and load OASST demo data, one useful improvement could be to directly load the data into the GUI by default.

type/feature

[CODE IMPROVEMENT] Check for available disk space

### 🔧 Proposed code refactoring We can check for available space for: - Starting an experiment - Pushing model to HF ### Motivation Out of space will make an experiment...

area/core