Philipp Singer
Philipp Singer
I am currently working with longer documents compared to the shorter ones the current version is tailored towards. Currently, the code pads all sequences to the length of the longest...
#### Your system information * Steam client version (build number or date): steam-latest 24th Nov * Distribution (e.g. Ubuntu): Ubuntu 18.04 * Opted into Steam client beta?: Yes * Have...
Took me some time to find the culprit, but after upgrading to sqlitedict `2.0.0` the writing is significantly slower. I am writing with: ``` with SqliteDict("tmp.db") as tmp: tmp["tmp"] =...
Currently, we only push the model weights to Huggingface. We could improve this process by adding some of these additional artifacts: - [x] Tokenizer - [ ] LLM Studio CFG...
Would be great to have some FAQs and templates/notebooks for common questions. - [ ] How to generate outputs outside of LLM Studio with trained weights pushed to HF -...
### 🚀 Feature Support tree-like conversation data - i.e. chain of thoughts such as the OASST data provdes. ### Motivation Currently, we only support prompt/output data structures. While one can...
### 🔧 Proposed code refactoring Add the separator tokens as special tokens. Potentially then also add a separate setting to use the separator tokens as stop tokens. We should at...
There are several potential things to improve experience of the chat window: - Block the chat window if other training runs are active - Make the actual model loading procedure...
While we describe steps to get and load OASST demo data, one useful improvement could be to directly load the data into the GUI by default.
### 🔧 Proposed code refactoring We can check for available space for: - Starting an experiment - Pushing model to HF ### Motivation Out of space will make an experiment...