Allen Benz
Allen Benz
https://kaiokendev.github.io/til#extending-context-to-8k Someone had the clever idea of scaling the positional embeddings inversely proportional to the extended context length. Adding ``` self.scale = 1 / 2 t *= self.scale ``` after...
Label is the same but I got one with a newer product Id. Also added the parameters/metadata from the other zw1002 entry as the configuration parameters are the same. (as...
Fixes #347 and a similar issue where closing vscode leaves the dotnet watch processes around (also only on windows)
dotnet sdk: 6.0.100 nunit: 3.13.2 OS: windows 10 example repo: https://github.com/allenbenz/dotnet-test-explorer-linger-issue The dotnet watch process isn't stopped when executing `dotnet-test-explorer.stop`. The cmd shell that started the dotnet watch process is...
### What happened? Tried resuming from backup of an sdxl lora training session, got an exception. I have some **random changes made to the codebase** that may be a precondition...
Summary Some of us are stuck using older versions of maven and the projects builds with 3.3.1. Lower than 3.2.5 and maven-compiler-plugin complains. Lower than 3.3.1 and the integrationTests fail...