m0nsky
Running into the same issue. Training w/ Unsloth (LLaMA-Factory) through WSL successfully spills over to system RAM with CUDA `Sysmem Fallback Policy` enabled, allowing me to train a 16k context...
> @m0nsky OOO interesting so `non_blocking = False` works?? Hmm maybe I should make a new method called `"unsloth-wsl"` for WSL people, to use blocking calls. You will get some...
Using this PR, I was able to switch between AVX2 and CUDA in a Unity game. 👍
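
For reference, the switching logic on my end looks roughly like this (a minimal sketch; `USE_CUDA` is a hypothetical app-side setting of my own, and the exact `NativeLibraryConfig` surface may differ between LLamaSharp versions):

```csharp
using System;
using LLama.Native;

// Backend selection has to happen before the first model load, since
// the native library is resolved once and cached for the process.
// "USE_CUDA" is a hypothetical app-side setting used for illustration.
bool useCuda = Environment.GetEnvironmentVariable("USE_CUDA") == "1";

if (useCuda)
    NativeLibraryConfig.Instance.WithCuda();
else
    NativeLibraryConfig.Instance.WithAvx(AvxLevel.Avx2);
```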
My tests were done on Windows 10. Unfortunately I don't have any other platforms to test on.
@AsakusaRinne I think there is an even better use case for this now, since Vulkan support was introduced to LLamaSharp in 0.14 (July 16). Is there any chance you could...
Maybe it would be better to skip the Vulkan device check for now (but leave the code in place), and leave it to the user to enable Vulkan on the...
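
To illustrate what I mean by leaving it to the user, something along these lines; `WithVulkan()` is a hypothetical method name here, the actual API would be whatever this PR ends up exposing:

```csharp
using LLama.Native;

// Sketch: no automatic Vulkan device probing at startup. The user
// opts in explicitly instead. WithVulkan() is a hypothetical name
// used for illustration, not a confirmed LLamaSharp API.
bool userEnabledVulkan = true; // imagine this comes from a settings file

if (userEnabledVulkan)
    NativeLibraryConfig.Instance.WithVulkan();
else
    NativeLibraryConfig.Instance.WithAvx(AvxLevel.Avx2);
```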
@emulk @CastIehard this should now be fixed in the LLamaSharp 0.15 release from 3 days ago.
Hi @LucaMaccarini, does this PR need any additional changes? I'm not sure what you mean by that last note (have you found out if we need to start with...
Yes, absolutely, here are my steps:
- Download `nuget.exe` and add it to PATH (environment variable, Windows 11)
- Clone your WIP branch, build the project
- Pack the nuget packages,...
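
In case it helps reproduce: after packing, I point the test project at the locally packed packages with a `NuGet.config` next to the solution (the `./packages` path is a placeholder for wherever you pack them):

```xml
<?xml version="1.0" encoding="utf-8"?>
<configuration>
  <packageSources>
    <!-- "local" points at the folder the freshly packed .nupkg files
         land in; adjust the path to your own packing output. -->
    <add key="local" value="./packages" />
    <add key="nuget.org" value="https://api.nuget.org/v3/index.json" />
  </packageSources>
</configuration>
```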
> Thank you both @m0nsky and @LSXAxeller for your feedback. I have fixed the problem, which was caused by the .props files for the Vulkan and CUDA12 packages having incorrect...