Parth Sareen

Results 18 issues of Parth Sareen

I had written this at a time where the Ollama api didn't exist. There is a lot of bloat around langchain and I'd like to get to something a bit...

good first issue

Fixes: https://github.com/ollama/ollama/issues/7993

From offline discussions: - Current API parameters improperly mix system and request level configurations - For now, let users set a context length through `OLLAMA_CONTEXT_LENGTH` - Users can still set...

This package introduces a first pass at the sampler for the new engine. ## Usage ```go sampler, err := sample.NewSampler( req.Temperature, req.TopK, req.TopP, req.MinP, req.Seed, ) token, err := seq.sampler.Sample(logits[seq.iBatch*vocabSize...

- [ ] update Sampler interface to unmarshal json into sampling options - [ ] Error handling on unmarshal - [ ] update transform interface to be private

feature request

- [ ] rm uv sampler - [ ] rm rand - [ ] rm f64 transforms -> f32s - [ ] default seed should be -1

feature request

follow up to https://github.com/ollama/ollama/pull/8938

feature request

WIP tps - in isolation + temperature greedy: 125 - no temp apply top k: 78 -> 84 top p: 44 -> the equivalent 50% drop is noticed in the...

removed transforms from greedy sample to allow testing with the sampler package