Evan Jones

Results 3 issues of Evan Jones

Hi! I decided to take a stab at leveraging the new get / set state APIs to cache initial prompt evaluation in `main`. On my M2 at least, this feature...

enhancement
high priority

Inspired by #1397 and [grantslatton's CFG work](https://github.com/grantslatton/llama.cpp/commit/007e26a99d485007f724957fa8545331ab8d50c3), this adds an API that takes a serialized context-free grammar to guide and constrain sampling. Also adds a sample Backus-Naur form (BNF)-like syntax...

enhancement
high priority
generation quality

**Depends on #1773** (to test this, merge that first) Adds a Python script that converts a JSON schema into the grammar format from #1773. This allows generating JSON according to...