Awesome-LLM-Constrained-Decoding

Towards reliable, controllable and more efficient generation with Large Language Models (LLMs)

A curated list of papers on constrained decoding for LLMs, along with their code and related resources.

Table of Contents

- Libraries
- Papers
- Benchmark & Datasets & Evaluation
- Survey
- Blog Posts
- Related Awesome Lists
- Disclaimer
- Contributing

Libraries

Library	Features
guidance-ai/guidance	CFG, Regex, JSON Schema, Token Forcing, compatible with Transformers and llama.cpp
outlines-dev/outlines	CFG, Unicode support, Hugging Face ecosystem, vLLM support
eth-sri/lmql	Regex support, various constraints, more powerful control flow
jxnl/instructor	Try-Reject-Repeat approach to ensure constraints are met
microsoft/aici	A general framework of LLM controller with native support for CFG, Regex, JSON Schema
noamgat/lm-format-enforcer	Regex, JSON Schema, Beam Search etc.
mlc-ai/xgrammar	CFG, careful system optimizations
epfl-dlab/transformers-CFG	CFG (EBNF Interface), Compatible with Transformers, Easy to extend for research
uiuc-focal-lab/syncode	CFG generation that supports builtin grammars like JSON, Python, Go, and more
Dan-wanna-M/formatron	Regex, JSON Schema, CFG, etc
genlm/genlm-control	Arbitrary programmable syntactic and semantic constraints, Constrained decoding as posterior inference, Sequential Monte Carlo
structuredllm/itergen	CFG generation and backtracking to handle semantic constraints
eth-sri/type-constrained-code-generation	TypeScript, including type-safety.
snowkylin/circuit-transformer	Generating logical circuits that are strictly equivalent to given Boolean functions.
epfl-dlab/jsonschemabench	A benchmarking framework for evaluating constrained decoding engines on JSON Schema. Supports Guidance, Outlines, XGrammar, OpenAI and more.
eth-sri/constrained-diffusion	Diffusion LLMs, Multi-Region Infilling (CFG)
grammarllm	LLprefix, DPDA

Disclaimer:

The libraries listed above are not exhaustive and are subject to change.
The features listed are far from exhaustive; please check the respective repositories for details.
The libraries are sorted by GitHub stars.
If you are the author of a library and would like to add or update the information, please open an issue or submit a pull request.
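
Several of the libraries above (for example Outlines and XGrammar) enforce a constraint by masking out, at each decoding step, every token that would make the output invalid. The sketch below illustrates that general idea on a toy problem. It is not taken from any listed library: the vocabulary, the `toy_logits` scoring function, and the `allowed` checker are all hypothetical stand-ins, and the constraint is the simple pattern `-?[0-9]+`.

```python
import math

# Toy vocabulary (an assumption for illustration; real tokenizers are far larger).
VOCAB = ["0", "1", "7", "a", "-", "<eos>"]

def toy_logits(prefix):
    """Hypothetical stand-in for an LLM: one score per vocabulary entry.

    It deliberately prefers "a", so unconstrained decoding breaks the constraint.
    The <eos> score grows with the prefix length so that decoding can stop.
    """
    base = {"0": 0.5, "1": 1.0, "7": 0.8, "a": 2.0, "-": 0.1, "<eos>": 0.3}
    return [base[t] + (0.4 * len(prefix) if t == "<eos>" else 0.0) for t in VOCAB]

def allowed(prefix, token):
    """Incremental check for the pattern -?[0-9]+ (an optional sign, then digits)."""
    if token == "<eos>":
        # Stopping is only valid once at least one digit has been produced.
        return len(prefix) > 0 and prefix[-1].isdigit()
    if token == "-":
        # A sign may only appear at the very start.
        return prefix == ""
    return token.isdigit()

def greedy_decode(constrain=True, max_len=5):
    """Greedy decoding; with constrain=True, invalid tokens get a score of -inf."""
    prefix = ""
    for _ in range(max_len):
        scores = toy_logits(prefix)
        if constrain:
            scores = [s if allowed(prefix, t) else -math.inf
                      for s, t in zip(scores, VOCAB)]
        token = VOCAB[max(range(len(VOCAB)), key=scores.__getitem__)]
        if token == "<eos>":
            break
        prefix += token
    return prefix

print(greedy_decode(constrain=False))  # the toy model's unconstrained preference
print(greedy_decode(constrain=True))   # output restricted to -?[0-9]+
```

Production engines replace the hand-written `allowed` function with an automaton or parser compiled from a regex, JSON Schema, or CFG, and must also handle tokens that span several characters. Retry-based tools such as instructor take a different route and validate the finished output instead of masking individual tokens.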

Papers

Papers marked with an icon are newly added to this list (not necessarily newly published).

Constrained Decoding for Autoregressive Models

Date	Paper	Publication
2025-07	Grammar-Constrained Decoding Makes Large Language Models Better Logical Parsers	ACL
2025-07	GRAMMAR-LLM: Grammar-Constrained Natural Language Generation	ACL Findings
2025-07	Pre3: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation	ACL
2025-07	Syntactic Control of Language Models by Posterior Inference	ACL Findings
2025-06	Earley-Driven Dynamic Pruning for Efficient Structured Decoding	ICML
2025-05	Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling	Preprint
2025-05	Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo	ICLR
2025-04	Efficient and Asymptotically Unbiased Constrained Decoding for Large Language Models	AISTATS
2025-02	CRANE: Reasoning with constrained LLM generation	ICML
2025-05	Type-Constrained Code Generation with Language Models	PLDI
2025-02	Lost in Space: Optimizing Tokens for Grammar-Constrained Decoding	Preprint
2025-02	Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence	Preprint
2025-02	Flexible and Efficient Grammar-Constrained Decoding	Preprint
2025-01	Circuit Transformer: A Transformer That Preserves Logical Equivalence	ICLR
2025-01	Generating Structured Outputs from Language Models: Benchmark and Studies	Preprint
2024-11	XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models	Preprint
2024-10	IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking	ICLR
2024-08	Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models	Preprint
2024-08	FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking	Preprint
2024-07	Automata-based constraints for language model decoding	COLM
2024-06	Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access	ACL
2024-05	Grammar-Aligned Decoding	Preprint
2024-03	SynCode: LLM Generation with Grammar Augmentation	Preprint
2024-03	Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation	ICML
2024-02	Constrained Decoding for Cross-lingual Label Projection	ICLR
2024-02	Constrained Decoding for Code Language Models via Efficient Left and Right Quotienting of Context-Sensitive Grammars	Preprint
2024-02	Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents	Preprint
2023-12	SGLang: Efficient Execution of Structured Language Model Programs	Preprint
2023-12	Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context	NeurIPS
2023-11	Prompt Sketching for Large Language Models	Preprint
2023-11	Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs	PADL
2023-10	Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding	Preprint
2023-10	Amortizing intractable inference in large language models	ICLR
2023-10	Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting	EMNLP
2023-10	KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection	EMNLP
2023-10	Lazy-k Decoding: Constrained Decoding for Information Extraction	EMNLP
2023-07	Efficient Guided Generation for Large Language Models	Preprint
2023-06	Grammar Prompting for Domain-Specific Language Generation with Large Language Models	NeurIPS
2023-06	Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning	EMNLP
2023-06	Prompting Is Programming: A Query Language for Large Language Models	PLDI
2023-05	Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing	EMNLP Findings
2023-04	Tractable Control for Autoregressive Language Generation	ICML
2022-11	Validating Large Language Models with ReLM	MLSys
2022-11	CodePAD: Sequence-based Code Generation with Pushdown Automaton	ISSTA
2022-05	Controllable Text Generation with Neurally-Decomposed Oracle	NeurIPS
2022-05	Gradient-Based Constrained Sampling from Language Models	EMNLP
2022-02	COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics	NeurIPS
2022-01	Synchromesh: Reliable code generation from pre-trained language models	ICLR
2021-12	PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models	EMNLP
2021-12	Constrained Language Models Yield Few-Shot Semantic Parsers	EMNLP
2021-12	Controlled Text Generation as Continuous Optimization with Multiple Constraints	NeurIPS
2021-06	NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints	NAACL
2019-05	A General-Purpose Algorithm for Constrained Sequential Inference	CoNLL
2019-05	Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting	NAACL
2018-09	CGMH: Constrained Sentence Generation by Metropolis-Hastings Sampling	AAAI
2018-05	Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation	NAACL
2018-04	Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method	AAAI
2017-12	Guided Open Vocabulary Image Captioning with Constrained Beam Search	EMNLP
2017-06	Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search	ACL

Constrained Decoding for Diffusion Models

Date	Paper	Publication
2025-08	Constrained Decoding of Diffusion LLMs with Context-Free Grammars	Preprint
2025-05	DINGO: Constrained Inference for Diffusion LLMs	Preprint
2025-03	Constrained Language Generation with Discrete Diffusion Models	Preprint

Benchmark & Datasets & Evaluation

Date	Paper	Publication
2025-08	JSON-mode Eval Cleaned/Extended, SMILES Eval and HumanEval MRI C++	HF Hub
2025-01	JsonSchemaBench: Generating Structured Outputs from Language Models: Benchmark and Studies	Preprint
2024-05	COLLIE: Systematic Construction of Constrained Text Generation Tasks	ICLR
2024-02	JSON-mode Eval dataset	HF Hub
2023-12	BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing	NeurIPS Track on Datasets and Benchmarks
2023-10	Evaluating Large Language Models on Controlled Generation Tasks	Preprint
2023-09	Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?	Preprint
2021-10	NLV corpus	CHI
2020-12	CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning	EMNLP Findings
2018-09	Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task	EMNLP

Survey

Date	Paper	Publication
2024-04	"We Need Structured Output": Towards User-centered Constraints on Large Language Model Output	Preprint

Blog Posts

Many of these blog posts were written by the Outlines team; many thanks to them for their great work! ❤️

Related Awesome Lists

awesome-llm-json

Disclaimer

This list is not exhaustive and will be updated regularly. If you have any suggestions or want to add a paper, please feel free to open an issue or submit a pull request. We hope to include all relevant papers in this list.

Contributing

Contributions are welcome! Feel free to submit a pull request or open an issue. Please make sure to read the Contributing Guidelines before contributing.
