convtools is a specialized Python library for dynamic, declarative data transformations with automatic code generation
westandskif
Scalable data pre processing and curation toolkit for LLMs
NVIDIA-NeMo