text-processing topic
PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
tokenizers
Elixir bindings for 🤗 Tokenizers
andaluh-js
Transliterate español (spanish) spelling to andaluz proposals using javascript
corpusexplorer2.0
Korpuslinguistik war noch nie so einfach...
FileSharper
An extensible, GUI-based file search and processing tool for Windows written in C# and WPF, with out-of-the-box functionality similar to grepWin and dnGrep
matcheroni
A minimalist single-header library for building pattern-matchers, lexers, and parsers.
nucleo
A fast and convenient fuzzy matcher library for rust
md-to-html
Sed script that converts Markdown to HTML code.