text-processing topic

List text-processing repositories

PyMuPDF

4.3k
Stars
426
Forks

PyMuPDF is a high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF (and other) documents.

tokenizers

90
Stars
14
Forks

Elixir bindings for 🤗 Tokenizers

andaluh-js

27
Stars
5
Forks

Transliterate Spanish (español) spelling to Andalusian (andaluz) spelling proposals using JavaScript

corpusexplorer2.0

20
Stars
3
Forks

Corpus linguistics has never been so easy...

FileSharper

21
Stars
4
Forks

An extensible, GUI-based file search and processing tool for Windows written in C# and WPF, with out-of-the-box functionality similar to grepWin and dnGrep

matcheroni

192
Stars
4
Forks

A minimalist single-header library for building pattern-matchers, lexers, and parsers.

nucleo

734
Stars
25
Forks

A fast and convenient fuzzy matcher library for Rust

md-to-html

26
Stars
2
Forks

A sed script that converts Markdown to HTML.

r4strings

38
Stars
21
Forks

Handling Strings in R