text-processing topic

List text-processing repositories

Musoq

499
Stars
21
Forks
499
Watchers

SQL Runtime without any database

linux-practice-challenges

32
Stars
6
Forks
32
Watchers

In this course, you will find a collection of Linux practice challenges that will help you to improve your Linux skills. These challenges are designed to help you learn and practice Linux commands, sh...

Amharic-Tokenizer

96
Stars
14
Forks
96
Watchers

Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.

context-compressor

72
Stars
12
Forks
72
Watchers

AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning with advanced compression strategies.