text-processing topic

List text-processing repositories

Auto-CORPus

15
Stars
6
Forks
Watchers

Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London collaboration to standardize text and table data extracted from full text publications. See Open Access publica...

split-markdown4gpt

20
Stars
2
Forks
Watchers

A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows t...

humanreadable

15
Stars
1
Forks
Watchers

humanreadable is a Python library to convert human-readable values to other units.

purl

206
Stars
5
Forks
Watchers

Streamlining Text Processing

syntakts

17
Stars
0
Forks
Watchers

Simple to use text parser and syntax highlighter for Kotlin Multiplatform

Kudasai

25
Stars
4
Forks
25
Watchers

Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies

flashtext2

17
Stars
3
Forks
Watchers

The fastest FlashText library for Python

Valmiki_Ramayan_Dataset

17
Stars
1
Forks
17
Watchers

Structured dataset of Valmiki Ramayana 📜 | Sanskrit Shlokas, Translations, & Explanations for AI & NLP🚀 Contributions welcome!