text-processing topic

List text-processing repositories

stringx

26
Stars
0
Forks
Watchers

Drop-in replacements for base R string functions powered by stringi

bloatectomy

33
Stars
9
Forks
Watchers

A python package for removing duplicate text in clinical notes or other documents

chr

18
Stars
2
Forks
Watchers

🔤 Lightweight R package for manipulating [string] characters

NLP-tools

45
Stars
9
Forks
Watchers

Useful python NLP tools (evaluation, GUI interface, tokenization)

atarashi

25
Stars
22
Forks
Watchers

Atarashi scans for license statements in open source software, focusing on text statistics. Designed to work stand-alone and with FOSSology.

Text2Summary-Android

23
Stars
5
Forks
Watchers

A library for Text Summarization on Android applications.

nlpo3

30
Stars
6
Forks
Watchers

Thai Natural Language Processing library in Rust, with Python and Node bindings.

Russian_subtitles_dataset

22
Stars
1
Forks
Watchers

Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.

sciteco

47
Stars
6
Forks
Watchers

Advanced TECO dialect and interactive screen editor based on Scintilla

sova-tts-tps

51
Stars
9
Forks
Watchers

NLP-preprocessor for the SOVA-TTS project