text-processing topic

List text-processing repositories

andaluh-py

20
Stars
3
Forks
Watchers

Transliterate español (spanish) spelling to andaluz proposals using python

Emotion-recognition-from-tweets

17
Stars
8
Forks
Watchers

A comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.

Twitter_Data_Analysis

22
Stars
20
Forks
Watchers

Complete Guide to text processing and sentiment analysis on Twitter data.

perlpp

15
Stars
7
Forks
Watchers

Perl preprocessor - embed Perl source in any file

yoruba-adr

24
Stars
11
Forks
Watchers

Automatic Diacritic Restoration of Yorùbá language Text

syn

51
Stars
4
Forks
Watchers

syn - the thesaurus

s3-concat

38
Stars
5
Forks
Watchers

Concatenate Amazon S3 files remotely using flexible patterns

s3-utils

53
Stars
11
Forks
Watchers

Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI

Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas

pyline

37
Stars
4
Forks
Watchers

Pyline is a grep-like, sed-like, awk-like command-line tool for line-based text processing in Python. https://pypi.python.org/pypi/pyline