text-processing topic
andaluh-py
Transliterate español (spanish) spelling to andaluz proposals using python
Emotion-recognition-from-tweets
A comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.
Twitter_Data_Analysis
Complete Guide to text processing and sentiment analysis on Twitter data.
perlpp
Perl preprocessor - embed Perl source in any file
yoruba-adr
Automatic Diacritic Restoration of Yorùbá language Text
s3-concat
Concatenate Amazon S3 files remotely using flexible patterns
s3-utils
Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI
HuggingFace-Datasets-Text-Quality-Analysis
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
pyline
Pyline is a grep-like, sed-like, awk-like command-line tool for line-based text processing in Python. https://pypi.python.org/pypi/pyline