text-processing topic

List text-processing repositories

DaisyDiff

78
Stars
63
Forks
Watchers

Visual :white_flower: comparison of HTML in :coffee: Java

cso-classifier

84
Stars
18
Forks
Watchers

Python library that classifies content from scientific papers with the topics of the Computer Science Ontology (CSO).

Hr

53
Stars
2
Forks
Watchers

Easy Access to Uppercase H

sliceslice-rs

87
Stars
16
Forks
Watchers

A fast implementation of single-pattern substring search using SIMD acceleration.

text-dedup

565
Stars
69
Forks
Watchers

All-in-one text de-duplication

odin-ai

21
Stars
11
Forks
Watchers

Orgainzed Digital Intelligent Network (O.D.I.N)

meta_XLM

20
Stars
4
Forks
Watchers

Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks

konfuzio-sdk

58
Stars
10
Forks
Watchers

OCR, extract and classify documents. In addition, annotate documents and build your own NLP and Computer Vision models using Python by downloading the data. Find examples in our Colab Notebooks, e. g....