data-processing topic

List data-processing repositories

collapse

607
Stars
29
Forks
Watchers

Advanced and Fast Data Transformation in R

GODEL

839
Stars
110
Forks
Watchers

Large-scale pretrained models for goal-directed dialog

data-processing-agreements

132
Stars
24
Forks
Watchers

Collection of Data Processing Agreement (DPA) and GDPR compliance resources

DataflowJavaSDK

855
Stars
326
Forks
Watchers

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

pxi

267
Stars
3
Forks
Watchers

🧚 pxi (pixie) is a small, fast, and magical command-line data processor similar to jq, mlr, and awk.

etl

340
Stars
21
Forks
Watchers

PHP - ETL (Extract Transform Load) data processing library

pysparkling

260
Stars
44
Forks
Watchers

A pure Python implementation of Apache Spark's RDD and DStream interfaces.

mech

199
Stars
10
Forks
Watchers

🦾 Main repository for the Mech programming language. Start here!