data-processing topic

List data-processing repositories

pulsar-flink

278
Stars
118
Forks

Elastic data processing with Apache Pulsar and Apache Flink

rapidtables

287
Stars
10
Forks

Super fast list of dicts to pre-formatted tables conversion library for Python 2/3

eternal

512
Stars
31
Forks

👾~ music, eternal ~ 👾

bonobo

1.6k
Stars
143
Forks

Extract Transform Load for Python 3.5+

data-science-on-gcp

1.3k
Stars
709
Forks

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

haupt

452
Stars
213
Forks

Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon

lithops

307
Stars
97
Forks

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

DialoGPT

2.3k
Stars
342
Forks

Large-scale pretraining for dialogue

texar-pytorch

744
Stars
119
Forks

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/