data-processing topic
pulsar-flink
Elastic data processing with Apache Pulsar and Apache Flink
rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
eternal
👾~ music, eternal ~ 👾
bonobo
Extract Transform Load for Python 3.5+
data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
haupt
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
DialoGPT
Large-scale pretraining for dialogue
texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
VASPy
Manipulating VASP files with Python.