data-engineering topic
Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
mlrun
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates t...
blaze
Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
datadocs
Documentation for data enthusiasts
soorgeon
Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊
dataplane
Dataplane is an Airflow-inspired unified data platform with additional data mesh and RPA capabilities to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a...
beneath
Beneath is a serverless real-time data platform ⚡️
Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow