Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
Jun 8, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
An orchestration platform for the development, production, and observation of data assets.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Upserts, Deletes And Incremental Processing on Big Data.
Turns Data and AI algorithms into production-ready web applications in no time.
Business Automations is a collection of automations built to enhance productivity, increase revenue, and reduce manual data manipulation at a retail store location that integrates a NCR Counterpoint SQL database with the BigCommerce e-commerce platform.
The open source high performance ELT framework powered by Apache Arrow
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Software suite for marker gene identification and cell type integration from single cell RNA-sequencing data
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift) in real-time.
Lean and mean distributed stream processing system written in rust and web assembly.
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
Hop Orchestration Platform
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Flink CDC is a streaming data integration tool
Change Data Capture (CDC) tool from any source(s) to any target
Privacy and Security focused Segment-alternative, in Golang and React
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."