The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
May 28, 2024 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Turns Data and AI algorithms into production-ready web applications in no time.
An orchestration platform for the development, production, and observation of data assets.
Business Automations is a collection of automations built to enhance productivity, increase revenue, and reduce manual data manipulation at a retail store location that integrates a NCR Counterpoint SQL database with the BigCommerce e-commerce platform.
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Building data processing pipelines for documents processing with NLP using Apache NiFi and related services
Fair Entity Matching: A Fairness Suite for Auditing Entity Matching Approaches
Framework for developing extractors in Python
SpDM is a data integration tool designed to organize scientific data from different sources under the same namespace according to a global schema and to provide access to them in a unified form (views). Its main purpose is to provide a unified data access interface for complex scientific computations in order to enable the interaction and integrati
SpDB is a data integration tool designed to organize scientific data from different sources under the same namespace according to a global schema and to provide access to them in a unified form (views). Its main purpose is to provide a unified data access interface for complex scientific computations in order to enable the interaction and integr…
Powerful RDF Knowledge Graph Generation with RML Mappings
Translator of spreadsheet mappings into R2RML, RML or YARRRML
A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.
The W4H Integrated Toolkit Repository provides a unified platform for managing, analyzing, and visualizing wearable health data using a suite of open-source tools and frameworks.
An Efficient RML-Compliant Engine for Knowledge Graph Construction
Link Modeling Language (LinkML) model
A data integration tool for BBMRI-ERIC biobanks.
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."