🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS Textract. Built with AWS CDK and TypeScript.
Updated May 8, 2024 - TypeScript
Performance Observability for Apache Spark
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first
⛅ Versatile Data Pipeline (VDP) console website
Sync your team's data to your LLM applications in real-time
Watchmen Platform is a low-code data platform for data pipelines, metadata management, analysis, indicator objective analysis, and quality management
An extensible pipelining tool to build data pipelines from your bank account to any destination.
Create database-agnostic aggregations based on data pipelines