🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
-
Updated
Jun 5, 2024 - TypeScript
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
An event-driven pipeline for extracting text from an email attachment (in progress)
A simple document processing registration system project for my cloud engineering honours module demonstrating the power of Amazon Textract, an AWS service for extracting text and data from images or documents.
Amazon Textract Response Parser library for Node.
Add a description, image, and links to the aws-textract topic page so that developers can more easily learn about it.
To associate your repository with the aws-textract topic, visit your repo's landing page and select "manage topics."