noise-trader / aws-pdf-textract-pipeline
This project is forked from aeksco/aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the web and transforming their contents into structured data using AWS Textract. Built with AWS CDK + TypeScript.
License: MIT License