Web Crawler/Spider for NodeJS + server-side jQuery ;-)
-
Updated
Jun 7, 2024 - JavaScript
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
A project to select only part of a PDF file. It's usefull when you want to extract informations with some python library like fitz.
A Mardown parser for extracting hierarchical content.
Designed for processing and cleaning HTML content from a JSON Lines input file, extracting meaningful text, and writing it to a text output file.
Example project demonstrating how to use PDFix SDK WebAssembly build in Node.js. Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
Example project demonstrating how to use PDFix SDK WebAssembly build in Node.js. Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
Automatically extracts packages root name for monorepos
Chrome extension to extract a select portion / section of a webpage into a PDF file
Allows extracting data from DOM
A simple tool to parse and extract data from a resume.
Extract certain data from github repositories using the v4 API offered by github itself.
Extracts sentences from txt files.
Receipt scanner extracts information from your PDF or image receipts - built in NodeJS
Atomic Web Service (AWS, REST API) for converting PDF files to plain/text, powered by pdftotext and Node.js
Get values from complicated data structures, nested arrays and objects, using request string like 'foo.[].bar.[].baz'.
Atomic Web Service (AWS, REST API) for converting DOC/DOCX files to plain/text, powered by catdoc, docx2txt and Node.js
Extract html snippets getting the minimal css rules from source or computing the css values
Add a description, image, and links to the extract-data topic page so that developers can more easily learn about it.
To associate your repository with the extract-data topic, visit your repo's landing page and select "manage topics."