wmeints/extract-pdf-content

Extract PDF Content

This library is inspired by the original pdf-parse, which is no longer maintained. I built this package to support a Node-based application that generates a weekly podcast from interesting content, which is often stored in PDF documents.

If you find this useful, please star the repository on GitHub!

Getting started

Install the package with the command npm install extract-pdf-content. Then use the following code to extract the text from a PDF file.

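// Top-level await requires an ES module context.
import { extractText } from 'extract-pdf-content';
import fs from 'fs/promises';

// Read the PDF file into memory.
const documentData = await fs.readFile('my-pdf-document.pdf');
// Extract the text content of the document.
const documentText = await extractText(documentData);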
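
When processing more than one document, it can help to collect a result per file so that a single unreadable PDF does not stop the whole run. The following is a minimal sketch of that idea. The helper name extractMany and the shape of its result objects are hypothetical and not part of this package. The extractor is passed in as an argument, so the sketch assumes nothing about extractText beyond the call shown above: it receives the file contents and returns a promise for the text.

```javascript
// Hypothetical helper (not part of extract-pdf-content): run an extractor over
// several in-memory PDF documents and collect one result per document.
// `documents` maps a name to the file contents; `extract` is assumed to behave
// like extractText above (takes the contents, returns a promise for the text).
async function extractMany(documents, extract) {
  const names = Object.keys(documents);

  // allSettled waits for every document, so one failure does not abort the rest.
  // The async wrapper turns a synchronous throw into a rejected promise.
  const settled = await Promise.allSettled(
    names.map(async (name) => extract(documents[name]))
  );

  // Pair each outcome with its document name.
  return names.map((name, i) => {
    const outcome = settled[i];
    return outcome.status === 'fulfilled'
      ? { name, ok: true, text: outcome.value }
      : { name, ok: false, error: outcome.reason };
  });
}

// Example call, assuming the imports from the snippet above:
// const results = await extractMany(
//   { 'my-pdf-document.pdf': await fs.readFile('my-pdf-document.pdf') },
//   extractText
// );
```

Passing the extractor in rather than importing it keeps the helper easy to try out with a stand-in function before pointing it at real files.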

About

Node package to extract text content from PDF files
