Skip to content

A thin wrapper on top of pdf-js to extract pdf text content

License

Notifications You must be signed in to change notification settings

lino-levan/parse-pdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdf-parse

A modern implementation of https://gitlab.com/autokent/pdf-parse. A thin wrapper on top of pdf-js, just like the original.

import parsePdf from "jsr:@lino/pdf-parse";

const { text, info } = await parsePdf(
  "https://github.com/mozilla/pdf.js/files/1340729/Hyphenator.pdf",
);
console.log(text, info);

About

A thin wrapper on top of pdf-js to extract pdf text content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published