extract text from a pdf as an array of text blocks

dsandor/node-pdf-text

 
 


pdf-text2

Refactor of pdf-text.

Extracts text from a PDF and returns an array of pages, each of which is an array of the text lines extracted from that page.

install

$ yarn add pdf-text2

use

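const PDFText = require('pdf-text2');

// Path to the PDF to read, relative to this script
const pathToPdf = __dirname + '/info.pdf';

const pdfText = new PDFText(pathToPdf);

// pages: an array of pages, each an array of text lines
const pages = pdfText.parse();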
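
As a rough sketch of working with the result, the snippet below flattens the "array of pages, each an array of text lines" shape described above into a single string. `pagesToText` and `samplePages` are hypothetical names that are not part of pdf-text2, and the sample data is made up to stand in for whatever `parse()` returns.

```javascript
// Hypothetical helper, not part of pdf-text2: joins the lines of each page
// with newlines, then joins the pages with a separator (blank line by default).
function pagesToText(pages, pageSeparator = '\n\n') {
  return pages
    .map((lines) => lines.join('\n'))
    .join(pageSeparator);
}

// Made-up sample data in the documented shape (array of pages, each an
// array of text lines), used here in place of a real parse() result.
const samplePages = [
  ['Invoice 1', 'Total: 10.00'],
  ['Page two, line one'],
];

console.log(pagesToText(samplePages));
```

This assumes `pages` is already a plain array. If `parse()` turns out to return a Promise in your version, await it before passing the result to the helper.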
