pdfnova

PDFium-powered PDF library for JavaScript & TypeScript.

Chrome-grade rendering via WebAssembly. Full TypeScript types. Zero-config WASM loading.

Docs · Install · Quickstart · Rendering · Search · Annotations · Forms · Signatures

Built on @embedpdf/pdfium — the real PDFium engine (used in Chrome) compiled to WebAssembly.

Install

npm install pdfnova

Two Tiers

	pdfnova/lite	pdfnova (full)
JS Bundle (minified)	~3KB	~5KB
WASM Binary	4.4MB on disk / ~1.5MB over the wire (Brotli)
Rendering	Yes	Yes
Text extraction	Yes	Yes
Text layer (DOM)	Yes	Yes
Search	Yes	Yes
Bookmarks/TOC	Yes	Yes
Virtualization	Yes	Yes
Worker/OffscreenCanvas	Yes	Yes
Annotations (read/write)	—	Yes
Form filling/flattening	—	Yes
Digital signatures	—	Yes
doc.save()	—	Yes

The WASM binary is fetched once and cached in IndexedDB — subsequent page loads are instant with zero network cost. Both tiers share the same binary — the lite/full distinction controls which TypeScript API features are available.

// Lightweight — render, text, search, bookmarks
import { PDFDocument } from "pdfnova/lite";

// Full — everything above + annotations, forms, signatures
import { PDFDocument } from "pdfnova";

Quick Start

import { PDFDocument } from "pdfnova/lite";

// Open from URL, ArrayBuffer, File, Blob, or base64 data URI
const doc = await PDFDocument.open("/report.pdf");

// Document info
console.log(doc.pageCount); // 42
console.log(doc.metadata); // { title, author, subject, ... }
console.log(doc.outline); // OutlineItem[] (bookmarks tree)

// Render a page
const page = doc.getPage(0);
const canvas = document.createElement("canvas");
await page.render(canvas, { scale: 2 });

// Text extraction with character-level precision
const text = page.getText();
const spans = page.getTextSpans(); // TextSpan[] with x, y, width, height

// Build a selectable text layer over the canvas
page.createTextLayer(container);

// Full-text search
const results = doc.search("quarterly revenue");

// Cleanup
doc.close();

Rendering

const page = doc.getPage(0);

// Render to canvas
await page.render(canvas, {
  scale: 2, // 2x resolution
  rotation: 90, // 0, 90, 180, 270
  background: "#ffffff",
});

// Render to ImageData (no DOM required)
const imageData = await page.renderToImageData({ scale: 1.5 });

// Thumbnails
import { PageRenderer } from "pdfnova/lite";
await PageRenderer.renderThumbnail(page, thumbCanvas, 200);

// Fit-to-width scale calculation
const scale = PageRenderer.fitWidthScale(page, containerWidth);

Virtual Renderer

For large documents, only render visible pages:

import { VirtualRenderer } from "pdfnova/lite";

const renderer = new VirtualRenderer({
  container: document.getElementById("viewer")!,
  scale: 1.5,
  overscan: 2, // render 2 pages above/below viewport
  cacheSize: 10, // LRU cache for rendered pages
});

await renderer.setDocument(doc);
renderer.scrollToPage(5);
console.log(renderer.getCurrentPage());

Text Layer

pdfnova uses PDFium's character-level bounding boxes for pixel-perfect text selection:

// Span-level positioning (fast, good enough for most use cases)
const layer = page.createTextLayer(container);

// Character-level positioning (slower but pixel-perfect)
import { TextLayerBuilder } from "pdfnova/lite";
const builder = new TextLayerBuilder(wasm, bridge);
builder.buildCharLevel(textPagePtr, container, page.width, page.height, scale);

Search

// Search a single page
const pageResults = page.search("revenue", { caseSensitive: true });

// Search entire document
const allResults = doc.search("quarterly revenue", { wholeWord: true });

// Each result has:
// - pageIndex, charIndex, charCount
// - rects (TextRect[] for highlighting)
// - text (matched text)

Bookmarks / Table of Contents

const outline = doc.outline;
// OutlineItem { title, pageIndex, children: OutlineItem[] }

for (const item of outline) {
  console.log(`${item.title} → page ${item.pageIndex + 1}`);
  for (const child of item.children) {
    console.log(`  ${child.title} → page ${child.pageIndex + 1}`);
  }
}

Annotations (full tier)

import { PDFDocument, AnnotationType } from "pdfnova";

const doc = await PDFDocument.open(data);
const page = doc.getPage(0);

// Read existing annotations
const annotations = await page.getAnnotations();

// Add a highlight annotation
await page.addAnnotation({
  type: AnnotationType.Highlight,
  rect: { left: 72, top: 720, right: 300, bottom: 700 },
  color: { r: 255, g: 235, b: 59, a: 128 },
  contents: "Important section",
});

// Remove an annotation
await page.removeAnnotation(0);

// Save modified PDF
const bytes = await doc.save();

Form Filling (full tier)

import { PDFDocument } from "pdfnova";

const doc = await PDFDocument.open(formPdf);

// Read all form fields
const fields = await doc.getFormFields();
// FormFieldData { name, type, value, isChecked, pageIndex }

// Fill fields
await doc.setFormField("name", "John Doe");
await doc.setFormField("email", "john@example.com");

// Flatten forms (make non-interactive)
await doc.flattenForms();

// Save
const filled = await doc.save();

Digital Signatures (full tier)

import { PDFDocument } from "pdfnova";

const doc = await PDFDocument.open(signedPdf);

const signatures = await doc.getSignatures();
// SignatureData { index, contents, byteRange, subFilter, reason, signingTime }

// Format signature info
import { SignatureInfo } from "pdfnova";
for (const sig of signatures) {
  console.log(SignatureInfo.formatSummary(sig));
  console.log(SignatureInfo.getSignatureFormat(sig.subFilter));
}

Worker Pool

Render pages concurrently across multiple Web Workers:

import { WorkerPool } from "pdfnova/lite";

const pool = new WorkerPool(4); // 4 workers
await pool.init({ tier: "lite" });

// Render multiple pages in parallel
const images = await pool.renderPages([0, 1, 2, 3], { scale: 2 });

pool.destroy();

Authenticated URLs

const doc = await PDFDocument.open(
  "https://api.example.com/documents/123/download",
  {
    headers: { Authorization: "Bearer eyJ..." },
    credentials: "include",
  },
);

Password-Protected PDFs

const doc = await PDFDocument.open(encryptedPdf, {
  password: "secret123",
});

Custom WASM URL

By default, pdfnova loads the PDFium WASM binary from jsDelivr CDN. To self-host:

const doc = await PDFDocument.open(data, {
  wasmUrl: "https://cdn.example.com/pdfium.wasm",
});

You can get the WASM binary from node_modules/@embedpdf/pdfium/dist/pdfium.wasm and serve it from your own infrastructure.

API Reference

PDFDocument

Method/Property	Description
`PDFDocument.open(source, options?)`	Open a PDF from URL, ArrayBuffer, File, Blob, or data URI
`.pageCount`	Number of pages
`.metadata`	`{ title, author, subject, keywords, creator, producer, creationDate, modDate }`
`.outline`	Bookmark tree (`OutlineItem[]`)
`.permissions`	`{ print, copy, modify, annotate, fillForms, ... }`
`.getPage(index)`	Get a page (0-indexed)
`.search(query, options?)`	Search entire document
`.getFormFields()`	Read form fields (full tier)
`.setFormField(name, value)`	Fill a form field (full tier)
`.flattenForms()`	Flatten forms (full tier)
`.getSignatures()`	Read digital signatures (full tier)
`.save()`	Save modified PDF as Uint8Array (full tier)
`.close()`	Free all WASM memory

PDFPage

Method/Property	Description
`.pageIndex`	0-based page index
`.width` / `.height`	Page dimensions in PDF points
`.render(canvas, options?)`	Render to canvas
`.renderToImageData(options?)`	Render to ImageData
`.getText()`	Extract plain text
`.getTextSpans()`	Extract text with positions
`.getCharBoxes()`	Character-level bounding boxes
`.createTextLayer(container)`	Build selectable text layer
`.search(query, options?)`	Search this page
`.getLinks()`	Extract hyperlinks
`.getAnnotations()`	Read annotations (full tier)
`.addAnnotation(options)`	Add annotation (full tier)
`.removeAnnotation(index)`	Remove annotation (full tier)
`.close()`	Free page resources

How WASM Loading Works

pdfnova uses @embedpdf/pdfium for the pre-built PDFium WebAssembly binary. No manual compilation is needed.

On first use, WasmLoader fetches pdfium.wasm (4.4MB) from CDN — ~1.5MB with Brotli compression (standard on CDNs)
The binary is cached in IndexedDB — subsequent visits load instantly with zero network cost
The module is initialized via PDFiumExt_Init() and adapted to pdfnova's typed interface
All downstream API calls go through the real PDFium C engine via the WASM bridge

To override the WASM URL (e.g., for air-gapped environments), pass wasmUrl in PDFDocument.open() options.

To clear the cached binary: WasmLoader.clearCache()

Development

npm install          # Install dependencies
npm run build        # TypeScript check + tsup build
npm test             # Run all 71 tests
npm run test:watch   # Watch mode
npm run test:coverage # With V8 coverage report

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
docs		docs
src		src
tests		tests
wasm		wasm
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pdfnova

Install

Two Tiers

Quick Start

Rendering

Virtual Renderer

Text Layer

Search

Bookmarks / Table of Contents

Annotations (full tier)

Form Filling (full tier)

Digital Signatures (full tier)

Worker Pool

Authenticated URLs

Password-Protected PDFs

Custom WASM URL

API Reference

PDFDocument

PDFPage

How WASM Loading Works

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

pdfnova

Install

Two Tiers

Quick Start

Rendering

Virtual Renderer

Text Layer

Search

Bookmarks / Table of Contents

Annotations (full tier)

Form Filling (full tier)

Digital Signatures (full tier)

Worker Pool

Authenticated URLs

Password-Protected PDFs

Custom WASM URL

API Reference

PDFDocument

PDFPage

How WASM Loading Works

Development

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages