A lightweight CLI to recursively extract text from PDF files in a directory.
npx pdfxtract <directory> [options]| Option | Description |
|---|---|
-o, --output <dir> |
Output directory (preserves subdirectory structure) |
--overwrite |
Overwrite existing TXT files |
-v, --verbose |
Show detailed output |
-V, --version |
Show version number |
-h, --help |
Show help |
# Convert all PDFs in place
npx pdfxtract ./pdfs
# Output to a separate directory
npx pdfxtract ./pdfs -o ./output
# Overwrite existing files with verbose logging
npx pdfxtract ./pdfs -o ./output --overwrite -vNode.js >= 20.0.0
MIT