image-tools

Content-aware JPEG → WebP / AVIF conversion. Point it at a JPEG, get back a smaller WebP or AVIF at the same perceptual quality — no hand-tuning, no per-image judgment calls.

Same image, same perceptual quality (SSIMULACRA2 ≈ 80 vs the source), at a fraction of the bytes.

▶ Try it in your browser — drop a JPEG, compare WebP/AVIF, nothing uploaded. (Or run the CLI below.)

# Fast mode needs only two encoders — no Python, no ImageMagick, no ssimulacra2.
brew install webp libavif                      # or: apt install webp libavif-bin

node convert.mjs photo.jpg out/                # → out/photo.avif (or .webp), whichever is smaller

Why it's not just "pick quality 60": a JPEG quality 80 photo, illustration, and line-art scan each need a different WebP/AVIF quality to preserve equivalent perceptual quality. This ships pre-computed calibration curves (1% resolution, 10 perceptual metrics × 3 content types) that capture exactly how different, so every conversion lands at the right quality automatically.

How it works

Classify the image by content type (photo / illustration / line-art / pixel-art).
Look up the calibrated WebP/AVIF quality for that content type and the input JPEG's detected quality (read straight from the file — no ImageMagick needed).
Encode WebP and AVIF and ship whichever is smaller — and never larger than the source.

That's the default fast path (just cwebp + avifenc). Add --verify for a per-image guarantee: it binary-searches the lowest quality whose encode clears an absolute SSIMULACRA2 floor vs the source JPEG — accurate and classification-independent, at the cost of needing ssimulacra2 and more time.

How much smaller will my files get?

At the same visual quality — the images look identical, the files are just lighter — switching JPEG → AVIF typically saves:

Photos: ~30–50% smaller
Line-art: ~55–60% smaller
Illustrations / flat artwork: ~60–65% smaller

WebP saves less (and struggles on very-high-quality flat artwork), which is exactly why the converter encodes both and keeps whichever comes out smaller. Flatter images shrink the most; photos the least — same reason the right quality setting differs by content type:

The data, in one picture

Each colored line is one content type. It answers: to match a JPEG at quality X, what WebP (or AVIF) quality do you actually need? The dashed line is 1:1 — where you'd land if the quality numbers were interchangeable. They aren't, and that's the whole point:

Read across at JPEG q80: a photo needs AVIF q60 to match, an illustration only q42, line-art ~q48. Same input quality → wildly different output settings.
Photos sit highest (closest to the diagonal) because their fine texture is the hardest thing for a codec to preserve — they need more quality. Flat-fill illustration and line-art sit lower — they compress to the same perceived quality at a much lower setting.
A single hand-picked "quality 75" rule is one horizontal slice through this — it over-compresses one type and bloats another. These curves pick the right value per image automatically.

(Curves shown are SSIMULACRA2-matched, full 1–100 resolution. Regenerate the figure with calibration/venv/bin/python calibration/plot-curves.py.)

Requirements

The converter is deliberately light. Dependencies scale with what you do:

Fast mode (default conversion) — cwebp (libwebp) + avifenc (libavif) on your PATH. That's it. No Python, no ImageMagick, no ssimulacra2. JPEG quality is read directly from the file.
--verify mode — additionally ssimulacra2 (libjxl devtools), plus avifdec/dwebp (ship with libavif/libwebp) or ImageMagick to decode candidates for scoring.
--contact-sheet — ImageMagick 7 (magick), for the comparison montage only.
Regenerating the curves — see calibration/; needs the full toolchain (ssimulacra2, butteraugli, dssim, ffmpeg, ImageMagick, a PyTorch venv). You never need this to use the converter — the curves are already generated and committed.

Node.js ≥ 18.14 (ESM, no runtime npm dependencies). On macOS the encoders are brew install webp libavif; ssimulacra2 comes from a libjxl build with devtools enabled (only needed for --verify).

Setup

Conversion and classification need no installation beyond cwebp + avifenc:

git clone https://github.com/emrikol/image-tools && cd image-tools
node convert.mjs photo.jpg out/
# or run it straight from GitHub without cloning:
npx -p github:emrikol/image-tools img-convert photo.jpg out/

No npm install needed to use it — there are no runtime JavaScript dependencies (the only devDependency is ESLint, for contributors). The package isn't on the npm registry; use the repo directly, or the zero-install web demo. Everything for regenerating curves (the Python venv, etc.) lives in calibration/ and isn't needed to use the tool.

Datasets

Source images are not bundled in this repo (they're large, and the illustration/line-art sets are third-party content). The committed calibration JSONs contain only numbers, so you can use classify.mjs / convert.mjs immediately without any images.

To re-run calibration you supply your own datasets under test-images/<type>/:

photo — the Kodak lossless set (24 public-domain-style benchmark PNGs)
illustration / line-art — bring your own (flat-color artwork; black-and-white ink/pencil art)

Usage

Convert a JPEG

node convert.mjs input.jpg output-dir/                       # FAST: encode at the calibrated quality
node convert.mjs photos/ output-dir/                         # BATCH: every JPEG in a folder, in parallel
node convert.mjs input.jpg output-dir/ --verify              # binary-search to an absolute SSIMULACRA2 floor
node convert.mjs input.jpg output-dir/ --dry-run             # preview the plan without writing
node convert.mjs input.jpg output-dir/ --type illustration   # override content type
node convert.mjs input.jpg output-dir/ --keep-both           # write both WebP and AVIF winners
node convert.mjs input.jpg output-dir/ --contact-sheet       # also write a visual comparison PNG

Point it at a directory to batch-convert every JPEG in parallel — each file runs in an isolated process, so one bad image is reported and skipped rather than crashing the run.

Two modes. By default the converter trusts the frozen curves and encodes straight at the calibrated quality — fast, and dependency-light (just cwebp + avifenc). --verify adds the per-image guarantee: it binary-searches the lowest quality whose encode clears an absolute SSIMULACRA2 floor vs the source JPEG (--floor, default 80). This is classification-independent — the floor means the same thing on every image, so a misclassification can't silently under-encode — at the cost of needing ssimulacra2 and being slower (AVIF at --speed 0).

Both modes never bloat: if no encoding beats the source JPEG at the required quality, the original is kept (nothing is written) rather than emitting a larger file.

Other flags: --floor N (--verify fidelity bar; higher = stricter/larger), --report (full candidate table), --ssim-only (use only the SSIMULACRA2 curve; --no-lap is a deprecated alias), --contact-sheet / --compare (write <stem>-compare.png: the original JPEG next to the WebP and AVIF at full size, captioned with file size + SSIMULACRA2 score, so you can eyeball the result).

Classify an image

node classify.mjs image.jpg                # single image -> JSON
node classify.mjs image.jpg --verbose      # include raw signal values
node classify.mjs *.png --batch            # JSON array, progress on stderr

Use it as a library

import { writeFileSync } from 'node:fs';
import { convert } from './lib/convert.mjs';

const r = await convert('photo.jpg', { verify: true, floor: 80 });
// r.winner === 'avif' | 'webp' | null, r.keptOriginal, r.jpegQ, r.contentType
if (r.winner && !r.keptOriginal) {
  const best = r[r.winner]; // { quality, size, score, buffer, ... }
  writeFileSync(`photo.${r.winner}`, best.buffer);
}

convert() is pure compute — it returns the winning bytes as a Buffer and writes nothing, so it drops straight into a build pipeline. Options: { type, verify, floor, ssimOnly, dryRun, calibrationDir, extraCalibration, onProgress }.

Regenerating calibration curves (optional / archival)

The curves are already generated and committed — you never need this to use the converter. The generator lives in calibration/ with its own README; it's kept for transparency and reproducibility. It's a one-time, multi-hour job needing the full toolchain.

node calibration/calibrate.mjs \
  --dataset photo:test-images/kodak:. \
  --metrics ssimulacra2,butteraugli,dssim,xpsnr,ms_ssim,lpips,dists,fsim,vif,entropy_diff \
  --step 1

It uses every logical CPU core with single-threaded encoders (--avif-jobs 1, benchmarked fastest for many small images), and PyTorch metrics run through persistent worker pools (model loaded once, not per measurement). See calibration/README.md.

Calibration data

{metric}-calibration-{content-type}.json — JPEG→WebP/AVIF quality lookup tables, one per perceptual metric per content type. Schema and the full list of metrics are documented in calibration-schema.md. convert.mjs loads every curve available for a content type and takes the most conservative (highest) quality across them as its starting point.

All curves are full-resolution (every JPEG quality 1–100) — ssimulacra2, butteraugli, dssim, xpsnr, ms_ssim, lpips, dists, fsim, vif, and entropy_diff — across photo, illustration, and line-art. The one exception is vmaf, kept as a coarse 11-point line-art-only curve (it's intentionally disabled for photo/illustration; see the limitations below).

Performance

Both modes encode AVIF at --speed 0 (max compression). Rough timings on ~0.4 MP images (scale with megapixels):

Fast mode: ~1–2 s/image (one WebP + one AVIF encode, no measurement).
--verify: ~15–20 s/image (binary search ≈ 7 AVIF encodes + WebP parameter tuning, each scored with ssimulacra2).
Batch: runs across all CPU cores, one isolated process per image.

Status & known limitations

This is a research toolkit. Current rough edges:

The classifier is decent but not perfect — ~91% accuracy on the labeled sets (photo 100% / illustration 92% / line-art 79%), using histogram entropy as the photo↔illustration discriminator. Painterly illustrations can still read as photos, and the entropy threshold is tuned on a small set so treat it as provisional. Measure it yourself: node calibration/classify-eval.mjs <type:dir> …. Two safety nets regardless: errors skew toward the conservative photo curve (they cost compression, not quality), and --verify is classification-independent — prefer it (or an explicit --type) for anything ambiguous.
mixed falls back to the photo curves (conservative) when the classifier isn't confident.
vmaf is calibrated for line-art only and is otherwise disabled (it saturates at high quality and distorts the max-across-curves logic).
Datasets are small (24 photo / 25 illustration / 19 line-art) and are not bundled (see Datasets).

Support Policy

This project is shared as-is, with no support.

✅ Use, modify, and redistribute it freely under the GPL-3.0 license
❌ No support, bug fixes, or feature requests accepted
❌ Issues are disabled — please don't open issues or contact the maintainer for help
❌ Pull requests are accepted only from collaborators; PRs from anyone else are closed automatically
💡 Want it to do something else? Fork it — that's what the license is for

Why this policy?

image-tools is a personal research project, published for reference and reuse — not a supported product. The calibration data is tied to specific encoder builds, and the toolchain (cwebp / avifenc / ssimulacra2 plus the PyTorch metrics) varies by platform, so supporting every environment and use case is out of scope.

If it works for you: great. If not: fork it and adapt it to your needs.

For forkers

You're free to fork and redistribute under the GPL-3.0 license. For modified, redistributed versions, please use a different project name to avoid confusion; no endorsement by the original maintainer is implied.

License

GPL-3.0. The calibration JSONs (the shipped data) are covered by the same license.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github		.github
assets		assets
calibration		calibration
curves		curves
lib		lib
test		test
web		web
.editorconfig		.editorconfig
.gitignore		.gitignore
.markdownlint-cli2.jsonc		.markdownlint-cli2.jsonc
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
.stylelintrc.json		.stylelintrc.json
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
calibration-schema.md		calibration-schema.md
classify.mjs		classify.mjs
convert.mjs		convert.mjs
eslint.config.js		eslint.config.js
package-lock.json		package-lock.json
package.json		package.json
ruff.toml		ruff.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

image-tools

How it works

How much smaller will my files get?

The data, in one picture

Requirements

Setup

Datasets

Usage

Convert a JPEG

Classify an image

Use it as a library

Regenerating calibration curves (optional / archival)

Calibration data

Performance

Status & known limitations

Support Policy

Why this policy?

For forkers

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

image-tools

How it works

How much smaller will my files get?

The data, in one picture

Requirements

Setup

Datasets

Usage

Convert a JPEG

Classify an image

Use it as a library

Regenerating calibration curves (optional / archival)

Calibration data

Performance

Status & known limitations

Support Policy

Why this policy?

For forkers

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages