Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 

README.md

scandata

Collection of automatically-generated scan calibration files pulled from Internet Archive. View the project here.

The images in this collection were scraped from the Internet Archive website. Used for calibrating large book scanners, they automate the process of color correction and the removal of hot spots created by the scanner’s lights, and are meant to be read by the calibration software rather than humans.

When present in an Internet Archive entry, the images are stored in an odd file format (.ppm) inside a zip archive called “scandata.zip,” from which the title for this collection comes. Using a script written in Python, URLs for Internet Archive entries containing these files were pulled from the Bing search engine API, their contents downloaded, and the images converted to JPGs. They are shown here with a link to the Internet Archive entry from which they came. This collection represents a small fraction of the thousands of such files on Internet Archive’s site.

All images copyright their creators, everything else CC BY-NC-SA.

About

Collection of automatically-generated scan calibration files pulled from Internet Archive

Resources

Releases

No releases published

Packages

No packages published
You can’t perform that action at this time.