Skip to content

Latest commit

 

History

History
73 lines (51 loc) · 1.11 KB

README.md

File metadata and controls

73 lines (51 loc) · 1.11 KB

ISBN Scanner

Tool to find ISBNs in ebook files and fetch metadata from online databases

Features

  • WorldCat
  • Tika
  • JSON output
  • Multi-threaded

Installation

Make sure you have CMake and Ninja installed.

Binary Release

yay -S libpugixml-dev mold

Compiling from Source

cmake -S . --preset=Release
cmake --build ./out/build/Release -j<cores>

Running Unit Tests

cmake -S . --preset=Test
cmake --build ./out/build/Test -j<cores>
./out/build/Test/scanner

Setup

Tika

docker run -p 127.0.0.1:9998:9998 apache/tika:latest

Usage

scanner -f filetypes.json -c scanner.toml -i <input directory> -o books.json

Using the Results

Recommend JQ

Roadmap

v0.1

  • Better handling of the output JSON file
  • Add rate limiting

v0.2

  • Unit Tests
  • Fuzz testing
  • Add installation to CMake
  • Changelog
  • Complete README documentation

v0.3

  • Progress bar
  • Improve fuzzy matching on filenames/titles
  • Goodreads API

v0.4

  • Calibre compatible output