Combines book-catalogue data files into one unified list
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
input
src/main/java/com/bookmerger
.gitignore
LICENSE
README.md
book-merger.iml
merged-data.json
pom.xml

README.md

book-merger

This monstrosity merges the book metadata about my library from canonical.csv and librarything.csv in book-catalog and the scraped data from book-scraper.

Data files must be formatted such that each line has one parsable JSON object containing the book data. Obviously, pre-processing of the data sources is necessary to get it into a usable format.