tumblr-scraper

This project was created as a black box, from scratch reimplementation of Liru/tumblr-downloader, recreating and improving upon its features.

Features

Documentation (up until now this strictly has been a private project)
Crawling of >5000 posts per day will lead to rate limiting
Continuing a previously failed crawl/scrape is not supported
Setting the before field in the config allows you to scrape backwards starting at a date in the past.
That way you can manually, iteratively scrape a huge blog in "sane" chunks (e.g. first everything before 2014, then 2015, 2016, ...).
Support for youtube-dl would be nice

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
account		account
app		app
config		config
cookiejar		cookiejar
database		database
scraper		scraper
semaphore		semaphore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go
tumblr.toml		tumblr.toml