GitHub - wsb1994/Cola: Reddit Download Script for Debian Systems. Should be able to set itself up and work just fine out of the box. Uses a lot of shell scripting because that's what I was learning in University at the time.

Collecting an Objective Library of Assets with C.O.L.A, or just "Cola" is simple. You only need a debian system or some time to manually install on any other system. Cola downloads large swaths of images from reddit. On better hardware (threadrippers etc.) you can download a hefty number of images concurrently.

requires a debian based linux installation to use the fast install script. (some of it is for show but you definitely need the python3 stuff)
Run the install script ./install.sh in your git directory as the super user or if you are not using a debian based distribution manually install things
Run either ./Concurrent_Cola.sh or ./Sequential_Cola.sh or their zip enabled counterparts. Concurrent requires you have a ridiculous number of cores to achieve an acceptable result. i'd say about a ratio of double the cores of the subreddits you want to pull.
Subreddits must be active to get good results.

configuration guide:

/Core/Subreddits.txt contains a list of subreddits to scrape, line by line, one subreddit per line

You can turn off piping output, but it looks super ugly and basically offers no information other than it's still running, hence it being sent to a txt file.

the driver script is courtesty of Reddit-Fetch, which I have modified here somewhat. The modifications were minor tweaks to syntax between the Python3 version and the Python2 version if I'm remembering correctly. The script is otherwise identical to the one provided by that repository. It's documentation is in tact under the API folder if you wish to become familiar with how it works. It otherwise works as a black box in this project, which is ran with CRON.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
API		API
Core		Core
Database		Database
Images		Images
subservices		subservices
.gitignore		.gitignore
Concurrent_Cola.sh		Concurrent_Cola.sh
Concurrent_Cola_WZip.sh		Concurrent_Cola_WZip.sh
LICENSE		LICENSE
Prezi		Prezi
README.md		README.md
Sequential_Cola.sh		Sequential_Cola.sh
Sequential_Cola_WZip.sh		Sequential_Cola_WZip.sh
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages