All IT Ebooks.com Scraper

This project has been superceded by my new dockerized Ebook API.

A program that scrapes links, and other data, to your favorite tech ebooks from allitebooks.com and stores them in a MongoDB instance.

Getting Started

Note. One of this projects dependencies, scrapy-mongodb, does not support Python 3 yet. Therefore, this project only supports Python 2

Clone this repository.
Create a new Python Virtual Environment.
pip install -r requirements.txt

Configuration

Create a fresh MongoDB instance either on our local machine or a free 500 MB remote instance from MLab.
Replace the following in settings.py with your own credentials:

MONGODB_URI = ''
MONGODB_DATABASE = ''
MONGODB_COLLECTION = ''

Other MongoDB settings can be found here at the scrapy-mongodb repo.

Contributing

Anyone, regardless of skill level, is encouraged to give feadback and submit pull requests.

Acknowledgments

A special thanks to the following:

The Scraping Hub Team who maintain the Scrapy project;
Sebastian Dahlgren maintainer of scrapy-mongodb

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
ebook		ebook
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
readme.md		readme.md
requirements.txt		requirements.txt
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ebook

ebook

.gitignore

.gitignore

CODE_OF_CONDUCT.md

CODE_OF_CONDUCT.md

LICENSE

LICENSE

readme.md

readme.md

requirements.txt

requirements.txt

scrapy.cfg

scrapy.cfg

Repository files navigation

All IT Ebooks.com Scraper

Getting Started

Configuration

Contributing

Acknowledgments

About

Releases

Packages

Languages

License

Jeffallan/All-It-Ebooks-Scraper

Folders and files

Latest commit

History

Repository files navigation

All IT Ebooks.com Scraper

Getting Started

Configuration

Contributing

Acknowledgments

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages