Skip to content

Business Directory Scraper: Web crawler for business listing site (Python + Scrapy + MongoDB)

Notifications You must be signed in to change notification settings

mrizqiaal/business-directory-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Business Directory Scraper

Crawls company profile information from a comprehensive business directory of companies in Philippines (PhilippinesBusinessList) using Scrapy and store them into MongoDB.

Environment

Works on Ubuntu 16.04 LTS

Language: Python 3.5.2

Database: MongoDB

Depedencies:

Instructions for use

Pre-boot configuration

  • Install MongoDB and start without configuration (create database and collection)

  • Install Python dependent modules: Scrapy, pymongo, requests or

      $ pip install -r requirements.txt
    

Start up

$ cd PhilippinesBusinessList
$ python3 quickstart.py

Run screenshot

About

Business Directory Scraper: Web crawler for business listing site (Python + Scrapy + MongoDB)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages