yelp-scraper

An simple example of how to perform web scraping by using the Scrapy framework and the Yelp website as target.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

First you need to have the latest version of pip installed in your computer to be able to install de project dependencies, you can check the pip installation guide if you do not already have installed.

yelp-scraper depends on the Scrapy Python framework, you can install the latest version by using the following command on your terminal:

$ (sudo) pip install scrapy

How to setup and run the project

From here I'am assuming that you already have all prerequisites installed and properly configured in your machine.

Setup

Clone the repo or download it.

Running

Open your terminal and change to into the project folder:

$ cd ~/<folder>/yelp-scaper

Where <folder> is where you downloaded or cloned the repo.

Then you can start the scraping process by using the following command:

$ scrapy crawl yelp -a find='something' -a near='somewhere'

Arguments

Note: All arguments must be preceded by the -a argument, this is required by Scrapy.

find: This argument is required. The possible values for this argument are the same which you can use in the Yelp website, for example:

Restaurants, Nightlife, Air Conditioning & Heating, Contractors, Electricians, Home Cleaners, Landscapers, Locksmiths, Movers, Painters, Plumbers.

near: This argument is required. The possible values for this argument are the same which you can use in the Yelp website, for example:

London, San Francisco, etc...

max_results: This argument is optional, and your default value is 3. This argument allows you to limit the amount of results that the Scrapy will scrape from the website.

Contributing

Feel free to make your suggestion and/or contribution.

License

This project is licensed under the MIT License - see the LICENSE file for details

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
src		src
LICENSE		LICENSE
README.md		README.md
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

yelp-scraper

Getting Started

Prerequisites

How to setup and run the project

Setup

Running

Arguments

Contributing

License

About

Releases

Packages

Languages

License

Eustacio/yelp-scraper

Folders and files

Latest commit

History

Repository files navigation

yelp-scraper

Getting Started

Prerequisites

How to setup and run the project

Setup

Running

Arguments

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages