Classified Ads Scraper

Requirements

Python (>= 3.10)

Reproducing the environment

Clone the repository.

git clone https://github.com/toludaree/classified-ads-scraper.git

Create a python virtual environment and activate it. You can use the venv package. Name the environment .venv.

python -m venv .venv

# Activate
.venv/Scripts/activate     # Windows
source .venv/bin/activate  # Linux

Install scrapy and other associated libraries through requirements.txt
```
pip install -r requirements.txt
```

Scrape ClassifiedAds

Navigate into the classifiedads directory.
```
cd classifiedads
```
Choose the category or subcategory you want to scrape from ClassifiedAds.com. Here is a screenshot of all the categories and subcategories

Begin the scrapy process using the scrapy crawl command.

scrapy crawl ads -a name=<category> -O <file-path>

# category - name of subcategory that you chose from the last section
# file-path - path to save the results of the scraping process too. It can be a JSON, CSV or an XML file.

For example, we might want to scrape SUV ads and save the file to suv.json.
```
scrapy crawl ads -a name="SUVs" -O suv.json
```
A screenshot of the crawling session in progress
A screenshot of the results. You can get the JSON file here

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Classified Ads Scraper

Requirements

Reproducing the environment

Scrape ClassifiedAds

Files

README.md

Latest commit

History

README.md

File metadata and controls

Classified Ads Scraper

Requirements

Reproducing the environment

Scrape ClassifiedAds