Skip to content

(2023) A web scraping program to generate personalized used car recommendations using Deep Learning by Dallin Stewart

Notifications You must be signed in to change notification settings

binDebug3/Casper

Repository files navigation

Casper

A web scraping program that gathers online listings to generate used car purchase recommendations by Dallin Stewart


Table of Contents
  1. Welcome
  2. Description
  3. Instructions for Download
  4. Instructions for Use
  5. Packages
  6. Contact
  7. Acknowledgements

Welcome

Casper is a python program that uses selenium to gather data used car sales online from retailers including AutoTrader, CarsDirect, CarGuru, Carvana, Lowbook, and KSL. As of the most recent update, Casper only searches for cars in the Provo, Utah area. After gathering data, Casper scores each car on how good of a deal the sale is and generates a CSV report of the current market in addition to the market history since November 2022.

The name Casper is inspired by the Spanish word 'raspar', which means 'to scrape'. It is also a reference to 'Casper the Friendly Ghost' because when Casper is running, it looks like a ghost is controlling your computer!


Description

  1. AutoTrader.py scrapes basic listing data according to parameters from autotrader.com
  2. CarGuru.py scrapes basic listing data according to parameters from carguru.com
  3. CarsDirect.py scrapes basic listing data according to parameters from carsdirect.com
  4. Carvana.py scrapes basic listing data according to parameters from carvana.com
  5. KSL.py scrapes basic listing data according to parameters from ksl.com
  6. Lowbook.py scrapes basic listing data according to parameters from lowbook.com
  7. CG_Detail.py scrapes detailed information according to parameters
  8. KSL_Detail.py scrapes detailed information according to parameters
  9. CD_Detail.py scrapes detailed information according to parameters

  10. Compressor.py compresses images scraped from websites using a custom image compression algorithm
  11. Search.py performs operations for scraping that are input and website agnostic
  12. main.py contains configuration details and calls each of the website scrapers to build final CSV reports

Instructions for Download

You'll need to start by downloading a few python packages with these commands:

You can then clone this project to your own machine from GitHub and then run the webscraper in any Python development environment.

(top)


Instructions for Use

  1. Create a CSV file for each website and put them in a folder called 'Data':
    a) autotrader.csv
    b) CarsDirect.csv
    c) CarGuru.csv
    e) Carvana.csv
    e) ksl.csv
    f) current_market.csv
  2. Run 'main.py'

(top)


Packages

Python     Jupyter       Numpy     Scipy

Matplotlib     ImageIO     Pandas     Selenium

PyTorch     Plyer

(top)

Contact

Dallin Stewart - dallinpstewart@gmail.com

LinkedIn GitHub Email

(top)


Acknowledgments

(top)

About

(2023) A web scraping program to generate personalized used car recommendations using Deep Learning by Dallin Stewart

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published