thedatadavis/redfin_scraper

redfin_scraper

Scrapes listing data and photos from Redfin

Overview

  • Uses Selenium WebDriver behind a proxy to get past bot detection
  • Loops through the proxy list until it finds one that works
  • Once a proxy returns an HTTP 200 response, loads the given listing URL
  • Uses BeautifulSoup to parse summary data (bed/bath count, etc.), key details (lot size, year built), amenities, and recent price history
  • Returns to Selenium to step through the photos, because only the lead image has its src populated in the HTML
  • Joins the data into a single dictionary
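
The proxy loop and the final join described above could look roughly like the sketch below. It is not the repository's code: the function names (`proxy_returns_200`, `find_working_proxy`, `join_listing_data`), the `check` parameter, and the use of the standard library's `urllib` for the probe request are all assumptions made for illustration.

```python
import http.client
import urllib.request
from typing import Callable, Iterable, Optional


def proxy_returns_200(proxy: str, test_url: str, timeout: float = 5.0) -> bool:
    """Hypothetical probe: True if test_url answers HTTP 200 through proxy."""
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    opener = urllib.request.build_opener(handler)
    try:
        with opener.open(test_url, timeout=timeout) as response:
            return response.status == 200
    except (OSError, http.client.HTTPException):
        # Covers URLError/HTTPError, timeouts, and malformed proxy responses.
        return False


def find_working_proxy(
    proxies: Iterable[str],
    test_url: str,
    check: Callable[[str, str], bool] = proxy_returns_200,
) -> Optional[str]:
    """Return the first proxy that passes the check, or None if none do."""
    for proxy in proxies:
        if check(proxy, test_url):
            return proxy
    return None


def join_listing_data(*sections: dict) -> dict:
    """Merge the parsed sections into one dictionary (later keys win)."""
    listing: dict = {}
    for section in sections:
        listing.update(section)
    return listing
```

Passing `check` as a parameter keeps the loop testable without network access. If the browser is Chrome, the chosen proxy could then be handed to Selenium through Chrome's `--proxy-server` argument; which browser and option the scraper actually uses is not stated here.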

TODO

  • Refactor into modules
  • Create cloud function to run as microservice
