Skip to content

bgithub1/selscrape

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

selscrape - simple webscraping wrapper around python selenium (currently supports +3.5)

The selscrape project contains the following classes:

  1. SelScrape - a class that simplifies calls to selenium to:
  • navigate to website urls
  • extract information from website elements using xpath
  • enter information on forms of a website using xpath
  • click on buttons, etc
  • simplify downloading files from urls
  1. SelDictAccess - a simpler form of SelScrape that allows you to pass a dictionary of xpath statements to use when accessing html elements.
  2. CraigAccess - access auto information from Craigslist using SelScrape and SelDictAccess.
  3. ChaseCareers - another example of using SelScrape to extract job postings from the jpmorganchase careers website.

See the examples.ipynb jupyter notebook for some simple examples of this project.

About

Scrape websites using python3 selenium

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published