Skip to content

MarcusKyung/greenhouse.io-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Greenhouse.io Job Scraper

¨ ¨ ¨ ¨

By Marcus Kyung

Selenium and Beautiful Soup scripts made to scrape Lever and Greenhouse.io job pages in order to see open positions and when they were posted

Contents:

Technologies Used:

  • Python
  • Selenium

Description:

Scrapes Greenhouse.io job careers pages and prints the date the job was posted.

Greenhouse.io Examples:

Lever Examples:

Setup/Installation Requirements:

Run Locally:

  1. Clone this repository to a local machine with Python. Install Selenium and Beautiful Soup. Alternatively, a VENV can be used.
  2. Run program with python3 then the file you wish to run. Follow prompt to input Lever or Greenhouse.io careers page you wish to scrape. Must be a https://boards.greenhouse.io/XYZ careers page or https://jobs.lever.co/XYZ. Cannot be custom or embedded job listings. Lever offers the following query strings: Location Type, Location, Team, Work Type. Users can add optional relevant query string filers formatted like the following ex: ?location=Melbourne. Greenhouse does not offer filtering by query string.
  3. Results are printed to terminal.

Known Bugs/Issues:

  • Script doesn't work on employer site embedded greenhouse.io pages like this one: https://www.airship.com/company/careers/. In this instance a standard greenhouse.io page is available here: https://boards.greenhouse.io/airship, but this is not always the case
  • Script doesn't work on non-standard greenhouse.io page like this one: https://boards.greenhouse.io/alma because the job title is formatted in a non-standard method. It is not in an h1, rather it is in a p tag and bolded. This script works only when page formatting is programatically generated.

Contact:

For questions, comments, or concerns please reach out at Kyungmj@gmail.com

MIT License & Copywrite:

Copyright (c) [2023] [Marcus Kyung]

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR\ A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Releases

No releases published

Packages

No packages published

Languages