Skip to content

Regorg/indeed_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

UPDATE

Take a look at updated version: https://github.com/grzesiekdev/indeed_scraper

Indeed Scraper

This program is designed to scrap job offers from https://pl.indeed.com/. Scraped offers go to output.html, and are nicely represented in form of bootstrap table.

Indeed Web scraper

Built With

Getting Started

git clone https://github.com/Regorg/indeed_scraper.git

Prerequisites

  • Python >=3.7
  • requirements.txt
cd indeed_scraper
pip install -r requirements.txt

Usage

python main.py
Enter job name: <job name that you are looking for>
Enter place: <location>
Enter radius: <maximum distance from location>

Then after a while, you can check output.html in /indeed_scraper/

Development usage

Fill < > parameters with specific data

scraper = Scraper(<job_name>, <location>, <radius>)
template = Template(scraper.offers, scraper.number_of_offers)

Tests

You can find and run tests in /scraper/tests, by:

python test_scraper.py

Additional info

Class ScraperLocal in /scraper/tests/scraper_local.py is designed to run tests on static version of indeed page, because some elements were hard to check on live version of site, due to constant changes.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published