Skip to content

A web scraper that extracts job listing information from Indeed.de and outputs results to a CSV file

License

Notifications You must be signed in to change notification settings

dan-grant-hunter/Indeed_Scraper_DE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Indeed.de Job Scraper

This is a simple program that extracts the following information from job listings on https://de.indeed.com/:

  • Posted (When the job listing was posted - translated from German)
  • Job Title
  • Company
  • Location
  • Job Description
  • Job URL

Two columns are also generated and prepended to the extracted data:

  • Date (The date the search was performed)
  • Time (The time the search was performed)

How to use

To use the program, follow the steps below:

  • Ensure both indeed_job_info.py and indeed_scraper.py are in the same directory
  • Create a new directory inside the directory that contains the above files and name it 'output_data'
  • Change the NUMBER_OF_SEARCH_PAGES variable inside of indeed_scraper.py to the number of pages you would like to extract (each page currently contains 15 job listings)
  • Run indeed_scraper.py

Once the program has finished running, a CSV titled with the current date will be downloaded into the 'output_data' folder. From here, you can open the CSV file and sort through the job listing data.


NOTE

The program is currently set up to search for English-speaking job listings in the Berlin area which are then sorted by date posted, however, the location, sort_type, and language variables within indeed_scraper.py can be changed to generate different search results.


About

A web scraper that extracts job listing information from Indeed.de and outputs results to a CSV file

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages