
Jobs were scraped from LinkedIn and analyzed in order to optimize a CV and cover letter.


Mavengence/linkedin-job-scraper-data-analysis


Bosch LinkedIn Scraper - Data Analysis

Tim Löhr


Links to Resources

Repository Structure


├── (1) Data_Analysis.ipynb                             <- All of the analysis lives here
│
├── (2) data_analysis.py                                <- All of the functions used are here
│
├── (3) features.py                                     <- All of the lists used for dataset building are here
│
├── (4) linkedin_jobs_scraper.py                        <- The script that scrapes the jobs and builds the CSV file
│
├── (5) _secrets.py                                     <- Contains the username and token for the Plotly graphics upload
│
├── linkedin_all_jobs.csv                               <- Contains the scraped data
├── environment.yml
├── example.png
├── .gitignore
└── README.md
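
As a rough illustration of how keyword lists like those in features.py could turn a scraped job description into dataset columns, here is a minimal sketch. The skill list and the function name `extract_skills` are hypothetical and are not taken from the repository.

```python
import re

# Hypothetical skill list; the real lists live in features.py and may differ.
SKILLS = ["python", "sql", "docker", "kubernetes", "spark"]


def extract_skills(description, skills=SKILLS):
    """Return a {skill: 0 or 1} mapping marking which skills a description mentions."""
    text = description.lower()
    found = {}
    for skill in skills:
        # Lookarounds keep "sql" from matching inside "mysql", for example.
        pattern = r"(?<!\w)" + re.escape(skill) + r"(?!\w)"
        found[skill] = int(re.search(pattern, text) is not None)
    return found
```

Each returned mapping could then become one row of a table, with one column per skill.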

Abstract

This project aims to identify which skills companies require for certain job positions, differentiated by the level of the position, such as entry level or senior level. For this purpose, Robert Bosch in Germany was chosen as the company, together with three roles: data scientist, DevOps engineer, and data engineer. Ternary plots show which skills these three roles share and which set them apart. With this awareness of a company's likely tech stack, a CV and cover letter can be optimized.
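
To make the ternary-plot idea concrete, the following minimal sketch shows how per-role mention counts for a skill could be normalized into the three coordinates of a ternary plot. The counts and the function name `ternary_coordinates` are made up for illustration; the notebook's actual computation may differ, for example by first dividing by the number of postings per role.

```python
def ternary_coordinates(counts):
    """Normalize per-role mention counts so the three shares sum to 1."""
    total = sum(counts.values())
    if total == 0:
        # A skill that is never mentioned has no meaningful position.
        return {role: 0.0 for role in counts}
    return {role: n / total for role, n in counts.items()}


# Hypothetical numbers of postings mentioning each skill, per role.
skill_counts = {
    "python": {"data_scientist": 40, "devops_engineer": 20, "data_engineer": 40},
    "docker": {"data_scientist": 5, "devops_engineer": 30, "data_engineer": 15},
}

for skill, counts in skill_counts.items():
    print(skill, ternary_coordinates(counts))
```

A skill whose point lies near one corner is mostly specific to that role, while a point near the center indicates a skill shared by all three roles.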

Example

Built With

  • Python 3.9.12
  • All of the following steps are for macOS; steps for Windows or Linux may differ

Getting Started

To get a local copy up and running, follow these simple steps.

Prerequisites

  • Python 3
# Install via brew on macOS
brew install python

For Linux and Windows refer to this.

Installation

  • Clone the repository
  • Install conda
  • Create the conda environment from environment.yml
# Run this at repository root
conda env create -f environment.yml
  • Run the scraper
  • Get the cookie from your LinkedIn profile, as shown here
# Run this at repository root
LI_AT_COOKIE=XXX python linkedin_jobs_scraper.py
  • Open the data analysis notebook
# Run this at repository root
jupyter notebook
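
The scraper step above passes the cookie through the `LI_AT_COOKIE` environment variable. As a minimal sketch, a script could check that the variable is set before starting a long scraping run. The helper `read_li_at_cookie` is hypothetical and is not part of linkedin_jobs_scraper.py.

```python
import os


def read_li_at_cookie(env=None):
    """Return the LI_AT_COOKIE value, or raise a clear error if it is missing."""
    # Allow a plain dict to be passed in so the check is easy to test.
    env = os.environ if env is None else env
    cookie = env.get("LI_AT_COOKIE", "").strip()
    if not cookie:
        raise RuntimeError(
            "LI_AT_COOKIE is not set; run the scraper as "
            "LI_AT_COOKIE=<value> python linkedin_jobs_scraper.py"
        )
    return cookie
```

Failing early with an explicit message is usually easier to debug than an authentication error that surfaces partway through a scrape.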
