# Data Jobs Analysis




##### _Business Intelligence and Data Analytics Jobs in Bulgaria - Availability and Required Skillset._



## Abstract

This document provides a view of the job market for data analytics and business intelligence in Bulgaria, based on job offers submitted over a period of more than a year. It focuses on data presentation and tries to give an objective picture of the professional demand for what we refer to as **data jobs**. While this project was mostly conceived and executed as a learning exercise, we believe the insights gathered are valuable to both job seekers and employers who use data-driven methods in their search.

## Job Titles Analysed

### Defining the Research Object

So what exactly do we consider a *data job*? The objective of this analysis was to select offers related to the fields of data engineering, business intelligence, data analytics and reporting. We attempted to exclude basic data entry and digitization jobs, as well as false hits unrelated to these categories. More on the keyword selection below.

### Data Jobs Filtering

Let's say we are interested in job offers in the field of data analysis, data integration, data wrangling, etc. The first step is to filter the offers that are relevant to our research. This was done with a targeted query on the offer titles, followed by searches for selected keywords in the offer contents to identify additional strings to look for in the titles. The keywords used in the final filter are listed below:

```
'bi( |$)'
'(data|business) intelligence'
'etl( |$)'
'data analy(st|tics|sis)'
'анализ.*данни'
'data (engineer|scientist|warehouse)'
'reporting (analyst|specialist)'
'tableau'
'qlikview'
```
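
As a minimal sketch of how such a filter could be applied, the snippet below combines a subset of the patterns above into one case-insensitive regular expression and tests it against offer titles. The function name `is_data_job` and the sample titles are hypothetical and serve only as illustration; the actual filtering code used in the project may differ.

```python
import re

# A subset of the title patterns from the keyword list above.
DATA_JOB_PATTERNS = [
    r'bi( |$)',
    r'(data|business) intelligence',
    r'etl( |$)',
    r'data analy(st|tics|sis)',
    r'анализ.*данни',
    r'data (engineer|scientist|warehouse)',
    r'reporting (analyst|specialist)',
    r'tableau',
]

# Join the patterns into a single alternation; IGNORECASE also covers Cyrillic.
DATA_JOB_RE = re.compile(
    '|'.join(f'(?:{p})' for p in DATA_JOB_PATTERNS),
    re.IGNORECASE,
)


def is_data_job(title):
    """Return True if the offer title matches any of the patterns."""
    return DATA_JOB_RE.search(title) is not None


# Hypothetical offer titles, for illustration only.
titles = [
    "Senior BI Developer",
    "Data Analyst",
    "Анализатор на данни",
    "Data Entry Operator",
    "Mobile Developer",
]
matches = [t for t in titles if is_data_job(t)]
```

Note that a pattern such as `'bi( |$)'` is not anchored to the start of a word, so it would also match any title containing a word that ends in "bi". This is one reason the filtered results need to be verified by hand.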

### Target Results Overview

The result of our initial filter is a small subset of job offers which we can explore further:

In [36]:
%%HTML 
<iframe width="100%" height="525px" seamless="seamless" src="./data_offers_pie_and_bar.html"></iframe>

## Temporal Analysis

Having identified and verified our set of targeted offers, we can now provide some historical perspective for the time period covered by our study. Let's start by showing how the number of submitted data job offers has changed at two levels of granularity: weekly and monthly.
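
The binning itself can be sketched with the standard library alone. The example below counts offers per calendar month and per ISO week; the helper names and the submission dates are hypothetical, and the linked recipes may use a different approach.

```python
from collections import Counter
from datetime import date


def count_by_month(dates):
    """Count offers per (year, month)."""
    return Counter((d.year, d.month) for d in dates)


def count_by_iso_week(dates):
    """Count offers per (ISO year, ISO week number)."""
    # isocalendar() returns (ISO year, ISO week, ISO weekday); keep the first two.
    return Counter(tuple(d.isocalendar()[:2]) for d in dates)


# Hypothetical submission dates, for illustration only.
submitted = [
    date(2018, 1, 1),
    date(2018, 1, 3),
    date(2018, 1, 8),
    date(2018, 2, 1),
]

monthly = count_by_month(submitted)
weekly = count_by_iso_week(submitted)
```

ISO weeks start on Monday and can straddle a month boundary, so weekly and monthly totals do not line up exactly.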




### Offers over time bar chart

The chart shows a noticeable increase in the total number of data-related jobs available.

In [10]:
%%HTML 
<iframe width="100%" height="525px" seamless="seamless" src="./data_offers_over_time_bar_chart.html"></iframe>

### Offers over time heatmap

Heatmaps are another useful way to present historical data and visually explore the trends. This one confirms the increase in demand shown in the previous chart, but also reveals weekly dynamics.



In [8]:
%%HTML 
<iframe width="100%" height="525px" src="./data_offers_subm_heatmap.html"></iframe>

## Key Players & Locations


### Key Players by Total Number of Submitted Data Jobs

In [16]:
%%HTML 
<iframe width="100%" height="550px" src="./data_offers_key_players_pareto.html"></iframe>

### Locations
- Identify location trends.
- What are the salaries for our targets?

## Skills and Tools Requirements

We use the requirements sections of the offers to produce a list of the top technologies, then search the offers' contents for them and summarise the results.
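
As a rough illustration of this step, the sketch below counts how many offers mention each technology and how often two technologies appear in the same offer, which is the kind of pairwise data a chord diagram displays. The technology list and the offer texts are hypothetical. The linked recipe relies on NLTK, while this sketch swaps in plain regular expressions to stay self-contained.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical list of technologies to look for.
TECHNOLOGIES = ["sql", "python", "excel", "tableau"]


def technologies_in(text):
    """Return the set of technologies mentioned in an offer text."""
    return {
        tech
        for tech in TECHNOLOGIES
        if re.search(r'\b' + re.escape(tech) + r'\b', text, re.IGNORECASE)
    }


def summarise(offers):
    """Count single mentions and pairwise co-occurrences across offers."""
    mentions = Counter()
    pairs = Counter()
    for text in offers:
        found = sorted(technologies_in(text))
        mentions.update(found)
        # Sorted input keeps each pair in a consistent (a, b) order.
        pairs.update(combinations(found, 2))
    return mentions, pairs


# Hypothetical offer texts, for illustration only.
offers = [
    "Strong SQL and Python skills required.",
    "Advanced Excel; SQL is a plus.",
    "Experience with Tableau, SQL and Python.",
]
mentions, pairs = summarise(offers)
```

Each technology is counted at most once per offer, so the totals reflect the number of offers that mention it rather than the number of occurrences.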

In [12]:
%%HTML 
<iframe width="100%" height="550px" src="./data_offers_tech_requirements_chord.html"></iframe>

## Remuneration Analysis

In [13]:
%%HTML 
<iframe width="100%" height="525px" src="./data_offers_scatter.html"></iframe>

## Appendix

### Visualization Recipes

Recipes for all charts used in this presentation are available at the links below:

1. Data Offers Share Pie and Bar [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Share_Pie_and_Bar_Chart.ipynb)
2. Data Offers Over Time Bar Chart [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Historical_Bar_Charts.ipynb)
3. Data Offers Over Time Heatmap [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Heatmap.ipynb)
4. Key Players by Total Number of Submitted Data Jobs (Pareto Chart) [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Key_Players_Double_Bar_Chart.ipynb)
5. Deep Dive into Data Offers Requirements with NLTK [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Requirements_Deep_Dive.ipynb)
6. A Chord Diagram Revealing Key Data Jobs Technology Requirements [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Requirements_Relationships_Chord.ipynb)
7. Data Offers Salary Scatterplot [open notebook](https://nbviewer.jupyter.org/github/nikolovdeyan/Job_Market_Trends_Bulgaria/blob/master/workbooks/Data_Offers_Salary_Scatter.ipynb)


In [14]:
# Load the custom stylesheet and render it in the notebook output.
from IPython.display import HTML
with open('../resources/styles/datum.css', 'r') as f:
    style = f.read()
HTML(style)