Switch branches/tags
Nothing to show
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
34 lines (18 sloc) 5.72 KB

Designing the Optimal Employee Experience... For Employers

Joey Maalouf

Medium, an online journal with a dedicated data science category, published an interesting dataset on Kaggle about employee conditions and whether or not a given employee quit their job at that company[1]. It contains 15,000 responses with a variety of potentially useful fields, such as the employee's pay grade, number of projects, and average monthly hours.

The obvious thing to do, with access to all of these features and a categorical result variable, is to train a logistic regression model. With so many different categories, it's hard to really say what primarily drives an employee to quit their job; for that, we can look at the coefficient parameters of our trained model. We can discount any of the features with a p-value greater than 0.05 as not necessarily statistically significant, as we don't want to risk coming to any false or unsupported conclusions.

Figure 1: Logistic Regression Model Coefficients

Figure 1 shows the coefficients of the features in the logistic regression model that can be considered statistically significant, where negative values correspond to conditions in which employees tend to stay and positive values correspond to conditions that make employees want to leave. As we might have suspected, satisfaction level is extremely important to keeping your employees, and the lower the pay grade, the more likely they are to leave. Additionally, a longer employment duration is a big indicator of quitting likelihood; however, this is almost certainly due to the tendency we have of transitioning between jobs over time until we retire.

It's quite interesting that employees are less likely to leave a company if they've suffered a work accident in their time there; my theory is that they tend to receive a benefits package or something similar to convince them to stay as a happy worker and not sue the company. Also of note is the relatively small part the employee's department plays, especially when compared to "major events" like having a work accident or not being promoted. However, within the comparison between departments, human resources workers are most likely to leave, while support staff and miscellaneous employees are least likely to quit.

Now we have the potential to solve an interesting problem: how can employers best optimize their employee conditions to minimize attrition rate while keeping costs low? Well, fortunately enough, our logistic regression model can predict the likelihood of an employee quitting given their current conditions, as well as how that might vary depending on how those conditions change.

To demonstrate, we can look at some theoretical typical employee; someone working 40-hour weeks on a few different projects, with a medium salary and mid-level evaluations and satisfaction. He's been working sales for a few years and hasn't gotten a promotion yet. This individual, according to our model, has a 24.43% chance of quitting before the company lets him go. If we want to lower that, we can look at changing a variety of things; one feature in particular that isn't likely to cost much is the employee's satisfaction level, which can probably change with simple quality of life fixes around the office.

Figure 2: Sample Employee Satisfaction

As Figure 2 shows, even with the other variables like number of projects and working hours held constant, an employee's quit chance can vary by dozens of percent based solely on how much he enjoys being in the office, regardless of pay grade. Alternatively, an employer with a particularly flexible moral compass might opt not to target his employee's satisfaction level, but instead arrange a work-related accident. This would drop our sample employee's probability of leaving to a mere 6.54% and make them much more likely to continue working for the company, for just the cost of a reparations package.

We can also extend the predicted quitting probability for some given set of conditions to be the total attrition rate of a company, department, or other group where all employees exist in conditions similar to the ones given.

Figure 3: Sample Company Departments

Figure 3 gives another example of how employees can be made more or less likely to quit, with all else held constant, this time by changing their department. Another way to look at this, of course, is which departments are most likely to have employees quit, even if all of the groups are treated the same. These results can inform both existing and future situations; giving the human resource employees a chance to be management for a day is the sort of change that would lower a company's current employee attrition rate, and companies now know that in the future, they should pay attention to improving the satisfaction level of their human resource employees more than their management ones.

From the trends observed in all of these features, any condition for which the coefficient positively contributes to employees leaving likely benefits the employer more than the employee, and vice vice versa for the negative coefficients. Using a predictive model for this sort of analysis allows employers to determine which of their current employees is most likely to quit next (and thus, whether the company should tweak their conditions or just let them go), as well as to design their department environments with a maximum allowable attrition rate in mind.

[1] Human Resources Analytics: Why are our best and most experienced employees leaving prematurely?
Notebook Source