
# 🏢 Salifort Motors: Employee Retention Analysis

This notebook analyzes employee attrition at Salifort Motors, using logistic regression to predict which employees leave (the `left` column) and to surface factors associated with turnover. It covers data cleaning, feature encoding, model training, and evaluation.


## 📂 Load Dataset (Colab Compatible)

In [None]:

import os
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import LabelEncoder
from sklearn.metrics import classification_report, confusion_matrix, ConfusionMatrixDisplay

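import warnings
# Silence only FutureWarning noise; keep ConvergenceWarning visible so fit problems are not hidden.
warnings.filterwarnings('ignore', category=FutureWarning)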

# ✅ Download CSV from GitHub if running in Colab
if "google.colab" in str(get_ipython()):
    !wget https://raw.githubusercontent.com/Rafsun-Chowdhury/Salifort-Motors-Employee-Retention-Project-/main/HR_capstone_dataset.csv

# Load dataset
df = pd.read_csv("HR_capstone_dataset.csv")
df.head()


## 🧹 Data Cleaning & Encoding

In [None]:

df = df.rename(columns={
    'Work_accident': 'work_accident',
    'average_montly_hours': 'average_monthly_hours',
    'time_spend_company': 'tenure',
    'Department': 'department'
})
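# Drop exact duplicate rows so repeated records are not counted twice.
df = df.drop_duplicates()
# Encode salary on its natural scale (assumes the levels are 'low', 'medium', 'high').
# LabelEncoder would sort them alphabetically: high=0, low=1, medium=2.
salary_order = {'low': 0, 'medium': 1, 'high': 2}
df['salary'] = df['salary'].map(salary_order)
# One-hot encode department; drop_first avoids a redundant (collinear) column.
df = pd.get_dummies(df, columns=['department'], drop_first=True)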
df.head()
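
How `salary` is encoded matters because logistic regression treats the codes as a numeric scale. The sketch below compares `LabelEncoder`'s alphabetical codes with an explicit ordered mapping. It uses a toy series, and the three level names are an assumption about the column's contents rather than something verified here.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Toy stand-in for the salary column; the level names are assumed.
salary = pd.Series(["low", "medium", "high", "low"])

# LabelEncoder sorts classes alphabetically: high=0, low=1, medium=2.
alphabetical = LabelEncoder().fit_transform(salary)

# An explicit mapping keeps the natural order: low=0, medium=1, high=2.
salary_order = {"low": 0, "medium": 1, "high": 2}
ordinal = salary.map(salary_order)

print(alphabetical.tolist())  # [1, 2, 0, 1]
print(ordinal.tolist())       # [0, 1, 2, 0]
```

With the alphabetical codes, "high" sits below "low", so a single coefficient cannot describe a monotonic salary effect. The ordered mapping avoids that.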


## ✂️ Train-Test Split

In [None]:

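# Separate the features from the target column 'left'.
X = df.drop('left', axis=1)
y = df['left']
# Hold out 20% for testing; stratify=y keeps the class ratio the same in both splits.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, stratify=y, random_state=42)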
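
To illustrate what `stratify=y` does, here is a small sketch on synthetic data. The 20% positive rate and the `*_demo` names are invented for the example and are not taken from the HR dataset.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic imbalanced target: 200 positives out of 1000 (an assumed rate).
y_demo = np.array([1] * 200 + [0] * 800)
X_demo = np.arange(len(y_demo)).reshape(-1, 1)

X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.2, stratify=y_demo, random_state=42
)

# With stratification, both splits keep roughly the same positive rate.
print(f"train positive rate: {y_tr.mean():.3f}")
print(f"test positive rate:  {y_te.mean():.3f}")
```

Running the same check on `y_train` and `y_test` in the notebook should show matching attrition rates in the two splits.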


## 🤖 Train Logistic Regression Model

In [None]:

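# Fit on the training split only; max_iter is raised from the default of 100 to help the solver converge.
model = LogisticRegression(max_iter=500)
model.fit(X_train, y_train)
# Predict class labels (default 0.5 threshold) for the held-out test set.
y_pred = model.predict(X_test)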
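
The features here sit on very different scales (monthly hours in the hundreds, tenure in single digits), which can slow convergence and makes raw coefficients hard to compare. One possible variation, not what the notebook does, is to standardize inside a pipeline and then read the coefficients. The sketch uses synthetic data: the column names are borrowed from the notebook, but the values and the relationship to the target are invented.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
n = 500

# Synthetic features on different scales (values are invented).
hours = rng.uniform(100, 300, n)
tenure = rng.integers(2, 11, n)
X_demo = pd.DataFrame({"average_monthly_hours": hours, "tenure": tenure})

# Synthetic target: longer hours raise the chance of leaving; tenure has no effect.
p_leave = 1 / (1 + np.exp(-(hours - 200) / 25))
y_demo = (rng.uniform(0, 1, n) < p_leave).astype(int)

# Standardize, then fit; scaling happens inside the pipeline so the
# scaler only ever sees the data passed to fit().
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=500))
pipe.fit(X_demo, y_demo)

# Coefficients are now per standard deviation, so magnitudes are comparable.
coefs = pd.Series(
    pipe.named_steps["logisticregression"].coef_[0], index=X_demo.columns
)
print(coefs.sort_values(ascending=False))
```

On standardized inputs, a larger absolute coefficient means a stronger association with the log-odds of leaving. That gives a starting point for discussing drivers of attrition, though it does not show causation.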


## 📈 Model Evaluation

In [None]:

print("Classification Report:")
print(classification_report(y_test, y_pred))

cm = confusion_matrix(y_test, y_pred)
disp = ConfusionMatrixDisplay(confusion_matrix=cm)
disp.plot()
plt.title("Confusion Matrix")
plt.show()
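
The classification report and confusion matrix reflect the default 0.5 decision threshold. When leavers are the minority class, it can help to also look at predicted probabilities. The sketch below computes ROC AUC and compares recall at two thresholds. It runs on synthetic data from `make_classification`; the 0.3 threshold and the class balance are arbitrary choices for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score, roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic imbalanced data standing in for the HR dataset (about 20% positives).
X_demo, y_demo = make_classification(
    n_samples=2000, n_features=8, weights=[0.8, 0.2], random_state=0
)
X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.2, stratify=y_demo, random_state=42
)
clf = LogisticRegression(max_iter=500).fit(X_tr, y_tr)

# Probability of the positive class (the analogue of left = 1).
proba = clf.predict_proba(X_te)[:, 1]
auc = roc_auc_score(y_te, proba)

# Lowering the threshold flags more employees, which can only raise recall.
recall_default = recall_score(y_te, (proba >= 0.5).astype(int))
recall_lower = recall_score(y_te, (proba >= 0.3).astype(int))

print(f"ROC AUC: {auc:.3f}")
print(f"Recall at 0.5: {recall_default:.3f}, at 0.3: {recall_lower:.3f}")
```

In the notebook, the same calls would take `model.predict_proba(X_test)[:, 1]` and `y_test`. The right threshold depends on how HR weighs a missed leaver against an unnecessary intervention.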



## ✅ Conclusion

This notebook trains a logistic regression model to predict which Salifort Motors employees are likely to leave. Its predictions flag attrition risk, and its coefficients indicate which factors are associated with turnover, giving HR teams a data-driven basis for retention strategies.
