**Employee Performance Analysis**

**Project Summary**

**INX Future Inc Employee Performance - Project**
This project analyzes employee performance data from INX Future Inc. The objective is to predict performance ratings from features such as work experience, gender, department, and current role. The project's goals and insights include:

**Department Wise Performances:** The project aims to understand how different departments within the organization contribute to employee performance.

**Top 3 Important Factors Affecting Employee Performance:** Identifying and analyzing the three most significant factors that influence employee performance. These factors are crucial for understanding what drives performance within the company.

**Trained Model for Predicting Employee Performance:** Developing a machine learning model (using random forest classifier and gradient boosted classifier) capable of predicting employee performance based on input factors. This model has achieved an accuracy of 92%. The intention is to utilize this model for future hiring processes.

**Recommendations for Improving Employee Performance:** Providing actionable recommendations derived from the analysis to enhance employee performance. These recommendations are based on insights gained from the project's findings.

**Data Overview:**

*   The dataset comprises 1200 rows and 28 columns.
*   Features are categorized into 16 qualitative and 11 quantitative attributes.
*   The employee ID (EmpNumber) is an alphanumeric identifier and is not treated as a relevant feature for predicting the performance rating.


**Data Analysis Process:**



*   **Data Preprocessing:** Employed one hot encoding to convert string-categorical data into numerical form, ensuring compatibility with machine learning algorithms.

*   **Exploratory Data Analysis:** Conducted distribution analysis and correlation analysis to gain insights into the data's characteristics and relationships between features and performance ratings.

*   **Department-specific Analysis:** Analyzed data for each department individually, understanding performance variations across different departments.

*   **Machine Learning Models:** Utilized random forest classifier and gradient boosted classifier due to the categorical nature of the labeled data. Achieved a high accuracy rate of 92% in predicting employee performance.

*   **Feature Importance:** Used machine learning model feature importance techniques to identify key factors influencing performance ratings.
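The one-hot encoding step described above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: the toy values are invented, and only the column names (`EmpNumber`, `Gender`, `OverTime`, `Age`) come from the dataset description.

```python
import pandas as pd

# Toy frame with invented values; column names follow the dataset description.
df = pd.DataFrame({
    "EmpNumber": ["E1001", "E1002", "E1003", "E1004"],
    "Gender": ["Male", "Female", "Female", "Male"],
    "OverTime": ["Yes", "No", "No", "Yes"],
    "Age": [32, 41, 28, 36],
})

# Drop the identifier column, since it carries no predictive signal.
features = df.drop(columns=["EmpNumber"])

# One-hot encode the string-valued columns into 0/1 indicator columns.
encoded = pd.get_dummies(features, columns=["Gender", "OverTime"], dtype=int)

print(encoded.columns.tolist())
```

Each category becomes its own indicator column (for example `Gender_Female` and `Gender_Male`), while numeric columns such as `Age` pass through unchanged.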

**Key Findings:**

*   Identified department-wise performance disparities.

*   Determined the top three factors significantly impacting employee performance.

*   Established a predictive model with a 92% accuracy rate, aiding in future hiring decisions.

*   Recommendations derived from the analysis can be implemented to enhance overall employee performance within the organization.


**1. Requirement**

The data for this project was provided by IABAC™ and is based on a fictional organization, INX Future Inc. The project was conducted in a Jupyter Notebook using Python. INX Future Inc is a leading data analytics and automation solutions provider with over 15 years of global business presence. It has been consistently rated among the top 20 best employers for the past 5 years.

**2. Analysis**

It's essential to start the analysis by understanding the features within the dataset. Describing the features is a fundamental step in data analysis. It helps in grasping the nature of the data and the relationships between different variables. By categorizing the features into numerical and categorical data, it becomes easier to choose appropriate analytical methods and techniques for further exploration and modeling.

**Categorical Features:**
Categorical features include EmpNumber, Gender, EducationBackground, MaritalStatus, EmpDepartment, EmpJobRole, BusinessTravelFrequency, OverTime, Attrition.

**Numerical Features:**
Numerical features include Age, DistanceFromHome, EmpHourlyRate, NumCompaniesWorked, EmpLastSalaryHikePercent, TotalWorkExperienceInYears, TrainingTimesLastYear, ExperienceYearsAtThisCompany, ExperienceYearsInCurrentRole, YearsSinceLastPromotion, YearsWithCurrManager.

**Ordinal Features:**
EmpEducationLevel, EmpEnvironmentSatisfaction, EmpJobInvolvement, EmpJobLevel, EmpJobSatisfaction, EmpRelationshipSatisfaction, EmpWorkLifeBalance, PerformanceRating.

**Alphanumeric Features:**
EmpNumber is an alphanumeric feature: each value combines letters and digits and is unique to one employee.
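The feature grouping above could be reproduced along these lines. This is a sketch under the assumption that ordinal features are stored as integer codes and therefore have to be listed by name. The toy values are invented, and only a few of the dataset's columns are shown.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype

# Toy frame with invented values and a subset of the dataset's columns.
df = pd.DataFrame({
    "EmpNumber": ["E1001", "E1002", "E1003"],
    "Gender": ["Male", "Female", "Male"],
    "Age": [32, 41, 28],
    "EmpJobLevel": [2, 3, 1],
    "PerformanceRating": [3, 4, 2],
})

# Assumption: ordinal columns are integer-coded, so dtype alone cannot identify them.
ordinal_cols = ["EmpJobLevel", "PerformanceRating"]

# Non-numeric columns are treated as categorical (this includes the EmpNumber identifier).
categorical_cols = [c for c in df.columns if not is_numeric_dtype(df[c])]

# Remaining numeric columns, excluding the ordinal ones, are treated as numerical.
numerical_cols = [
    c for c in df.columns
    if is_numeric_dtype(df[c]) and c not in ordinal_cols
]

print(categorical_cols, numerical_cols, ordinal_cols)
```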

**Data Distribution Insights:**

*   Age distribution: The majority of employees are between 30 and 40 years old.
*   Most employees worked at up to 2 companies before joining this one.
*   Hourly rate ranges from 65 to 95 for most employees.
*   Most employees have worked at the company for up to 5 years.
*   Salary hike percentages are mainly between 11% and 15%.

**Data Cleaning and Preprocessing:**

*   No missing data was found in the given dataset.
*   Identified outliers in certain numerical features and performed data preprocessing techniques like square root transformation to handle skewed data.
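To illustrate why a square root transformation helps with skewed data, the sketch below applies it to a synthetic right-skewed column and compares the skewness before and after. The data is generated only for illustration, and the use of `YearsSinceLastPromotion` as the column name is an assumption. The source does not say which features were transformed.

```python
import numpy as np
import pandas as pd

# Synthetic right-skewed, non-negative column (illustrative only).
rng = np.random.default_rng(0)
years = pd.Series(rng.exponential(scale=3.0, size=1000),
                  name="YearsSinceLastPromotion")

skew_before = years.skew()

# The square root compresses large values more than small ones,
# which pulls in the long right tail.
transformed = np.sqrt(years)
skew_after = transformed.skew()

print(f"skew before: {skew_before:.2f}, after: {skew_after:.2f}")
```

The transformation only applies to non-negative values, which holds for count-like features such as years of experience.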

**Analysis by Visualization:**
Created a correlation heatmap to identify relationships between numerical features. Identified significant correlations between experience years, job levels, and promotions.
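The correlation matrix behind such a heatmap can be computed with pandas. The following sketch uses synthetic data in which two experience-related columns are correlated by construction. The 0.7 threshold for a "strong" correlation is an assumption for illustration, not a value taken from the project.

```python
import numpy as np
import pandas as pd

# Synthetic data: years at the company is built from total experience plus noise.
rng = np.random.default_rng(42)
n = 500
total_exp = rng.integers(0, 30, size=n).astype(float)
df = pd.DataFrame({
    "TotalWorkExperienceInYears": total_exp,
    "ExperienceYearsAtThisCompany": total_exp * 0.6 + rng.normal(0, 1.0, n),
    "EmpHourlyRate": rng.uniform(30, 100, n),
})

# Pearson correlation between every pair of numerical columns.
corr = df.corr()

# Keep only the upper triangle so each pair appears once, then rank the pairs.
upper = np.triu(np.ones(corr.shape, dtype=bool), k=1)
pairs = corr.where(upper).stack().dropna().sort_values(ascending=False)

# Assumed threshold for calling a correlation strong.
strong = pairs[pairs.abs() > 0.7]
print(strong)
```

If seaborn is installed, the same matrix can be drawn as a heatmap with `sns.heatmap(corr, annot=True)`.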

**Machine Learning Models:**

*   Implemented several types of machine learning models for prediction.
*   Addressed class imbalance using the SMOTE method.
*   Achieved high accuracy rates: 97% with XGBoost and 98% with Random Forest Classifier.
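The idea behind SMOTE is to create synthetic minority-class rows by interpolating between a minority sample and one of its nearest minority neighbours. The function below is a simplified NumPy stand-in written for illustration only. It is not the project's code, and `smote_like_oversample` is a hypothetical helper name. In practice the `SMOTE` class from the imbalanced-learn package, with its `fit_resample` method, is the usual choice.

```python
import numpy as np

def smote_like_oversample(X_minority, n_new, k=3, seed=0):
    """Create n_new synthetic rows by interpolating between minority-class neighbours."""
    rng = np.random.default_rng(seed)
    X_minority = np.asarray(X_minority, dtype=float)

    # Pairwise Euclidean distances within the minority class.
    diffs = X_minority[:, None, :] - X_minority[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbour

    # Indices of the k nearest minority neighbours of each point.
    neighbours = np.argsort(dists, axis=1)[:, :k]

    # Pick a random base point and one of its neighbours for each new row.
    base = rng.integers(0, len(X_minority), size=n_new)
    chosen = neighbours[base, rng.integers(0, k, size=n_new)]

    # Place the new row at a random position on the segment between the two.
    gap = rng.random((n_new, 1))
    return X_minority[base] + gap * (X_minority[chosen] - X_minority[base])

# Four invented minority-class rows with two features each.
X_min = np.array([[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]])
synthetic = smote_like_oversample(X_min, n_new=6)
print(synthetic.shape)
```

Oversampling should be applied to the training split only, so that synthetic rows do not leak into the evaluation data.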


**3. Summary**

The data analysis project focused on understanding and predicting employee performance at INX Future Inc. The goals were achieved through a comprehensive analysis of the dataset and the implementation of machine learning models.

**I: Department-wise Performances**


*   **Sales:** Excellent performance was predominant, with slightly higher ratings for males. Total work experience did not significantly influence performance.

*   **Human Resources:** Majority excelled, especially female employees. Older individuals had lower performance. Experience played a role in performance here.

*   **Development:** Largest number of excellent performers. Age and gender had minimal impact.

*   **Data Science:** Highest average rate of excellent performance. Experience mattered, but not age. Males performed well here.

*   **Research & Development:** Performance varied across ages, with good female performers.

*   **Finance:** Performance declined with age. Males performed better, and experience inversely impacted performance.

**II: Top 3 Important Factors Affecting Employee Performance**

*   Employee Environment Satisfaction

*   Employee Salary Hike Percentage

*   Experience Years in Current Role

These factors were identified through feature selection techniques and correlation analysis.
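One common way to rank factors like these is the impurity-based feature importance of a tree ensemble. The sketch below uses scikit-learn's `RandomForestClassifier` on synthetic data. The labelling rule is invented so that one feature clearly dominates, and it says nothing about the real dataset or about the exact procedure used in the project.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

# Synthetic features; column names follow the dataset, values are random.
rng = np.random.default_rng(0)
n = 600
X = pd.DataFrame({
    "EmpEnvironmentSatisfaction": rng.integers(1, 5, n),
    "EmpLastSalaryHikePercent": rng.integers(11, 26, n),
    "Age": rng.integers(20, 60, n),
})

# Invented rule for illustration: the rating depends only on environment satisfaction.
y = np.where(X["EmpEnvironmentSatisfaction"] >= 3, 3, 2)

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# Rank features by their impurity-based importance (the values sum to 1).
importances = (
    pd.Series(model.feature_importances_, index=X.columns)
    .sort_values(ascending=False)
)
print(importances)
```

Impurity-based importances can favour features with many distinct values, so cross-checking the ranking against the correlation analysis is a sensible safeguard.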

**III: Trained Models and Predictions**

*   Logistic Regression Accuracy: 78%.

*   Support Vector Machine (SVM) Accuracy: 92%.

*   Decision Tree Accuracy: 91%.

*   Random Forest Accuracy: 98%.
*   XGBoost Accuracy: 97%.

*   K-Nearest Neighbor (KNN) Accuracy: 81%.

*   Naive Bayes (Bernoulli) Accuracy: 67%.

**IV: Recommendations to Improve Employee Performance**

*   Focus on Employee Environment Satisfaction: Enhance the work environment to boost overall performance.
*   Salary Hike: Providing regular and fair salary hikes can motivate employees.
*   Promotions: Encourage promotions to nurture leadership qualities and responsibility.
*   Experience Years in Current Role: Review and optimize the years required for promotions.
*   Work-Life Balance: Balance work demands for better performance and employee well-being.
*   Recruitment Strategies: Consider female candidates for HR positions due to their strong performance.
*   Special Attention to Low & Medium Job/Relationship Satisfaction Employees: Understand their needs and motivations to improve their performance.

**In conclusion, the analysis highlighted department-specific performance trends and crucial factors affecting employee performance. The provided machine learning models exhibited high accuracy, providing a foundation for making informed decisions and implementing strategic improvements within INX Future Inc.**