# <font face = 'Impact' color = '#FFAEBC' > Getting Ready with our Third Mini-Project </font>
#### <font face = 'Times New Roman' color = '#A16AE8'> Author and Trainer: Paolo Hilado MSc. (Data Science)</font>
In this notebook, we will import packages and use them to load and explore data sets. We will use the pandas package to load data sets and the qdesc package to explore both categorical and numerical variables. Functions to be used include pd.read_excel, pd.read_csv, head, qd.desc, qd.grp_desc, qd.normcheck_dashboard, qd.freqdist, qd.freqdist_a, and qd.freqdist_to_excel.

# Name of Participant:

# <font face = 'Impact' color = '#FFAEBC' > Exploring Job Satisfaction and Employee Performance </font>

In [None]:
# Importing the necessary packages for us to use certain functions.
import pandas as pd
import numpy as np
import qdesc as qd
import researchpy as rp

In [None]:
# Loading a data set and storing it into a variable.
df1 = pd.read_excel("JobSatPerf.xlsx")
# Checking the first five rows of the data set.
df1.head(5)
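
The cell above loads an Excel file; a CSV file can be loaded in much the same way with pd.read_csv. The sketch below is only illustrative: the CSV text and its values are made up, and only the column names (Sex, JobSat, EmpPerf) are borrowed from this notebook. An in-memory string stands in for a real file so the example runs without any data file on disk.

```python
import io
import pandas as pd

# Hypothetical CSV content standing in for a real file on disk.
csv_text = """Sex,JobSat,EmpPerf
Male,3.5,4.0
Female,4.2,4.5
Male,2.8,3.1
"""

# pd.read_csv accepts a file path or any file-like object.
df_csv = pd.read_csv(io.StringIO(csv_text))

# head(n) returns the first n rows (the default is 5).
print(df_csv.head(2))
```

With a real file, the call would simply take the file name, in the same way pd.read_excel does above.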

In [None]:
# Default way of generating descriptive statistics in pandas (transposed so each variable is a row).
df1.describe().T

In [None]:
# Using the desc function from the qdesc package to perform descriptive analysis.
qd.desc(df1)

In [None]:
# Saving our results to a spreadsheet file.
res = qd.desc(df1)
res.to_excel("JobSatPerfDescriptives.xlsx")

In [None]:
# Generate descriptive analysis for job satisfaction when grouped by sex.
qd.grp_desc(df1, 'JobSat', 'Sex')

In [None]:
# Generate descriptive analysis for employee performance when grouped by sex.
qd.grp_desc(df1, 'EmpPerf', 'Sex')
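
For comparison, grouped descriptives can also be produced with plain pandas. This is a minimal sketch on made-up data, not a reproduction of what qd.grp_desc reports; the demo data frame and its values are hypothetical.

```python
import pandas as pd

# Hypothetical data; only the column names mirror this notebook.
demo = pd.DataFrame({
    "Sex": ["Male", "Female", "Male", "Female"],
    "JobSat": [3.0, 4.0, 5.0, 2.0],
})

# Group by Sex, then summarize JobSat within each group.
grouped = demo.groupby("Sex")["JobSat"].agg(["count", "mean", "std", "min", "max"])
print(grouped)
```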

In [None]:
# Using the normcheck_dashboard function from the qdesc package to explore the data for the assumption of normality.
qd.normcheck_dashboard(df1)
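
As a quick numeric complement to a visual normality check, skewness and excess kurtosis can be computed directly with pandas. This is a sketch on made-up scores and makes no claim about what qd.normcheck_dashboard computes internally; values near zero on both measures are broadly consistent with a normal shape.

```python
import pandas as pd

# Hypothetical scores, perfectly symmetric around 3.
scores = pd.Series([1.0, 2.0, 3.0, 4.0, 5.0])

# Sample skewness: 0 for symmetric data.
skewness = scores.skew()

# Excess kurtosis (Fisher definition): 0 for a normal distribution,
# negative for flatter-than-normal data such as these evenly spaced values.
excess_kurtosis = scores.kurt()

print(skewness, excess_kurtosis)
```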

In [None]:
# Slicing the data frame to have a subset of the data for males.
Male = df1[df1['Sex'] == 'Male'] # Slicing the data frame to only include the males.
Male.head() # Checking out the data subset.

In [None]:
# Confirming that the subset is all males.
np.unique(Male.Sex)
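
The same slice-then-confirm pattern can be sketched with pandas alone. The data below are made up for illustration; the boolean check returns a single True or False, which is handy when the subset is large.

```python
import pandas as pd

# Hypothetical data; only the column names mirror this notebook.
demo = pd.DataFrame({
    "Sex": ["Male", "Female", "Male"],
    "JobSat": [3.0, 4.0, 5.0],
})

# Boolean mask: keep only rows where Sex equals 'Male'.
male_demo = demo[demo["Sex"] == "Male"]

# Two ways to confirm the subset: list the distinct labels,
# or test that every remaining row carries the expected label.
labels = male_demo["Sex"].unique()
all_male = (male_demo["Sex"] == "Male").all()

print(labels, all_male)
```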

In [None]:
# Using the normcheck_dashboard function from the qdesc package to explore the male subset for the assumption of normality.
qd.normcheck_dashboard(Male)

In [None]:
# Slicing the data frame to have a subset of the data for females.
Female = df1[df1['Sex'] == 'Female'] # Slicing the data frame to only include the females.
Female.head() # Checking out the data subset.

In [None]:
# Confirming that the subset is all females.
np.unique(Female.Sex)

In [None]:
# Using the normcheck_dashboard function from the qdesc package to explore the female subset for the assumption of normality.
qd.normcheck_dashboard(Female)

In [None]:
# Generating a frequency distribution for a specified variable: Sex
qd.freqdist(df1, 'Sex')
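
A frequency distribution of this kind can also be built with pandas value_counts. The sketch below uses made-up data, and the output column names (Count, Percentage) are chosen here for illustration; they are not necessarily the ones qd.freqdist produces.

```python
import pandas as pd

# Hypothetical data; only the column name mirrors this notebook.
demo = pd.DataFrame({"Sex": ["Male", "Female", "Male", "Male"]})

# Raw counts per category.
counts = demo["Sex"].value_counts()

# Proportions per category, converted to percentages.
percent = demo["Sex"].value_counts(normalize=True) * 100

# Combine both into one table, aligned on the category labels.
freq_table = pd.DataFrame({"Count": counts, "Percentage": percent})
print(freq_table)
```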

In [None]:
# Generating frequency distributions again, this time storing them in an Excel file
# with one sheet per categorical feature.
qd.freqdist_to_excel(df1, "JobSatPerfCategorical.xlsx")