## Here is an outline of the steps we'll cover:

* Importing the necessary libraries
* Loading the dataset
* Data preprocessing (if needed)
* Formulating the null and alternative hypotheses
* Choosing the appropriate t-test based on the study design
* Conducting the t-test
* Interpreting the results and making conclusions

Let's perform a paired sample t-test using the dataset "Athlete" with columns "pre_training_weight" and "post_training_weight".

### Here's the code with explanations for each analytical step

### Step 1: Import the necessary libraries
* We'll import the required libraries: numpy, pandas, and scipy.stats, plus os for changing the working directory.

In [1]:
import numpy as np
import pandas as pd
from scipy.stats import ttest_rel
import os

In [2]:
# Changing working directory
os.chdir("C:\\Users\\HP\\Desktop\\JITSOLUTIONS\\Datasets0")

In [5]:
pwd

'C:\\Users\\HP\\Desktop\\JITSOLUTIONS\\Datasets0'

________________

### Step 2: Load the dataset
* If your dataset is in CSV format, you can load it into a pandas DataFrame using the read_csv() function.
* If your dataset is in xlsx format, you can load it into a pandas DataFrame using the read_excel() function.

In [6]:
# Importing dataset
df = pd.read_csv("Athlete.csv")
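As a minimal sketch of the two loading options above, a small helper could pick the reader from the file extension. The name load_dataset is hypothetical (it is not part of pandas), and reading xlsx files assumes an Excel engine such as openpyxl is installed.

```python
from pathlib import Path

import pandas as pd


def load_dataset(path):
    """Load a CSV or xlsx file into a DataFrame (hypothetical helper)."""
    suffix = Path(path).suffix.lower()
    if suffix == ".csv":
        return pd.read_csv(path)
    if suffix == ".xlsx":
        # Assumption: an Excel engine such as openpyxl is installed.
        return pd.read_excel(path)
    raise ValueError(f"Unsupported file type: {suffix!r}")
```

With this helper, the cell above could be written as df = load_dataset("Athlete.csv").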

In [7]:
df.head()

| | pre_training_weight | post_training_weight |
|---|---|---|
| 0 | 104.985134 | 110.413039 |
| 1 | 106.763985 | 92.067213 |
| 2 | 98.451416 | 79.316233 |
| 3 | 102.664542 | 98.730797 |
| 4 | 78.854023 | 102.324667 |


In [15]:
df.describe().transpose()

| | count | mean | std | min | 25% | 50% | 75% | max |
|---|---|---|---|---|---|---|---|---|
| pre_training_weight | 150.0 | 100.963715 | 10.835621 | 69.823667 | 93.59139 | 101.094173 | 108.272078 | 128.998221 |
| post_training_weight | 150.0 | 99.545093 | 9.798128 | 71.60063 | 91.739313 | 99.562758 | 106.125055 | 124.371665 |


### Step 3: Formulating the null and alternative hypotheses
* Define the null hypothesis (H0) and alternative hypothesis (H1) based on your research question.
* These hypotheses should be specific to your analysis.

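In a paired-sample t-test, the null hypothesis (H0) states that the mean difference between pre_training_weight and post_training_weight is zero, i.e. the mean weight is the same before and after training.

The alternative hypothesis (H1) states that the mean difference is not zero, i.e. the mean weight changed after training. This is a two-sided test, which is the default for ttest_rel.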
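Written in symbols, the hypotheses take the following form. The notation is an assumption introduced here for illustration: $d_i$ is the difference for athlete $i$, and $\mu_d$ is the population mean of those differences.

$$
d_i = x_i^{\text{pre}} - x_i^{\text{post}}, \qquad H_0: \mu_d = 0, \qquad H_1: \mu_d \neq 0
$$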



In [8]:
# Example hypotheses

# H0: The mean difference between pre_training_weight and post_training_weight is zero.
# H1: The mean difference between pre_training_weight and post_training_weight is not zero.

#### Extract the columns:

* Assign the "pre_training_weight" column to the variable pre_training_weight.
* Assign the "post_training_weight" column to the variable post_training_weight.

In [10]:
# Extract the columns for the paired samples
pre_training_weight = df['pre_training_weight']
post_training_weight = df['post_training_weight']
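The outline mentions preprocessing, and a paired test needs both measurements for every athlete. As a hedged sketch, rows with a missing value in either column could be dropped before extracting the pairs. The small DataFrame below is made-up illustration data, not the Athlete dataset, and the names demo, paired, pre, and post are hypothetical.

```python
import numpy as np
import pandas as pd

# Made-up illustration data with one missing post-training value.
demo = pd.DataFrame({
    "pre_training_weight": [80.0, 95.5, 102.3, 88.1],
    "post_training_weight": [78.5, np.nan, 101.0, 87.4],
})

# Keep only the athletes that have both measurements.
paired = demo.dropna(subset=["pre_training_weight", "post_training_weight"])

pre = paired["pre_training_weight"]
post = paired["post_training_weight"]
print(len(demo), "rows before,", len(paired), "rows after")
```

Dropping the whole row, rather than cleaning each column separately, keeps the two series aligned so that each pre value is still matched with its own post value.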

### Step 4: Conduct the paired-sample t-test
* Use the ttest_rel function from the scipy.stats module to perform the paired-sample t-test. 
* Pass the pre_training_weight and post_training_weight variables as arguments.

In [11]:
# Perform the paired sample t-test
t_statistic, p_value = ttest_rel(pre_training_weight, post_training_weight)

In [12]:
print(f"t_statistic: {t_statistic}")
print(f"p_value: {p_value}")

t_statistic: 1.27719047068282
p_value: 0.20352164007759732
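To see what ttest_rel computes, the statistic can be reproduced by hand: take the per-pair differences d, then divide their mean by their standard error, t = mean(d) / (std(d) / sqrt(n)), and compare against a t distribution with n - 1 degrees of freedom. The sketch below uses randomly generated stand-in data (not the Athlete dataset), so its numbers will not match the output above.

```python
import numpy as np
from scipy import stats

# Stand-in data: NOT the Athlete dataset, just reproducible random numbers.
rng = np.random.default_rng(0)
pre = rng.normal(loc=100, scale=10, size=30)
post = pre + rng.normal(loc=-1, scale=5, size=30)

# Paired test by hand: work on the per-pair differences.
d = pre - post
n = d.size
t_manual = d.mean() / (d.std(ddof=1) / np.sqrt(n))
# Two-sided p-value from the t distribution with n - 1 degrees of freedom.
p_manual = 2 * stats.t.sf(abs(t_manual), df=n - 1)

# Reference values from scipy for comparison.
t_scipy, p_scipy = stats.ttest_rel(pre, post)
print(np.isclose(t_manual, t_scipy), np.isclose(p_manual, p_scipy))
```

The manual and scipy values should agree up to floating-point rounding, which shows that the paired test is a one-sample t-test applied to the differences.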


### Step 5: Interpret the results and make conclusions
* Compare the obtained p-value with a predetermined significance level (α) to determine if the results are statistically significant. 
* You can print the results and provide a conclusion based on the outcome of the test.

In [14]:
alpha = 0.05

if p_value < alpha:
    print("Reject the null hypothesis. The mean pre-training and post-training weights differ significantly.")
else:
    print("Fail to reject the null hypothesis. There is not enough evidence that the mean pre-training and post-training weights differ.")


Fail to reject the null hypothesis. There is not enough evidence that the mean pre-training and post-training weights differ.
