# Advanced Certification in AIML
## A Program by IIIT-H and TalentSprint
## Not for grading

## Dataset 

#### Description
This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.

The dataset consists of several medical predictor variables and one target variable, Outcome.

* Preg: Number of times pregnant
* Glucose: Plasma glucose concentration at 2 hours in an oral glucose tolerance test
* BloodPressure: Diastolic blood pressure (mm Hg)
* SkinThickness: Triceps skin fold thickness (mm)
* Insulin: 2-Hour serum insulin (mu U/ml)
* BMI: Body mass index (weight in kg/(height in m)^2)
* DiabetesPedigreeFunction: Diabetes pedigree function
* Age: Age (years)
* Outcome: Class variable (1 = diabetes, 0 = no diabetes)

In [None]:
!wget https://cdn.iiith.talentsprint.com/aiml/Experiment_related_data/diabetes.csv

### Import required packages

In [None]:
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.ensemble import BaggingClassifier

### Load the data

In [None]:
# Load the diabetes dataset
diabetes = pd.read_csv("diabetes.csv")
diabetes.head()

In [None]:
diabetes.isnull().any() # Check for null values
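
A null check alone may not tell the whole story. In the commonly distributed version of this dataset, measurements such as glucose, blood pressure, or BMI are sometimes recorded as 0, which is physiologically implausible and is usually read as a missing value. The sketch below counts zeros per column. The helper name `count_zero_entries` and the toy rows are hypothetical, used only to keep the example self-contained; on the real data you would pass the `diabetes` DataFrame instead.

```python
import pandas as pd

def count_zero_entries(df, columns):
    """Return the number of zero-valued entries in each of the given columns."""
    # Skip any requested column that the frame does not contain
    present = [c for c in columns if c in df.columns]
    return (df[present] == 0).sum()

# Hypothetical toy rows, not taken from diabetes.csv
toy = pd.DataFrame({
    "Glucose": [148, 0, 183],
    "BMI": [33.6, 26.6, 0.0],
    "Insulin": [0, 0, 94],
})

zero_counts = count_zero_entries(toy, ["Glucose", "BloodPressure", "Insulin", "BMI"])
print(zero_counts)
```

Whether to impute, drop, or keep such zeros is a modelling decision; this notebook leaves the data as loaded.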

In [None]:
# Extract features and labels
X = diabetes.drop("Outcome", axis=1)
y = diabetes["Outcome"]

In [None]:
X.shape, y.shape

In [None]:
# Split the data into train and test data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)
print(X_train.shape, X_test.shape, y_train.shape, y_test.shape)
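
The split above is purely random, so the share of positive cases in the train and test sets can drift apart. One optional variation, not used in this notebook, is to pass `stratify` to `train_test_split` so that both parts keep roughly the same class proportions. The sketch below illustrates this on hypothetical toy labels (`y_toy`, `X_toy`) rather than on the diabetes data.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical imbalanced labels (70% zeros, 30% ones), not the diabetes data
y_toy = np.array([0] * 140 + [1] * 60)
X_toy = np.arange(len(y_toy)).reshape(-1, 1)  # placeholder single feature

# stratify=y_toy asks for the same class proportions in both parts
X_tr, X_te, y_tr, y_te = train_test_split(
    X_toy, y_toy, test_size=0.25, random_state=42, stratify=y_toy
)

print("Positive rate overall:", y_toy.mean())
print("Positive rate in train:", y_tr.mean())
print("Positive rate in test:", y_te.mean())
```

For the notebook's data, the equivalent call would add `stratify=y` to the existing `train_test_split` line.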

### Apply Bagging Classifier

In [None]:
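# Bagging with scikit-learn's defaults: 10 decision trees, each fit on a bootstrap sample
clf = BaggingClassifier(random_state=42)  # fix the seed so the accuracy is reproducible
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
print("Accuracy of Bagging Classifier is:", accuracy_score(y_test, y_pred))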
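
To make the idea behind bagging concrete, here is a minimal hand-rolled sketch: draw bootstrap samples (with replacement) from the training set, fit one decision tree per sample, and combine the trees by majority vote. It is a simplification and not scikit-learn's implementation; `BaggingClassifier` averages predicted probabilities when the base estimator provides them, whereas this sketch uses hard votes. The data comes from `make_classification` as a synthetic stand-in, and all variable names are illustrative assumptions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption: not the diabetes data)
X_demo, y_demo = make_classification(n_samples=400, n_features=8, random_state=0)
Xtr, Xte, ytr, yte = train_test_split(X_demo, y_demo, test_size=0.25, random_state=0)

rng = np.random.default_rng(0)
n_estimators = 10
trees = []
for _ in range(n_estimators):
    # Bootstrap sample: draw len(Xtr) row indices with replacement
    idx = rng.integers(0, len(Xtr), size=len(Xtr))
    trees.append(DecisionTreeClassifier(random_state=0).fit(Xtr[idx], ytr[idx]))

# Collect each tree's 0/1 predictions: shape (n_estimators, n_test)
votes = np.stack([t.predict(Xte) for t in trees])

# Majority vote over trees; an exact tie is resolved in favour of class 1
y_vote = (votes.mean(axis=0) >= 0.5).astype(int)
acc = accuracy_score(yte, y_vote)
print("Accuracy of hand-rolled bagging:", acc)
```

Because each tree sees a slightly different resample of the training data, their individual errors tend to differ, and voting can reduce the variance of the combined prediction.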