## IN THIS NOTEBOOK, WE'LL BE USING SOME AUTOMATED LIBRARIES THAT MAKE THE MACHINE LEARNING TASK EASIER.

--------------

### LIBRARIES WE'LL BE WORKING ON ARE:

--------------------

#### 1. Pandas_Profiling
#### 2. AutoViz
#### 3. D-Tale
#### 4. SweetViz

------------------------


![image.png](attachment:3d1c8fa7-ca8d-4557-85e6-6939d198f8f3.png)


pandas_profiling generates profile reports from a pandas DataFrame. The pandas `df.describe()` function is great but a little basic for serious exploratory data analysis, so pandas_profiling extends the pandas DataFrame with `df.profile_report()` for quick data analysis.

In short, pandas profiling saves us the work of visualizing and understanding the distribution of each variable: it generates a single report with all of that information readily available.

**For each column the following statistics - if relevant for the column type - are presented in an interactive HTML report:**

* Type inference: detect the types of columns in a dataframe.

* Essentials: type, unique values, missing values

* Quantile statistics like minimum value, Q1, median, Q3, maximum, range, interquartile range

* Descriptive statistics like mean, mode, standard deviation, sum, median absolute deviation, coefficient of variation, kurtosis, skewness

* Most frequent values

* Histograms

* Correlations: highlighting of highly correlated variables, plus Spearman, Pearson and Kendall matrices

* Missing values: matrix, count, heatmap and dendrogram of missing values

* Duplicate rows

* Text analysis: learn about categories (Uppercase, Space), scripts (Latin, Cyrillic) and blocks (ASCII) of text data.
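To make the list above concrete, here is a minimal sketch of a few of these statistics computed by hand with plain pandas. The tiny DataFrame and its column names (`age`, `bmi`, `city`) are made up for illustration; the profiling report produces these numbers, and many more, for every column automatically.

```python
import numpy as np
import pandas as pd

# Toy data (hypothetical) with one missing value and one duplicated row.
toy = pd.DataFrame(
    {
        "age": [25, 32, 47, 51, 32, 60],
        "bmi": [22.0, 27.5, np.nan, 31.0, 27.5, 24.5],
        "city": ["A", "B", "A", "C", "B", "A"],
    }
)

# Essentials: inferred type, unique values and missing values per column.
essentials = pd.DataFrame(
    {
        "dtype": toy.dtypes.astype(str),
        "n_unique": toy.nunique(),
        "n_missing": toy.isna().sum(),
    }
)

# Quantile statistics for one numeric column.
age = toy["age"]
q1, median, q3 = age.quantile([0.25, 0.5, 0.75])
quantile_stats = {
    "min": age.min(),
    "Q1": q1,
    "median": median,
    "Q3": q3,
    "max": age.max(),
    "range": age.max() - age.min(),
    "IQR": q3 - q1,
}

# Descriptive statistics for the same column.
descriptive_stats = {
    "mean": age.mean(),
    "std": age.std(),
    "sum": age.sum(),
    "skewness": age.skew(),
    "kurtosis": age.kurt(),
    "cv": age.std() / age.mean(),  # coefficient of variation
}

# Most frequent values, duplicate rows and two of the correlation matrices.
most_frequent_city = toy["city"].value_counts().head(3)
n_duplicate_rows = int(toy.duplicated().sum())
pearson = toy[["age", "bmi"]].corr(method="pearson")
spearman = toy[["age", "bmi"]].corr(method="spearman")

print(essentials)
print(quantile_stats)
print(descriptive_stats)
print(most_frequent_city)
print("duplicate rows:", n_duplicate_rows)
```

Doing this for every column of a real dataset quickly becomes tedious, which is exactly the work the report takes over.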



#### CODE USING DIABETES DATASET:

In [None]:
import numpy as np
import pandas as pd
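# Newer releases of this package are published as ydata-profiling;
# there the import is: from ydata_profiling import ProfileReport
from pandas_profiling import ProfileReport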

In [None]:
from sklearn.datasets import load_diabetes

In [None]:
# Load scikit-learn's built-in diabetes dataset
diab_data = load_diabetes()

In [None]:

df = pd.DataFrame(data=diab_data.data, columns=diab_data.feature_names)

In [None]:
profile = ProfileReport(df, title='Pandas Profiling Report', explorative=True)

In [None]:

profile.to_file("output.html")

In [None]:
profile

## HERE WE CAN SEE HOW NICELY PANDAS_PROFILING HAS DONE THIS WORK. THE REPORT SHOWS EVERYTHING MENTIONED ABOVE.

![](https://autoviz.io/static/core/img/logo.png)

Visualization means representing data with graphs and plots. In data science, we generally use visualization to understand a dataset and to find relationships between its variables. It can also reveal patterns in the data that guide further analysis.

AutoViz performs automatic visualization of any dataset with one line of code. Give it an input file (CSV, TXT or JSON) and AutoViz will visualize it.

### CODE:

In [None]:
# INSTALLATION:

!pip install autoviz

In [None]:
!pip install xlrd

In [None]:
# Import the AutoViz class
from autoviz.AutoViz_Class import AutoViz_Class

# Instantiate the AutoViz class
AV = AutoViz_Class()

In [None]:

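filename = "../input/pima-indians-diabetes-database/diabetes.csv"
sep = ","
dft = AV.AutoViz(
    filename,
    sep=sep,  # column delimiter of the input file
    depVar="",  # target column; an empty string means no target
    dfte=None,  # a DataFrame can be passed here instead of a filename
    header=0,  # row number of the header line
    verbose=0,  # keep printed output to a minimum
    lowess=False,  # no LOWESS smoothing lines on scatter plots
    chart_format="svg",
    max_rows_analyzed=150000,  # larger datasets are sampled down
    max_cols_analyzed=30,
)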

### YOU CAN SEE THAT IT TOOK JUST 8.188 SECONDS TO DISPLAY ALL THE VISUALS. THIS LIBRARY DOES ABOUT 20% OF OUR WORK FOR US.

![](https://warehouse-camo.ingress.cmh1.psfhosted.org/742ca2c2ae99cf9ab865549efabfaafb753e5483/68747470733a2f2f7261772e67697468756275736572636f6e74656e742e636f6d2f617363686f6e66656c642f6474616c652d6d656469612f6d61737465722f696d616765732f5469746c652e706e67)

## THE NAME ITSELF SAYS IT: "D-TALE" SOUNDS LIKE **DETAIL**, AS IN DETAILED ANALYSIS OF DATA

D-Tale is the combination of a Flask back-end and a React front-end to bring you an easy way to view & analyze Pandas data structures. It integrates seamlessly with ipython notebooks & python/ipython terminals. Currently this tool supports such Pandas objects as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex.


#### CODE:

In [None]:
!pip install dtale

In [None]:
import dtale

In [None]:
df = pd.read_csv('../input/pima-indians-diabetes-database/diabetes.csv')

In [None]:
dtale.show(df)

### THE OUTPUT IS NOT DISPLAYED HERE. RUN THIS CODE ON YOUR OWN SYSTEM TO SEE THE REPORT.
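`dtale.show` also returns a handle to the running session, which is handy when the grid does not render inline. The sketch below wraps calls listed in the D-Tale README (`_url` and `kill()`); the helper names `launch_dtale` and `stop_dtale` are hypothetical, and the import sits inside the function so the cell can be defined before `dtale` is installed.

```python
def launch_dtale(df):
    """Start a D-Tale session for df and return (url, handle)."""
    # Lazy import: defining this helper does not require dtale to be installed.
    import dtale

    handle = dtale.show(df)  # starts the Flask back-end in the background
    return handle._url, handle


def stop_dtale(handle):
    """Shut down the D-Tale session behind the given handle."""
    handle.kill()
```

Calling `url, d = launch_dtale(df)` gives you a URL you can open in a browser tab, and `stop_dtale(d)` frees the port once you are done exploring.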

![](https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRMqwP3APfSq3B3hFCs9BhwPQ-jvW3cjFnO0A&usqp=CAU)

In [None]:
!pip install sweetviz

In [None]:
import sweetviz as sv

# Analyze the DataFrame and write the report to an HTML file
report = sv.analyze(df)
report.show_html('output_sweetviz.html')

### THE OUTPUT IS NOT DISPLAYED HERE. YOU CAN DOWNLOAD THE FILE FROM THE OUTPUT SECTION AND OPEN IT IN CHROME OR ANOTHER BROWSER.
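SweetViz can also compare two DataFrames side by side and relate every feature to a target column. A minimal sketch under some assumptions: the helper names `split_halves` and `compare_halves` are hypothetical, the target column name depends on your data (for the Pima CSV it is presumably `Outcome`), and `open_browser=False` is passed so that nothing tries to launch a browser on a headless notebook server.

```python
import pandas as pd


def split_halves(df, frac=0.5, seed=0):
    """Randomly split df into two disjoint parts (hypothetical helper).

    Assumes the index labels of df are unique.
    """
    first = df.sample(frac=frac, random_state=seed)
    second = df.drop(first.index)
    return first, second


def compare_halves(df, target, path="compare_sweetviz.html"):
    """Write a SweetViz report comparing two random halves of df."""
    # Lazy import: defining this helper does not require sweetviz to be installed.
    import sweetviz as sv

    first, second = split_halves(df)
    report = sv.compare(
        [first, "First half"],
        [second, "Second half"],
        target_feat=target,  # numeric or boolean column to analyze against
    )
    report.show_html(path, open_browser=False)
    return path
```

In practice you would pass your own train and test sets instead of random halves; the comparison view makes it easy to spot features whose distribution differs between the two.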

## CONCLUSION:

**In this notebook we walked through several automated libraries for both exploratory data analysis and visualization.**


## IF THIS NOTEBOOK HELPS YOU, PLEASE GIVE AN UPVOTE