Data Explorer

Launch the web app here:

Demo

Analysis and Pandas Profiling

The various functions lets us undertstand the data, it's datatypes and describe the features. We can get basic details about data as well as advanced descriptive statistcs. We can check if any null values are present, if yes we have the functionality to fill them using appropriate logic. Another automation method lets us check for duplicates and lets us remove them if desired.pandas-profiling generates profile reports from a pandas DataFrame. pandas-profiling extends pandas DataFrame with df.profile_report(), which automatically generates a standardized univariate and multivariate report for data understanding. We are proviede with an option to download the pandas profile report.

For each column, the following information (whenever relevant for the column type) is presented in an interactive HTML report:

Type inference: detect the types of columns in a DataFrame
Essentials: type, unique values, indication of missing values
Quantile statistics: minimum value, Q1, median, Q3, maximum, range, interquartile range
Descriptive statistics: mean, mode, standard deviation, sum, median absolute deviation, coefficient of variation, kurtosis, skewness
Most frequent and extreme values
Histograms: categorical and numerical
Correlations: high correlation warnings, based on different correlation metrics (Spearman, Pearson, Kendall, Cramér’s V, Phik)
Missing values: through counts, matrix, heatmap and dendrograms
Duplicate rows: list of the most common duplicated rows
Text analysis: most common categories (uppercase, lowercase, separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic)
File and Image analysis: file sizes, creation dates, dimensions, indication of truncated images and existance of EXIF metadata

Reproducing this web app

To recreate this web app on your own computer, do the following.

Create conda environment

Firstly, we will create a conda environment called dex

conda create -n dex python=3.7.9

Secondly, we will login to the eda environment

conda activate dex

Install prerequisite libraries

Download requirements.txt file

wget https://raw.githubusercontent.com/gmayuriiii/data-explorer/main/requirements.txt

Pip install libraries

pip install -r requirements.txt

Download and unzip contents from GitHub repo

Download and unzip contents from

Launch the app

streamlit run dataexplorer.py

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Bengaluru_House_Data.csv		Bengaluru_House_Data.csv
Procfile.txt		Procfile.txt
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
setup.sh.txt		setup.sh.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Data Explorer

Launch the web app here:

Demo

Analysis and Pandas Profiling

Reproducing this web app

Create conda environment

Install prerequisite libraries

Download and unzip contents from GitHub repo

Launch the app

About

Uh oh!

Releases

Packages

Languages

Uh oh!

Uh oh!

gmayuri1904/Data-Explorer

Folders and files

Latest commit

History

Repository files navigation

Data Explorer

Launch the web app here:

Demo

Analysis and Pandas Profiling

Reproducing this web app

Create conda environment

Install prerequisite libraries

Download and unzip contents from GitHub repo

Launch the app

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages