outliers

In this repository, using the statistical software R, are been analyzed robust techniques to estimate multivariate linear regression in presence of outliers, using the Bootstrap, a simulation method where the construction of sample distribution of given statistics occurring through resampling the same observed sample.

bootstrap r statistics outliers frb multivariate-linear-regression robust-regresssion outliers-detection

Updated Nov 27, 2019
R

nickwawee / Exploring_Wine_Quality

Star

This repo contains EDA of red and white wine and how it relates to quality.

quality correlation exploratory-data-analysis histogram outliers wine symmetry wine-quality multiple-linear-regression

Updated Dec 20, 2020
HTML

TrilokiDA / Data-pre-processing

Star

Data preprocessing is a data mining technique that is used to transform the raw data into a useful and efficient format.

feature-selection outliers outlier-detection features outlier-removal datapreprocessing

Updated Jul 2, 2022
Jupyter Notebook

QuantumKane / anomaly-detection-exercises

Star

This repo contains my work for Codeup's Anomaly Detection module.

outliers anomaly-detection

Updated Jun 26, 2021
Jupyter Notebook

poonam-ux / Matplotlib_Pharmaceuticals_performance_data

Star

Compare the performance of Pymaceuticals’ drug of interest, Capomulin, versus the other treatment regimens.

pandas-dataframe linear-regression outliers matplotlib summary-statistics correlation-coefficient

Updated Jun 30, 2021
Jupyter Notebook

aber0016 / Data_Cleansing

Star

This project applies data wrangling techniques to a retailer's data set of online orders. These techniques include determining and removing syntactical as well as semantic anomalies, removing outliers and imputing missing values using basic machine learning.

machine-learning outliers data-cleansing semantic-anomalies

Updated Jan 25, 2021
Jupyter Notebook

prasadposture / Data-Preparation

Star

There are lot of things that need to be done on the given dataset before we feed it to the machine, these things come under data preprocessing. In this repository I have tried to explain those things with some examples.

outliers scaling groupby label-encoding missing-value-handling dummy-variables data-binning duplicate-rows

Updated Aug 8, 2023
Jupyter Notebook

G-D-e-e-p-a-k / project-on-Bank-problem-statement

Star

To perform exploratory data analysis and visualization on a dataset containing customer information, to identify potential target customers for the bank’s future marketing campaign.

prediction data-visualization statistical-analysis data-analytics outliers data-manipulation data-cleaning ms-excel

Updated Apr 20, 2024

Develop-Packt / Analyzing-the-Heart-Disease-Dataset

Star

Identify missing values, outliers and trends in medical data. Create bar charts, heatmaps and other visualizations to understand how the features impact the target column of the data set

heatmap data-visualization outliers data-analysis bar-chart missing-values

Updated Mar 29, 2020
Jupyter Notebook

Kavitha-Kothandaraman / Basic-Statistics-pima-diabetes

Star

To explore the given dataset for all basic statistics such as the distributions, correlations, outliers, missing values, etc.

machine-learning plot eda statistical-analysis outliers missing-data pima-diabetes-data pyplot pima-indians-dataset correlations missing-values

Updated May 29, 2020
Jupyter Notebook

pagoma3 / Feature_engineering

Star

Sprint 9, Task 1

pca outliers feature-engineering standarization

Updated Oct 15, 2021
Jupyter Notebook

Improve this page

Add a description, image, and links to the outliers topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the outliers topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

outliers

Here are 149 public repositories matching this topic...

Kavitha-Kothandaraman / Featurization-Model-Selection-Tuning

AkashSDas / classify-customer-churn

scoutiii / HTSoutliers

Csengupta1101 / Data-Is-Good-Exam---September

LLudivina / kickstarter-analysis

sachelsout / regression-outlier-effect

Develop-Packt / Investigating-Air-Quality-in-Beijing

sarsteg / pymaceuticals-python-analysis-visuals-matplotlib

albertorb / tib-clustering

Daniele-montalbano / R-Robust-Estimation-With-Outliers-Using-Bootstrap

nickwawee / Exploring_Wine_Quality

TrilokiDA / Data-pre-processing

QuantumKane / anomaly-detection-exercises

poonam-ux / Matplotlib_Pharmaceuticals_performance_data

aber0016 / Data_Cleansing

prasadposture / Data-Preparation

G-D-e-e-p-a-k / project-on-Bank-problem-statement

Develop-Packt / Analyzing-the-Heart-Disease-Dataset

Kavitha-Kothandaraman / Basic-Statistics-pima-diabetes

pagoma3 / Feature_engineering

Improve this page

Add this topic to your repo