over-sampling

Star

Here are 18 public repositories matching this topic...

baibai25 / MNDO

Star

Multivariate Normal Distribution based Oversampling

machine-learning imbalanced-data imbalanced-learning over-sampling

Updated Mar 18, 2019
Jupyter Notebook

sharmaroshan / Fraud-Detection-in-Online-Transactions

Star

Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting

finance machine-learning query deep-learning data-visualization data-analytics classification data-analysis sampling confusion-matrix large-dataset classification-report anamoly-detection auprc under-sampling over-sampling

Updated May 23, 2019
Jupyter Notebook

baibai25 / MNDO-NC

Star

Multivariate Normal Distribution Based Over-Sampling for Numerical and Categorical Features

machine-learning imbalanced-data imbalanced-learning over-sampling

Updated Jan 6, 2020
Jupyter Notebook

selkhayri / Determining_Credit_Risk

Star

The aim of this project is to determine which classification technique produces the best results when applied to the task of determining credit riskiness.

ensemble-learning under-sampling over-sampling

Updated Aug 14, 2020
Jupyter Notebook

JalajVora / Text-Analytics-with-Multi-Class-and-Imbalanced-Learning

Star

Genre Identification task along with Text Analytics with Multi-Class and Imbalanced Learning on Gutenberg Corpus

python machine-learning neural-network naive-bayes-classifier logistic-regression gutenberg text-analytics decision-tree-classifier imbalanced-data multiclass-classification imbalanced-learning complement-navie-bayes random-forest-classifier text-retrieval svm-rbf over-sampling

Updated Nov 24, 2020
HTML

jabhinav / Data-Science-and-ML-for-Structured-Data-Classification

Star

Repo contains scripts to perform data analysis on structure data. It also provides a comparison of various ML algorithms at different stages of data preparation.

data-science machine-learning-algorithms data-analysis data-preparation binary-classification cost-sensitive-learning under-sampling over-sampling

Updated Dec 2, 2020
Jupyter Notebook

abhiram-ds / credit_card_fraud_detection

Star

Credit Card Fraud detection based on anonymized data using multiple classification algorithms

python random-forest pandas credit-card-fraud xgboost classification logistic-regression decision-trees class-imbalance smote skewness under-sampling over-sampling

Updated Jan 28, 2021
Jupyter Notebook

richengo / WNV-Predictions

Star

Kaggle Competition: Predictions of West Nile Virus outbreaks in the City of Chicago.

time-series exploratory-data-analysis kaggle-competition cost-benefit-analysis classification-models over-sampling

Updated Jan 31, 2021
Jupyter Notebook

ChaitanyaC22 / Telecom-Churn-Prediction

Star

In this project, data analytics is used to analyze customer-level data of a leading telecom firm, build predictive models to identify customers at high risk of churn, and identify the main indicators of churn. The project focuses on a four-month window, wherein the first two months are the ‘good’ phase, the third month is the ‘action’ phase, whi…

Updated Jul 9, 2021
Jupyter Notebook

BananAlhethlool / Promotions-Prediction-Classification

Star

Predicts the qualified employee for promotion using Classification

random hr logistic-regression decision-tree smote knn-classification classifcation ensemble-methods imbalance over-sampling gaussiannb experimenting-stacking

Updated Dec 5, 2021
Jupyter Notebook

mbdelaresma / football-position-classification

Star

Football Positions: A Multi-class Classification Problem

machine-learning feature-engineering grid-search lime multi-class-classification shannon-entropy cost-sensitive-classification shap model-interpretability over-sampling

Updated Sep 4, 2022
Jupyter Notebook

cbrito3 / Credit_Risk_Analysis

Star

Supervised Machine Learning and Credit Risk

machine-learning scikit-learn machine-learning-algorithms scikitlearn-machine-learning ada-boost-classifier precision-recall imbalance-learning under-sampling over-sampling balanced-random-forest smote-oversampler naive-random-oversampler cluster-centroids-undersampling easy-ensemble-classifier

Updated Feb 4, 2023
Jupyter Notebook

chihangs / diabetes_classification

Star

Use random forest, gradient boosting, neural network, with SMOTE-ENN and random over-sampling

neural-network random-forest gradient-boosting diabetes-prediction over-sampling smote-enn

Updated Feb 9, 2023
Jupyter Notebook

alicevillar / student_admission_prediction

Star

Predicting students admission with Logistic Regression, Decision Tree, SVM (SVC) and Random Forest

machine-learning random-forest machine-learning-algorithms logistic-regression resampling decision-tree prediction-model svc kfold-cross-validation machine-learning-projects over-sampling performance-measurements

Updated May 1, 2023
Jupyter Notebook

NeonOstrich / Credit-Risk-Classification-using-Logistic-Regression

Star

Trained and evaluated two supervised machine learning models using original and resampled data to identify 'healthy loan' and 'high risk loan' applicants from financial disclosures.

numpy reporting sklearn pandas logistic-regression classification-report supervised-machine-learning pathlib over-sampling train-test-split

Updated Jun 7, 2023
Jupyter Notebook

hanfei1986 / Oversampling-of-imbalanced-data-with-RandomOverSampler-SMOTE-and-ADASYN

Star

Imbalanced data commonly exist in real world, especially in anomaly-detection tasks. Handling imbalanced data is important to the tasks, otherwise the predictions are biased towards the majority class. RandomOverSampler, SMOTE, and ADASYN are useful oversampling tools to fabricate data for minority classes and make the dataset balanced.

machine-learning imbalanced-data over-sampling

Updated Aug 29, 2023
Jupyter Notebook

nickkunz / smogn

Star

Synthetic Minority Over-Sampling Technique for Regression

regression imbalanced-data smote synthetic-data over-sampling

Updated Feb 7, 2024
Python

M-Hashemzadeh / RCSMOTE

Star

RCSMOTE: Range-Controlled Synthetic Minority Over-sampling Technique for handling the class imbalance problem

smote over-sampling imbalanced-classification smote-sampling imbalanced-datasets class-imbalance-problem

Updated Aug 31, 2024

Improve this page

Add a description, image, and links to the over-sampling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the over-sampling topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

over-sampling

Here are 18 public repositories matching this topic...

baibai25 / MNDO

sharmaroshan / Fraud-Detection-in-Online-Transactions

baibai25 / MNDO-NC

selkhayri / Determining_Credit_Risk

JalajVora / Text-Analytics-with-Multi-Class-and-Imbalanced-Learning

jabhinav / Data-Science-and-ML-for-Structured-Data-Classification

abhiram-ds / credit_card_fraud_detection

richengo / WNV-Predictions

ChaitanyaC22 / Telecom-Churn-Prediction

BananAlhethlool / Promotions-Prediction-Classification

mbdelaresma / football-position-classification

cbrito3 / Credit_Risk_Analysis

chihangs / diabetes_classification

alicevillar / student_admission_prediction

NeonOstrich / Credit-Risk-Classification-using-Logistic-Regression

hanfei1986 / Oversampling-of-imbalanced-data-with-RandomOverSampler-SMOTE-and-ADASYN

nickkunz / smogn

M-Hashemzadeh / RCSMOTE

Improve this page

Add this topic to your repo