Build software better, together

databrickslabs / automl-toolkit

Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Model selection and training, Hyper parameter optimization and selection, Model interprability.

scala spark apache-spark ml pyspark machinelearning feature-engineering

Updated Jun 1, 2021
HTML

EmilHvitfeldt / feature-engineering-az

Star

Source for book "Feature Engineering A-Z"

machine-learning feature-selection feature-extraction feature-engineering scikit tidymodels

Updated May 16, 2024
HTML

fernandodecastilla / Machine-Learning-for-Predictive-Maintenance

Star

Anomaly detection and failure prognosis applied to industrial machines

data-science machine-learning exploratory-data-analysis jupyter-notebook python-3-6 feature-engineering anomaly-detection failure-prediction industrial-iot

Updated Jun 6, 2019
HTML

sharmaroshan / Numpy-and-Pandas

Star

Numpy and Pandas are one of the most important building blocks of knowledge to get started in the field of Data Science, Analytics, Machine Learning, Business Intelligence, and Business Analytics. This Tutorial Focuses to help the Beginners to learn the core Concepts of Numpy and Pandas and get started with Machine Learning and Data Science.

data-science machine-learning numpy pandas feature-extraction data-analysis data-preprocessing aggregation feature-engineering dataframe pandas-profiling

Updated Apr 12, 2020
HTML

bghojogh / Feature-Extraction-Survey

Star

The code for the survey paper of feature extraction

feature-selection feature-extraction feature-engineering

Updated Oct 16, 2020
HTML

Shuyib / chronic-kidney-disease-kaggle

Star

Using machine learning models to predict if patients have chronic kidney disease based on a few features. The results of the models are also interpreted to make it more understandable to health practitioners.

data-science machine-learning machine-learning-algorithms data-transformation data-visualization feature-selection dimensionality-reduction diagnostics feature-engineering health-data-analysis machine-learning-algorithm model-interpretability data-cleaning-pipeline health-data-science preventative-medicine

Updated Jun 17, 2024
HTML

ChaitanyaC22 / Udacity-AWS-MLE-ND-Project1-Bike-Sharing-Demand-with-AutoGluon

Star

This project focuses on using the AWS open-source AutoML library, AutoGluon, to predict bike sharing demand using the Kaggle Bike Sharing demand dataset.

python data-science machine-learning deep-learning exploratory-data-analysis machine-learning-algorithms jupyter-notebook hyperparameter-optimization amazon-web-services boosting-algorithms feature-engineering model-evaluation automl stacking model-building ensemble-methods autogluon

Updated Jan 24, 2023
HTML

nickdcox / ml-linear-airfare-prediction

Star

Project: What factors impact the accuracy of airfare prediction?

python flask machine-learning linear-regression sklearn exploratory-data-analysis pandas data-visualisation matplotlib feature-engineering google-maps-api data-cleaning model-deployment xgboost-model lgbmregressor

Updated Jan 4, 2021
HTML

jpnevrones / FeatureSelection-using-Ant-Colony-Optimization

Star

A novel feature selection algorithm using ACO-Ant Colony Optimization, to extract feature words from a given web page and then to generate an optimal feature set based on ACO Metaheuristics and normalized weight defined as a learning function of their learned weights, position and frequency of feature in the web page. JAVA based ACO Framework

java feature-extraction ant-colony-optimization feature-engineering classification-algorithm

Updated Mar 30, 2018
HTML

kelhoussaini / Olist_Project

Star

We will analyze a dataset provided by an e-commerce marketplace called [Olist](https://www.olist.com) to answer the CEO's question: Should Olist remove underperforming sellers from its marketplace? How to increase customer satisfaction (so as to increase profit margin) while maintaining a healthy order volume?

visualization python jupyter convert logistic-regression feature-engineering data-exploration nbconvert data-analysis-python

Updated Dec 7, 2021
HTML

ehtisham-sadiq / Movie-Genre-Prediction

Star

The "Movie Genre Prediction" project is a comprehensive machine learning system designed to forecast a movie's genre by analyzing its attributes. By employing advanced machine learning methods, it strives to improve genre classification accuracy, offering valuable insights to creators, film aficionados, and the entertainment sector.

docker flask machine-learning feature-extraction webapp feature-engineering nlp-machine-learning

Updated Oct 18, 2023
HTML

billy-enrizky / Loan-Prediction-Status

Star

Explore an ML model with Logistic Regression, SVM, Gradient Boosting, Random Forest, and Decision Tree, enhanced via Hyperparameter Tuning. Experience our GUI-based ML model with 82.49% accuracy. Try it now!

data-science machine-learning random-forest pandas logistic-regression gradient feature-engineering svm-model gradient-boosting-classifier

Updated Sep 15, 2023
HTML

Kollipati / Diabetes-Prediction-

Star

python flask machine-learning exploratory-data-analysis feature-selection feature-extraction feature-engineering heroku-deployment

Updated Mar 18, 2023
HTML

avisionary / NYT-article-popularity

Star

Machine Learning project to predict popularity of NYT articles

python nlp api machine-learning r exploratory-data-analysis plotly feature-engineering

Updated Nov 17, 2022
HTML

raj-shr-git / Medicare_Provider_Fraud_Detection

Star

This repository contains the code components of work carried out for analyzing the Medical Provider Fraud Detection dataset with the intent to find most important features to crack down the potentially fraud providers.

data-science machine-learning data-visualisation feature-engineering fraud-detection insurance-claims healthcare-analysis medicare-claims-data

Updated Sep 5, 2022
HTML

yeisonmontoya1815 / Machine-Learning_Prediction_CAN_Inflation

Star

we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

python data-science data machine-learning numpy sklearn pipelines pandas data-visualization data-analysis structured-data feature-engineering super unsupervised-learning numerical-analysis algorithms-and-data-structures matplotlib-pyplot

Updated Jun 28, 2024
HTML

DrScKim / mixedFeatureNN

Star

Find Similar Commercial items by using NN with Mixing Textual, Categorical and Numerical features

visualization python unsupervised scikit-learn feature-engineering mixed-models

Updated Mar 28, 2019
HTML

Davisy / Standard-Bank-Tech-Impact-Xente-credit-scoring-challenge

Star

Zindi competition on predicting the likelihood of credit default of ecommerce clients

data machine-learning exploratory-data-analysis competitive-programming feature-selection datascience feature-engineering

Updated Oct 19, 2020
HTML

mjoneil21 / Homework-3

Star

This assignment centered around advanced data minipulation: gathering columns, using pipes, and creating new columns with mutate. As in homework 2, Census data on Michigan was used as a base for this assignment. Simple statistics such as mean, median and trimmed mean were used to describe the variables. Visualization was also implemented to help…

visualization feature-engineering datamanipulation

Updated Feb 25, 2018
HTML

fpaupier / skin_section_segmentation

Star

Use of kmeans segmentation algorithm to classify dermis, epidermis and tumor infiltration.

matlab image-processing tumor medical medical-imaging feature-extraction image-classification convolution kmeans image-segmentation feature-engineering kmeans-clustering kmeans-image-clustering medical-image-processing oncology tumor-detection

Updated Sep 29, 2019
HTML

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature-engineering

Here are 78 public repositories matching this topic...

databrickslabs / automl-toolkit

EmilHvitfeldt / feature-engineering-az

fernandodecastilla / Machine-Learning-for-Predictive-Maintenance

sharmaroshan / Numpy-and-Pandas

bghojogh / Feature-Extraction-Survey

Shuyib / chronic-kidney-disease-kaggle

ChaitanyaC22 / Udacity-AWS-MLE-ND-Project1-Bike-Sharing-Demand-with-AutoGluon

nickdcox / ml-linear-airfare-prediction

jpnevrones / FeatureSelection-using-Ant-Colony-Optimization

kelhoussaini / Olist_Project

ehtisham-sadiq / Movie-Genre-Prediction

billy-enrizky / Loan-Prediction-Status

Kollipati / Diabetes-Prediction-

avisionary / NYT-article-popularity

raj-shr-git / Medicare_Provider_Fraud_Detection

yeisonmontoya1815 / Machine-Learning_Prediction_CAN_Inflation

DrScKim / mixedFeatureNN

Davisy / Standard-Bank-Tech-Impact-Xente-credit-scoring-challenge

mjoneil21 / Homework-3

fpaupier / skin_section_segmentation

Improve this page

Add this topic to your repo