Skip to content

Exploring different feature engineering approaches, including: implementing PCA (Principal Component Analysis), feature analysis, integral image, and Haar Measurement all from scratch applied to some datasets

Notifications You must be signed in to change notification settings

sidneylafont/feature_engineering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Feature Engineering

A toolkit / library / project for performing feature engineering: transforming raw data into informative features for machine learning models.

Overview

This project provides a set of feature engineering tools to preprocess, transform, and enrich datasets for machine learning.

Feature Extraction Techniques Implemented

  • Principal Component Analysis (PCA) from scratch
  • Integral Image feature extraction from scratch
  • HAAR-like feature extraction from scratch

Tested on Machine Learning Models

  • AdaBoost
  • Decision Tree
  • Gaussian Naive Bayes

Datasets

  • Spambase Dataset
  • Pollution Dataset

For implementation and results see: feature_engineering.ipynb

About

Exploring different feature engineering approaches, including: implementing PCA (Principal Component Analysis), feature analysis, integral image, and Haar Measurement all from scratch applied to some datasets

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published