In this project we will apply statistical methods to understand IOT sensor dataset and come up with feature combination list which can used while building models
One of the important steps in ML project is understanding the data(features).
- Continuous
- Categorical - Ordinal/Nominal
- Continuous variable : Central tendency measure
- Categorical variable : Frequency table
- Dependency between 2 Continuous variables : PearsonR correlation
- Dependency between 2 Categorical variables: Chi-square hypothesis
- Dependency between Continuous and Categorical variables : ANOVA hypothesis
This module can be tested on any dataset after applying required preprocessing (Eg: Convert DateTime variables)