Skip to content

ML competition hosted on kaggle. Binary classification problem to predict the income of USA population.

License

Notifications You must be signed in to change notification settings

Christ14n97/Machine_Learning_Competition_2022

Repository files navigation

Machine_Learning_Competition_2022

ML competition hosted on kaggle. Binary classification problem to predict the income of USA population.

For the Machine Learning Kaggle Competition 2022, we were given a dataset derived from the United States Census Bureau (USCB). The USBC conducts various yearly surveys; as well as the decennial census, which produces data about the U.S. population and its economy. Data obtained essentially enables federal and local governments to make educated decisions regarding the allocation of federal funds, international trade, health, housing, and other influential elements to the standard of living.

This project helped us gaining hands-on experience of the theory learned along the course. Among those aspects we put into practice, it can be highlighted:

  1. Data exploration
  2. Feature engineering
  3. Missing values imputation
  4. Model selection and training
  5. Hyperparameter tuning by cross-validation
  6. Model performance comparison by setting an unified criteria
  7. Stacking

Our final model reached the 10th place in the private leaderboard with an $accuracy = 0.85687$.

About

ML competition hosted on kaggle. Binary classification problem to predict the income of USA population.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published