Skip to content

Trains, tunes, and evaluates different regression models to develop a time-efficient, high-quality model for predicting car prices based on RMSE and CPU runtime.

Notifications You must be signed in to change notification settings

adkwn1/numerical_methods

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Numerical Methods

Author: Andrew Kwon

Description

This project trains and evaluates different prediction models for a regression task. Scope of the project is to balance runtime performance with model quality. Disclaimer: this project runs on CPU only support in its current state to measure CPU runtime performance; users can expect long runtimes if starting the notebook from a fresh kernel.

Introduction

A used car sales service is developing an app to attract new customers where users can quickly find out the market value of their car. In this project, we will need to build the model that predicts the car values. The company is interested in the quality (evaluated on RMSE), speed of the prediction, and time required for model training.

Dataset

We have access to historical data such as technical specifications, trim versions, and prices, which are further detailed in the file car_data.csv. Due to upload size limitations, users will need to extract the 7zip archive into the appropriate directory.

Features

  • DateCrawled: date profile was downloaded from the database
  • VehicleType: vehicle body type
  • RegistrationYear: vehicle registration year
  • Gearbox: gearbox type
  • Power: engine power in horsepower
  • Model: vehicle model
  • Mileage: mileage (in kilometers)
  • RegistrationMonth: vehicle registration month
  • FuelType: fuel type
  • Brand: vehicle brand
  • NotRepaired: vehicle repaired or not
  • DateCreated: date of profile creation
  • NumberOfPictures: number of vehicle pictures
  • PostalCode: postal code of profile user
  • LastSeen: date of the last activity of the user

Target

  • Price: price in euro

Requirements

  • pandas
  • numpy
  • time
  • matplotlib.pyplot
  • sklearn.ensemble
  • sklearn.linear_model
  • sklearn.metrics
  • sklearn.model_selection
  • lightgbm
  • catboost

About

Trains, tunes, and evaluates different regression models to develop a time-efficient, high-quality model for predicting car prices based on RMSE and CPU runtime.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published