# Uber Taxi Fare Prediction

This project is based on trip data from **Uber Inc.**, one of the largest ride-hailing companies in the world.  
The goal is to build regression models to **predict the fare amount** for future Uber rides.

Uber serves hundreds of thousands of customers every day. Managing this large volume of trip data is crucial for generating business insights and improving decision-making. Accurate fare estimation is an important part of pricing strategy and customer satisfaction.

---

## 📊 Dataset Description

The dataset (sourced from Kaggle) contains the following fields:

- **key** – Unique identifier for each trip  
- **fare_amount** – The cost of each trip in USD  
- **pickup_datetime** – Date and time when the meter was engaged  
- **passenger_count** – Number of passengers in the vehicle (driver-entered value)  
- **pickup_longitude** – Longitude where the meter was engaged  
- **pickup_latitude** – Latitude where the meter was engaged  
- **dropoff_longitude** – Longitude where the meter was disengaged  
- **dropoff_latitude** – Latitude where the meter was disengaged  
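
The coordinate and timestamp fields above are natural inputs for derived features such as trip distance and hour of day. The following is a minimal sketch only: it assumes the haversine great-circle formula (this README does not say which distance measure the project uses), a mean Earth radius of 6371 km, and a `YYYY-MM-DD HH:MM:SS UTC` timestamp format. The function names `haversine_km` and `datetime_features` and the sample row are hypothetical.

```python
import math
from datetime import datetime

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def datetime_features(pickup_datetime):
    """Split a timestamp string into simple calendar features.

    Assumes the format "2015-05-07 19:52:06 UTC" (an assumption, not confirmed here).
    """
    ts = datetime.strptime(pickup_datetime.replace(" UTC", ""), "%Y-%m-%d %H:%M:%S")
    return {"year": ts.year, "month": ts.month, "weekday": ts.weekday(), "hour": ts.hour}


# Hypothetical trip row using the field names from the dataset description.
trip = {
    "pickup_datetime": "2015-05-07 19:52:06 UTC",
    "pickup_longitude": -73.9998,
    "pickup_latitude": 40.7384,
    "dropoff_longitude": -73.9995,
    "dropoff_latitude": 40.7232,
}

distance_km = haversine_km(
    trip["pickup_latitude"], trip["pickup_longitude"],
    trip["dropoff_latitude"], trip["dropoff_longitude"],
)
features = {"distance_km": distance_km, **datetime_features(trip["pickup_datetime"])}
print(features)
```

Rows with coordinates outside the valid latitude/longitude ranges would need to be filtered out before computing the distance.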

**Acknowledgement:**  
The dataset is sourced from Kaggle.

---

## 🎯 Project Objectives

- Understand the dataset and perform data cleaning (if required).  
- Engineer meaningful features (e.g., distance, date & time features).  
- Build regression models to predict the fare price of Uber rides.  
- Evaluate and compare models using metrics such as **R²**, **RMSE**, and **MAE**.
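
To make the evaluation objective concrete, the three metrics named above can be computed directly from true and predicted fares. This is a minimal dependency-free sketch; the helper name `regression_metrics` and the sample fares are hypothetical, and the project's notebooks may use a library implementation instead (for example scikit-learn's `r2_score`, `mean_squared_error`, and `mean_absolute_error`).

```python
import math


def regression_metrics(y_true, y_pred):
    """Return R², RMSE, and MAE for two equal-length sequences of numbers."""
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and the same length")

    errors = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(e * e for e in errors)              # residual sum of squares
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares

    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(ss_res / n)
    # R² is undefined when every true value is identical (ss_tot == 0).
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return {"r2": r2, "rmse": rmse, "mae": mae}


# Hypothetical fares in USD, for illustration only.
actual = [10.0, 20.0, 30.0]
predicted = [12.0, 18.0, 33.0]
print(regression_metrics(actual, predicted))
```

RMSE penalizes large errors more heavily than MAE, so reporting both helps show whether a model's error is dominated by a few badly mispredicted trips.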

---

## 🧱 Project Structure

```bash
uber-taxi-fare-prediction/
├─ data/
│   └─ uber_train.csv                # Raw or cleaned training data
│
├─ notebooks/
│   ├─ 01_eda_taxi_fare.ipynb        # Exploratory Data Analysis (EDA)
│   ├─ 02_feature_engineering.ipynb  # Feature engineering & data cleaning
│   ├─ 03_modeling_regression.ipynb  # Model training & evaluation
│   └─ 04_prediction_interface.ipynb # Simple prediction interface / demo
│
├─ src/
│   ├─ utils.py                      # Helper functions (e.g., distance calculation)
│   └─ train_model.py                # Script to train and save models (optional)
│
├─ requirements.txt
└─ README.md
