Skip to content

Information system for business project - building and mining data warehouse

Notifications You must be signed in to change notification settings

dynonguyen/Data-Warehouse-UKAccident

Repository files navigation

INFORMATION SYSTEM FOR BUSINESS INTELLIGENCE PROJECT

BUILDING AND MINING DATA WAREHOUSE


Team TTKD-12 - HCMUS IT K18

  • 1712766 - Nguyễn Chí Thanh
  • 18120606 - Trần Thị Trang
  • 18120609 - Hồ Khắc Minh Trí
  • 18120634 - Nguyễn Lê Anh Tuấn

Requirements and submission rules

Build and analyze data about car accidents in the UK over 3-4 years

  • Data Description: Describe meaning of the properties of the following data sources (only describe the properties necessary for the project):
    • UK Car Accidents 2005 - 2015 data (Students only takes data over 3-4 years, or the provided 2011-2014 data): Reference
    • LSOA-Postcode Mapping data: Reference
    • UK-Postcodes data: Reference
  • Design data warehouse (DW), synthesize, load data from the sources into DW, then design and build Cube: Suggestions.
    • For England and Wales: map the above data sources to get the values for building Geography dimension with dimensional hierarchy as follows: Country > Region > County > Town City.
    • For Scotland and North Ireland, students need to suggest solutions to create values for Geography dimension.
    • Transform the datetime data to create the Date dimension with dimensional hierarchy: Year > Quarter > Month > Day.
    • Define and design other dimensional hierarchies to meet OLAP and Report requirements.

OLAP & Report

  1. Report the number of calsulities by Severity (Fatal, Serious, Slight) in the Local Authority Districts over years.
  2. Report the number of calsulities by Severity (Fatal, Serious, Slight) in the Local Authority Districts by Quarters in years.
  3. Report the number of faltal calsulities by Gender, Casualty Type and Age Band over years.
  4. Report the number of accidents by Severity and Time of Day (Morning: 5am-12pm, Afternoon: 12pm-5pm, Evening: 5pm-9pm, Night: 9pm-5am) over years.
  5. Report the number of accidents by Severity, Urban or Rural Area and Road Type over years.
  6. Report the number of calsulities by Severity, Casualty Type and Age Group over years, Age Group is defined as below
  • Children: 0-15
  • Young adult: 0-17
  • Adult: 18-59
  • 60 and over: 60-...
  1. Report the number of accidents by Journey Purpose and Vehicle Type.
  2. Create a new attribute Built-up Road in Accidients table. Built-up Road may have 2 values.
  • Built-up road: if Speed Limit below 50 mph
  • Non Built-up road: if Speed Limit above 50 mph
  1. Report the number of accidents by Severity, Vehicle Type, Built-up Road over years.
  2. Students design other reports about UK car accident.
  3. Define fact Variance to calculate the increase and the decrease of the number of car accidents in percent over years.
  4. Build graphs/charts for the above reports
  5. Use regional map to visually represent (by color) the number of car accidents in regions during a year.

Data Mining: Suggestion

  • Using models to predict the severity of accidents
  • Students propose applications of any case, explain the algorithm used, why, how the results are, etc.

Result


NDS (Normalize Data Store):


NDS

DDS (Dimensional Data Store):


DDS Accident DDS Casualty DDS Vehicle

OLAP (Online Analytical Processing):


OLAP OLAP OLAP


MDX:


MDX


Data Visualization:


Visualization Visualization Visualization Visualization Visualization Visualization Visualization Visualization Visualization Visualization


Data Mining:


Data Mining Data Mining

About

Information system for business project - building and mining data warehouse

Topics

Resources

Stars

Watchers

Forks

Contributors 4

  •  
  •  
  •  
  •  

Languages