# COGS 108 - Final Project Proposal

# Names

- Michelle Tenin
- Junhee Chung
- Dilara Marasili
- Nada Gehad Gelal
- Justin Chanthabandith

# Research Question

*What effect do driving conditions, such as weather, roads, and traffic, have on the success or failure of autonomous vehicle (AV) technology in development?*

## Background and Prior Work

Autonomous vehicle (AV) development has progressed slowly over the last couple of decades, and buzzwords like machine learning and AI have made AVs one of the more intriguing commercial applications of this emerging technology. Driving automation is commonly described in five levels:

1. “At least one driver support system that provides steering assistance OR braking and acceleration assistance” [^jdpower].
2. “Level 2 driving automation applies to vehicles with advanced driving assistance systems (ADAS) that can take over steering, acceleration, and braking in specific scenarios” [^jdpower].
3. Starting from Level 3, it becomes “automated driving features”, as opposed to simply being “driver support features.” Level 3 requires a human driver to be “present, alert, and able to take control of the vehicle at any time” [^jdpower].
4. “Level 4 autonomy does not require any human interaction in the vehicle’s operation,” but it is limited by “geofencing technology” and other external factors [^jdpower].
5. Level 5 autonomy, in addition to all of Level 4’s features, has no boundaries and can drive itself anywhere in any condition [^jdpower].

Most companies have managed to commercialize Level 2 autonomy in their vehicles through driver-assist features such as Tesla’s Autopilot, GM’s Super Cruise, and Nissan’s ProPILOT. However, as these companies continue to work toward full self-driving (FSD) capabilities, the imperfections of the software show up in collision and disengagement reports. Since many autonomous vehicle developers, like Cruise and Waymo, flock to California’s Silicon Valley for its favorable driving conditions and investment opportunities, the CA DMV has put reporting processes in place to capture the activity of autonomous vehicles being developed in public traffic.

This accumulation of information in collision and disengagement reports has allowed data scientists to explore the nature of driver-car interaction. Specifically, Zhang et al. [^zhang] explored the cause-and-effect relationships of autonomous vehicle disengagements (AVD), looking at long-hand descriptions of disengagements (driver-AV transitions) to try to understand their causes and safety implications. The paper found that disengagements were driver-initiated more than 80% of the time and were the result of some AV system failure (the paper breaks this down in more detail).

Another research group, Alambeigi et al. [^alambeigi], focuses on the open-ended collision reports to gain insight into driver-AV interaction and the safety issues associated with transitioning from autonomous to manual driving. This paper goes into more detail on data wrangling and cleaning techniques, as well as the authors' approach to analysis (how they parsed the long-hand descriptions). Although these two papers focus more heavily on the safety of driver-AV takeover transitions than on driving conditions, the insights into how they handled their datasets will be helpful for our own analysis.

A third paper, by Dixit et al. [^dixit], asks a research question similar to ours: what is causing these disengagements, and is there a relationship between driving situations and the disengagements initiated (by either the driver or the AV)? The difference is that they look more closely at the disengagement reports alone (specifically reaction times) and compare them to the mileage reports, finding a positive correlation between an AV’s mileage and its number of disengagements. This was an insightful paper that looked closely at some of the factors for poor performance that we are interested in as well; however, the research was conducted with data only up to 2016, which may be outdated by this point. One potential future direction is to compare our research findings with those from Dixit et al.

Other Reading Material:
* [Automated Vehicle Safety Historical Article](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0168054)
  * Evolution of autonomy over the last couple of decades
* [Current challenges in developing Autonomous Vehicles](https://www.vox.com/2016/4/21/11447838/self-driving-cars-challenges-obstacles)
  * Interesting takeaway: driving has inherently social processes embedded
  * Also references weather as an obstacle
* [Impact of Autonomy on Driver Skills](https://www.michigan.gov/documents/mdot/Impact_of_CAV_Technology_on_Driver_Skills_533519_7.pdf)
  * Analyzes automated driving features that exist on-road today and how they affect human driving behavior
  * The report found a loss in driver skill and in the ability to take over the car when automation ends (abrupt takeover results in worse performance than anticipated takeover). More detailed results can be found in the report.
* [Brief about the 5 main challenges in AV driving](https://www.iiot-world.com/artificial-intelligence-ml/artificial-intelligence/five-challenges-in-designing-a-fully-autonomous-system-for-driverless-cars/)
  * Short article that summarizes the main challenges to AV driving: road conditions, weather conditions, traffic conditions, accident liability, and radar interference (the first three are probably the most important)
* [Article giving background on disengagement reports](https://www.cnet.com/roadshow/news/apple-self-driving-car-reveal-cruise-waymo-testing-data/)
* [Research paper exploring the causes of autonomous vehicles disengagement (AVD)](https://arxiv.org/pdf/2111.03511.pdf)
  * Analyzes the safety of transitioning the car from AV to driver (AVD) and what causes poor AVD
* [Research paper exploring AVD by mapping to certain collision scenarios and evaluating safety](https://arxiv.org/pdf/2001.11087.pdf)
  * Provides a good breakdown of data wrangling and cleaning methods, as well as the reasoning behind the report language parsing

[^jdpower]: “Car Reviews, Ratings and Awards | J.D. Power and Associates.” J. D. Power, www.jdpower.com. Accessed 23 Apr. 2022.
[^zhang]: Zhang, Yangtao, et al. “Disengagement Cause-and-Effect Relationships Extraction Using an NLP Pipeline.” https://arxiv.org/pdf/2111.03511.pdf
[^alambeigi]: Alambeigi, Hananeh, et al. “Crash Themes in Automated Vehicles: A Topic Modeling Analysis of the California Department of Motor Vehicles Automated Vehicle Crash Database.” https://arxiv.org/ftp/arxiv/papers/2001/2001.11087.pdf
[^dixit]: Dixit, Vinayak V., et al. “Autonomous Vehicles: Disengagements, Accidents and Reaction Times.” https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0168054&type=printable



# Hypothesis


*We think that poor weather conditions degrade driving capabilities and will cause autonomous vehicle disengagements (AVD), due to poor camera image quality and slippery roads. Unique driving situations, such as obstacles (e.g., pedestrians) and other vehicles, will cause more disengagements and collisions because they are edge cases in the data. We also expect AV performance to be less consistent on streets than on highways, because neighborhood roadways vary more than highways.*
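
As a rough illustration of how the street-versus-highway part of this hypothesis might later be examined, the sketch below tabulates disengagements by location and by who initiated them. The column names (`location`, `initiated_by`) and every row are made up for illustration and are assumptions, not values from any DMV report.

```python
import pandas as pd

# Made-up rows for illustration only; not taken from any DMV report.
# Column names are hypothetical placeholders.
records = pd.DataFrame(
    {
        "location": ["Street", "Street", "Highway", "Street", "Highway", "Street"],
        "initiated_by": [
            "Test Driver", "AV System", "Test Driver",
            "Test Driver", "AV System", "Test Driver",
        ],
    }
)


def disengagements_by_location(df: pd.DataFrame) -> pd.DataFrame:
    """Count disengagements for each (location, initiator) pair."""
    return pd.crosstab(df["location"], df["initiated_by"])


counts = disengagements_by_location(records)

# Share of all sample disengagements that occurred at each location type.
location_share = records["location"].value_counts(normalize=True)

print(counts)
print(location_share)
```

Raw counts like these do not account for how many miles were driven on each road type, so any real comparison would need to be normalized by exposure before drawing conclusions.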

# Data

What variables? 

- Manufacturer
- Date
- Vehicle is capable of operating without a driver
- Driver present
- Disengagement initiated by (driver or AV)
- Disengagement location
- Description of facts that caused the disengagement
- Weather condition
- Driving performance measures
  1. Lateral control: steering
  2. Longitudinal control: braking, accelerating
  3. Object and event detection and response (OEDR)
- Driving situations
  1. Traffic lights
  2. Stop signs
  3. Intersections
  4. Weather conditions

How would they be stored? 
- As semi-structured, well-documented, machine-readable data with mostly discrete/categorical variables

How many observations would you have? 
- More than 1000 

What/who would the observations be? Over what time period? etc. 
- We expect the observations to be self-driving car trials between 2013 and 2022, ideally covering the last ten years.
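
The storage format and time window described above could be handled along the following lines. This is a minimal sketch, assuming the reports arrive as CSV text; the column names, the `SAMPLE_CSV` rows, and the helper functions `load_reports` and `within_study_period` are all hypothetical and made up for illustration.

```python
import io

import pandas as pd

# Made-up sample in CSV form; column names are placeholders, not the DMV's,
# and the rows are invented purely to exercise the code.
SAMPLE_CSV = """manufacturer,date,driver_present,initiated_by,location,description
ExampleCo,2019-03-14,Yes,Test Driver,Street,Planning discrepancy near intersection
ExampleCo,2012-11-02,Yes,AV System,Highway,Perception issue in heavy rain
OtherCo,2021-07-30,No,AV System,Street,Unexpected pedestrian behavior
"""

CATEGORICAL_COLUMNS = ["manufacturer", "driver_present", "initiated_by", "location"]


def load_reports(csv_text: str) -> pd.DataFrame:
    """Parse report rows, converting dates and discrete fields to proper dtypes."""
    df = pd.read_csv(io.StringIO(csv_text), parse_dates=["date"])
    for column in CATEGORICAL_COLUMNS:
        df[column] = df[column].astype("category")
    return df


def within_study_period(
    df: pd.DataFrame, start: str = "2013-01-01", end: str = "2022-12-31"
) -> pd.DataFrame:
    """Keep only rows dated inside the proposed study window (inclusive)."""
    mask = df["date"].between(pd.Timestamp(start), pd.Timestamp(end))
    return df[mask]


reports = load_reports(SAMPLE_CSV)
in_period = within_study_period(reports)
print(len(in_period), "of", len(reports), "sample rows fall in 2013-2022")
```

Storing the discrete fields as categoricals keeps later grouping and counting straightforward, while the free-text description column stays as plain strings for any later parsing.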


# Ethics & Privacy

The DMV datasets report disengagements of vehicles that were being tested and are often still under development, so the data is not necessarily representative of fully developed cars used by the public. However, the tested cars are highly likely to have difficulty in the same types of situations as fully developed cars.

People in different cities and states have different driving habits. For example, drivers in San Diego may generally follow the rules of the road more closely than drivers in Los Angeles, so the risks a test car faces are likely to differ by location. Since the exact testing location is unspecified in the dataset, we cannot judge whether the data is sufficiently diverse across different parts of California.


# Team Expectations 

* *Attend all scheduled meetings*
* *Communication through Facebook Messenger*
* *Keep us updated if you can’t pull through on something*
* *Check messages regularly throughout the day*
* *If someone doesn’t follow through with expectations, we can escalate to the TA and then the professor*
* *If conflict arises, we will handle communication in a professional way*
* *Ask for help as early as you can to prevent last-minute work*

# Project Timeline Proposal

| Meeting Date | Meeting Time | Completed Before Meeting | Discuss at Meeting |
|---|---|---|---|
| 4/25 | 12 PM | Read and think about COGS 108 expectations; brainstorm topics/questions | Review assignment and assign roles; find datasets, set up programming environment, clean data |
| 5/2 | 12 PM | TBD | Check-in meeting |
| 5/5 | Before 11:59 PM | Soft Deadline | TBD |
| 5/6 | Before 11:59 PM | Deadline | Review work and finishing touches |
| 5/9 | 12 PM | TBD | Review assignment and assign roles; exploratory data analysis (data viz, observations), and discussion |
| 5/16 | 12 PM | TBD | Check-in meeting |
| 5/19 | Before 11:59 PM | Soft Deadline | TBD |
| 5/20 | Before 11:59 PM | Deadline | Review work and finishing touches |
| 5/23 | 12 PM | TBD | Review remaining sections for the final report and divide tasks |
| 5/30 | 12 PM | TBD | Check-in meeting, start to consolidate and add finishing touches |
| 6/3 | 12 PM | Soft Deadline | Review report |
| 6/6 | Before 11:59 PM | Deadline | Turn in Final Project & Group Project Surveys |