# Overview
In this section you will update us on your project, including the project you have chosen, your problem statement, an extensive outline of EDA and modeling to date, the goal of your predictive model, and the data you will use to explore that model.

This notebook:
- Identifies which of the three proposals you outlined in your lightning talk you have chosen
- Articulates the main goal of your project (your problem statement)
- Outlines your proposed methods and models
- Defines the risks & assumptions of your data
- Revises initial goals & success criteria, as needed
- Documents your data source
- Performs & summarizes preliminary EDA of your data

# Problem Statement
#### Guidelines:
- Specific:
    - What precisely do you plan to do?
    - What type of model will you need to develop?
- Measurable:
    - What metrics will you be using to assess performance?
    - MSE? Accuracy? Precision? AUC?
- Achievable:
    - Is your project appropriately scoped?
    - Is it too aggressive? Too easy?
    - Note: If your project is too big, break it up into smaller pieces. Sometimes a good project is simply one part of a larger, longer-term agenda.
- Relevant:
    - Does anyone care about this?
    - Why should people be interested in your results?
    - What value will the completion of your project be adding?
- Time-bound:
    - What's your deadline?
 

### My Statement:

I plan to compare a convolutional neural network (CNN) with a capsule network on the task of classifying energy deposition images of five particle types, using data provided by the Deep Learn Physics group (http://deeplearnphysics.org/DataChallenge/). I will use model accuracy to assess performance. If investigating the capsule network proves too difficult, I will stick with the convolutional neural network and pursue the capsule network on my own after the course ends. One of the most enticing aspects of using this data is that the Deep Learn Physics group has proposed a challenge in which I can contact them to share my results and take part in an informal competition. I understand that this data is from 2017, but even if the challenge has ended, the data is still interesting and connects closely with my physics background.
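
Since accuracy is the chosen metric, a minimal sketch of how it could be computed for a five-class problem is shown here. The integer labels 0 to 4 and the toy predictions are hypothetical placeholders, not values from the dataset; the per-class breakdown is an optional extra that may help spot a particle type the model struggles with.

```python
import numpy as np

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted class matches the true class."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    return float(np.mean(y_true == y_pred))

def per_class_accuracy(y_true, y_pred, num_classes):
    """Accuracy restricted to the samples of each true class."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    result = {}
    for cls in range(num_classes):
        mask = y_true == cls
        if mask.any():  # skip classes that do not occur in y_true
            result[cls] = float(np.mean(y_pred[mask] == cls))
    return result

# Hypothetical labels: five particle types encoded as 0..4.
y_true = np.array([0, 1, 2, 3, 4, 0, 1, 2, 3, 4])
y_pred = np.array([0, 1, 2, 3, 4, 0, 2, 2, 3, 1])

overall = accuracy(y_true, y_pred)
by_class = per_class_accuracy(y_true, y_pred, num_classes=5)
print(overall)
print(by_class)
```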

### Why capsule networks?

Why not just convolutional neural networks? 
- Here, we have data of particle trajectories
- These trajectories can appear in the images at any angle, so the same particle type can show up in arbitrarily many orientations
    - The trajectories are voxelized 3D data stored in a 2D format.
    - In 3D computer graphics, a voxel represents a value on a regular grid in three-dimensional space. The word voxel originated analogously to the word "pixel", with vo representing "volume" and el representing "element"; similar formations with el for "element" include the words "pixel" and "texel" 
- A convolutional neural network has no built-in notion of rotation, so it generally needs to see each orientation during training in order to learn it.
    - Image augmentation (rotating, inverting, or otherwise modifying the data) can generate these extra orientations, but it multiplies the size of the training set and is computationally expensive.
    - CNNs handle translation well but are not inherently invariant to rotation (source: https://analyticsindiamag.com/why-do-capsule-networks-work-better-than-convolutional-neural-networks/)
- Max pooling does not provide 'viewpoint invariance': the ability of a model's output to stay the same under changes in viewpoint
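
The points above can be illustrated with a small NumPy sketch. It is a toy setup, not the project's model: the 16x16 image, the straight "track", and the 1x3 line-detector kernel are all hypothetical. One fixed filter followed by global max pooling gives the same response when the track is shifted, but a different response when it is rotated, which is why rotated copies are usually added through augmentation.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def global_max_response(image, kernel):
    """Valid cross-correlation with one kernel, then global max pooling."""
    windows = sliding_window_view(image, kernel.shape)
    feature_map = np.einsum("ijkl,kl->ij", windows, kernel)
    return float(feature_map.max())

# Hypothetical toy image: a short horizontal track on an empty background.
image = np.zeros((16, 16))
image[4, 3:9] = 1.0

# Hypothetical filter that responds to horizontal segments.
kernel = np.ones((1, 3))

# Same track moved down 5 rows and right 4 columns (no wrap-around here).
shifted = np.roll(image, shift=(5, 4), axis=(0, 1))
# Same track rotated by 90 degrees, so it is now vertical.
rotated = np.rot90(image)

r_orig = global_max_response(image, kernel)     # 3.0
r_shift = global_max_response(shifted, kernel)  # 3.0, unchanged by translation
r_rot = global_max_response(rotated, kernel)    # 1.0, changed by rotation

# Simple augmentation: the four 90-degree rotations of each training image.
# This quadruples the data, which is where the extra compute cost comes from.
augmented = [np.rot90(image, k) for k in range(4)]
aug_responses = [global_max_response(a, kernel) for a in augmented]

print(r_orig, r_shift, r_rot)
print(aug_responses)
```

Real tracks arrive at arbitrary angles rather than multiples of 90 degrees, so a full augmentation scheme would need many more rotated copies than the four used in this sketch.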

Why capsule networks?
- They use dynamic routing in place of the max pooling layers in a CNN:
    - Capsule networks have been reported to outperform CNNs on problems involving viewpoint invariance
    - This could be useful for handling different incoming particle trajectories
- They use a 'hierarchy of parts' method
    - Each of the five trajectory types could be classified as a 'part'
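
To make the dynamic routing idea more concrete, here is a minimal NumPy sketch of routing-by-agreement with a squash nonlinearity, assuming the commonly described variant from the original capsule network paper. The shapes (32 lower-level capsules, 5 output capsules for the five particle types, 16-dimensional output vectors) and the random prediction vectors are hypothetical; in a real network the predictions would come from learned weight matrices, and this is not a full capsule layer.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-9):
    """Shrink vectors to length < 1 while keeping their direction."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dynamic_routing(u_hat, num_iterations=3):
    """Routing-by-agreement.

    u_hat has shape (num_in, num_out, dim): the prediction that each
    lower-level capsule i makes for each higher-level capsule j.
    Returns the output capsule vectors and the coupling coefficients
    used in the final iteration.
    """
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))  # routing logits, start uniform
    for _ in range(num_iterations):
        c = softmax(b, axis=1)                     # couplings per input capsule
        s = np.einsum("ij,ijd->jd", c, u_hat)      # weighted sum of predictions
        v = squash(s)                              # output capsule vectors
        b = b + np.einsum("ijd,jd->ij", u_hat, v)  # reward agreeing predictions
    return v, c

# Hypothetical sizes: 32 input capsules, 5 particle classes, 16-dim outputs.
rng = np.random.default_rng(0)
u_hat = rng.normal(size=(32, 5, 16))

v, c = dynamic_routing(u_hat)
lengths = np.linalg.norm(v, axis=-1)  # one length per class, each below 1
print(v.shape, c.shape)
print(lengths)
```

In this formulation the length of each output vector can be read as a score for the corresponding class, so the predicted particle type would be `lengths.argmax()`.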