# DeepLabCut Toolbox - DEMO (mouse reaching)
https://github.com/AlexEMG/DeepLabCut

#### The notebook accompanies the following user-guide:

Nath\*, Mathis\* et al. *Using DeepLabCut for markerless pose estimation during behavior across species* Nature Protocols, 2019: https://www.nature.com/articles/s41596-019-0176-0

This notebook starts from an already initialized project with labeled data.

**Data:** dataset is from Mathis et al. *Somatosensory Cortex Plays an Essential Role in Forelimb Motor Adaptation in Mice* Neuron, 2017: DOI:https://doi.org/10.1016/j.neuron.2017.02.049

This notebook illustrates how to:
- plot the labeled images
- train a network
- evaluate a network
- analyze a novel video
- create an automatically labeled video 
- plot the trajectories 
- identify outlier frames
- annotate the outlier frames manually
- merge the data sets and update the training set
- retrain the network on the expanded training set

## Import the toolbox:

In [1]:
import deeplabcut



### Set a variable to point to the project configuration file:

In [2]:
import os
from pathlib import Path

# Note that the parameters of this project can be seen in Reaching-Mackenzie-2018-08-30/config.yaml
# Create a variable that holds the path to the config.yaml file:
path_config_file = os.path.join(os.getcwd(), 'Reaching-Mackenzie-2018-08-30/config.yaml')
print(path_config_file)

E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30/config.yaml
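Every later step depends on this path, so it can help to confirm that the file exists before continuing. The following is a minimal sketch using `pathlib`; the helper name `resolve_config` is hypothetical and is not part of DeepLabCut.

```python
from pathlib import Path


def resolve_config(project_dir):
    """Return the path to config.yaml inside project_dir as a string.

    Hypothetical helper for illustration only. It raises FileNotFoundError
    when the file is missing, so a typo in the path surfaces early.
    """
    config = Path(project_dir) / "config.yaml"
    if not config.is_file():
        raise FileNotFoundError(f"No config.yaml found in {project_dir}")
    return str(config)
```

With the demo project in the current working directory, `resolve_config(Path.cwd() / 'Reaching-Mackenzie-2018-08-30')` would return the same string as `path_config_file`.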


NOTE: When you use DeepLabCut on your own data, you will (1) create a project, (2) extract frames to label, and (3) label your data.
**In this demo, this is all done for you!**
The purpose of the demo is for you to get familiar with part of the workflow.

### Load the pre-labeled data:

In [3]:
#let's load some demo data, and create a training set 
#(note, this function is not used when you create your own project):

deeplabcut.load_demo_data(path_config_file)

Loaded, now creating training data...
The training dataset is successfully created. Use the function 'train_network' to start training. Happy training!


In [4]:
#Perhaps plot the labels to see how the frames were annotated:

deeplabcut.check_labels(path_config_file)

Creating images with labels by Mackenzie.
They are stored in the following folder: E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\labeled-data\reachingvideo1_labeled.
If all the labels are ok, then use the function 'create_training_dataset' to create the training dataset!


## Start training of Feature Detectors
This function trains the network for a specific shuffle of the training dataset. **The user can set various parameters in /Reaching-Mackenzie-2018-08-30/dlc-models/iteration-0/ReachingAug30-trainset95shuffle1/train/pose_cfg.yaml.**

Training can be stopped at any time. Note that the weights are only stored every 'save_iters' steps. For this demo it is advisable to store and display the progress very often (e.g. display every 20, save every 100). In practice this is inefficient: in a real project you will train until ~200K iterations, so we save every 50K.

**We recommend just training for 10-20 min, as you aren't running this demo to use DLC, just to work through the steps. In total, this demo should take you LESS THAN 1 HOUR!**
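To see why stopping too early leaves nothing to evaluate, the sketch below lists which snapshots would exist after stopping at a given iteration. It assumes that a snapshot is written each time the iteration count reaches a multiple of `save_iters`; the function name `saved_snapshots` is hypothetical, and this is not DeepLabCut's own code.

```python
def saved_snapshots(stopped_at, save_iters):
    """List the iteration numbers at which a snapshot would have been written.

    Assumption: one snapshot per multiple of save_iters, up to and
    including the iteration at which training was stopped.
    """
    if save_iters <= 0:
        raise ValueError("save_iters must be positive")
    return list(range(save_iters, stopped_at + 1, save_iters))


print(saved_snapshots(270, 300))  # stopped before the first save
print(saved_snapshots(650, 300))  # stopped after the second save
```

Under this assumption, training with a save interval of 300 has to reach at least iteration 300 before there is a snapshot to evaluate.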

In [5]:
deeplabcut.train_network(path_config_file, shuffle=1, saveiters=300, displayiters=10)
# Notice the variables "saveiters" and "displayiters" that can be set in the function.


# You just need to run this until you get at least 1 snapshot, which is set by "save_iters"
# (so in this case you could stop after 300!). How do I stop? Click the STOP button!
# Training until ~2,000 iterations on a CPU should take ~30 min.

Config:
{'all_joints': [[0], [1], [2], [3], [4]],
 'all_joints_names': ['Hand', 'Finger1', 'Tongue', 'Joystick1', 'Joystick2'],
 'batch_size': 1,
 'bottomheight': 400,
 'crop': True,
 'crop_pad': 0,
 'cropratio': 0.4,
 'dataset': 'training-datasets\\iteration-0\\UnaugmentedDataSet_ReachingAug30\\Reaching_Mackenzie95shuffle1.mat',
 'dataset_type': 'default',
 'deterministic': False,
 'display_iters': 1000,
 'fg_fraction': 0.25,
 'global_scale': 0.8,
 'init_weights': 'C:\\Users\\LabAdmin\\.conda\\envs\\dlc-windowsGPU\\lib\\site-packages\\deeplabcut\\pose_estimation_tensorflow\\models\\pretrained\\resnet_v1_50.ckpt',
 'intermediate_supervision': False,
 'intermediate_supervision_layer': 12,
 'leftwidth': 400,
 'location_refinement': True,
 'locref_huber_loss': True,
 'locref_loss_weight': 0.05,
 'locref_stdev': 7.2801,
 'log_dir': 'log',
 'max_input_size': 1500,
 'mean_pixel': [123.68, 116.779, 103.939],
 'metadataset': 'training-datasets\\iteration-0\\UnaugmentedDataSet_ReachingAug30\\Do

Starting with standard pose-dataset loader.
Instructions for updating:
Colocations handled automatically by placer.
Instructions for updating:
Use tf.cast instead.
Instructions for updating:
Use standard file APIs to check for files with this prefix.
INFO:tensorflow:Restoring parameters from C:\Users\LabAdmin\.conda\envs\dlc-windowsGPU\lib\site-packages\deeplabcut\pose_estimation_tensorflow\models\pretrained\resnet_v1_50.ckpt
Display_iters overwritten as 10
Save_iters overwritten as 300
Training parameter:
{'stride': 8.0, 'weigh_part_predictions': False, 'weigh_negatives': False, 'fg_fraction': 0.25, 'weigh_only_present_joints': False, 'mean_pixel': [123.68, 116.779, 103.939], 'shuffle': True, 'snapshot_prefix': 'E:\\Users\\Phil\\DeepLabCut\\examples\\Reaching-Mackenzie-2018-08-30\\dlc-models\\iteration-0\\ReachingAug30-trainset95shuffle1\\train\\snapshot', 'log_dir': 'log', 'global_scale': 0.8, 'location_refinement': True, 'locref_stdev': 7.2801, 'locref_loss_weight': 0.05, 'locref_hu

iteration: 10 loss: 0.2919 lr: 0.005
iteration: 20 loss: 0.0413 lr: 0.005
iteration: 30 loss: 0.0302 lr: 0.005
iteration: 40 loss: 0.0318 lr: 0.005
iteration: 50 loss: 0.0267 lr: 0.005
iteration: 60 loss: 0.0248 lr: 0.005
iteration: 70 loss: 0.0203 lr: 0.005
iteration: 80 loss: 0.0289 lr: 0.005
iteration: 90 loss: 0.0289 lr: 0.005
iteration: 100 loss: 0.0250 lr: 0.005
iteration: 110 loss: 0.0201 lr: 0.005
iteration: 120 loss: 0.0220 lr: 0.005
iteration: 130 loss: 0.0235 lr: 0.005
iteration: 140 loss: 0.0261 lr: 0.005
iteration: 150 loss: 0.0204 lr: 0.005
iteration: 160 loss: 0.0251 lr: 0.005
iteration: 170 loss: 0.0251 lr: 0.005
iteration: 180 loss: 0.0211 lr: 0.005
iteration: 190 loss: 0.0208 lr: 0.005
iteration: 200 loss: 0.0235 lr: 0.005
iteration: 210 loss: 0.0287 lr: 0.005
iteration: 220 loss: 0.0161 lr: 0.005
iteration: 230 loss: 0.0203 lr: 0.005
iteration: 240 loss: 0.0216 lr: 0.005
iteration: 250 loss: 0.0201 lr: 0.005
iteration: 260 loss: 0.0193 lr: 0.005
iteration: 270 loss: 

KeyboardInterrupt: 

*Note that if training reaches the end (default 1M iterations) or you stop it (by "stop" or by CTRL+C),
you will see a keyboard interrupt "error". This is not a real error, i.e. you can ignore it.*
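If you want to inspect the loss outside the notebook, the progress lines can be parsed from the captured output. The following is a minimal sketch that assumes the line format shown in the log above (`iteration: N loss: X lr: Y`); `parse_training_log` is a hypothetical helper, not a DeepLabCut function.

```python
import re

# Matches complete progress lines only; truncated lines are skipped.
LOSS_LINE = re.compile(
    r"iteration:\s*(\d+)\s+loss:\s*(\d+(?:\.\d+)?)\s+lr:\s*(\d+(?:\.\d+)?(?:[eE][+-]?\d+)?)"
)


def parse_training_log(text):
    """Extract (iteration, loss, learning_rate) tuples from training output."""
    records = []
    for match in LOSS_LINE.finditer(text):
        iteration, loss, lr = match.groups()
        records.append((int(iteration), float(loss), float(lr)))
    return records


# The first three lines are copied from the log above; the last one is
# cut off, as happens when training is interrupted.
log = """iteration: 10 loss: 0.2919 lr: 0.005
iteration: 20 loss: 0.0413 lr: 0.005
iteration: 30 loss: 0.0302 lr: 0.005
iteration: 270 loss: """

records = parse_training_log(log)
print(records)
```

The resulting list can be passed to any plotting library to visualize how the loss develops.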

## Evaluate the trained network

This function evaluates a trained model for a specific shuffle (or several shuffles) at a particular training state (snapshot) or at all states. The network is evaluated on the data set (images), and the results are stored as a .csv file in a subdirectory under **evaluation-results**.

You can change various parameters in the ```config.yaml``` file of this project. For the evaluation one can change pcutoff. This cutoff also determines how likely estimated positions need to be in order to be shown in the plots.
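To illustrate the effect of pcutoff, the sketch below computes a mean pixel error with and without a likelihood cutoff. It is a simplified illustration with made-up numbers, not DeepLabCut's evaluation code; `mean_error` and the `>=` comparison are assumptions.

```python
import math


def mean_error(predictions, labels, pcutoff=None):
    """Mean Euclidean distance (pixels) between predictions and labels.

    predictions: list of (x, y, likelihood); labels: list of (x, y).
    If pcutoff is given, predictions with a likelihood below it are ignored.
    Returns nan when no prediction passes the cutoff.
    """
    distances = [
        math.hypot(px - lx, py - ly)
        for (px, py, p), (lx, ly) in zip(predictions, labels)
        if pcutoff is None or p >= pcutoff
    ]
    return sum(distances) / len(distances) if distances else float("nan")


# Made-up values for illustration only.
labels = [(100.0, 100.0), (200.0, 200.0)]
predictions = [(103.0, 104.0, 0.9), (260.0, 280.0, 0.1)]

print(mean_error(predictions, labels))               # all predictions
print(mean_error(predictions, labels, pcutoff=0.4))  # confident ones only
print(mean_error(predictions, labels, pcutoff=0.95)) # nothing passes: nan
```

The last call shows one way a `nan` error can arise: if no prediction in a split exceeds the cutoff, there is nothing to average.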

In [6]:
deeplabcut.evaluate_network(path_config_file,plotting=True)

Config:
{'all_joints': [[0], [1], [2], [3], [4]],
 'all_joints_names': ['Hand', 'Finger1', 'Tongue', 'Joystick1', 'Joystick2'],
 'batch_size': 1,
 'bottomheight': 400,
 'crop': True,
 'crop_pad': 0,
 'cropratio': 0.4,
 'dataset': 'training-datasets\\iteration-0\\UnaugmentedDataSet_ReachingAug30\\Reaching_Mackenzie95shuffle1.mat',
 'dataset_type': 'default',
 'deterministic': False,
 'display_iters': 1000,
 'fg_fraction': 0.25,
 'global_scale': 0.8,
 'init_weights': 'C:\\Users\\LabAdmin\\.conda\\envs\\dlc-windowsGPU\\lib\\site-packages\\deeplabcut\\pose_estimation_tensorflow\\models\\pretrained\\resnet_v1_50.ckpt',
 'intermediate_supervision': False,
 'intermediate_supervision_layer': 12,
 'leftwidth': 400,
 'location_refinement': True,
 'locref_huber_loss': True,
 'locref_loss_weight': 0.05,
 'locref_stdev': 7.2801,
 'log_dir': 'log',
 'max_input_size': 1500,
 'mean_pixel': [123.68, 116.779, 103.939],
 'metadataset': 'training-datasets\\iteration-0\\UnaugmentedDataSet_ReachingAug30\\Do

Running  DeepCut_resnet50_ReachingAug30shuffle1_300  with # of trainingiterations: 300
INFO:tensorflow:Restoring parameters from E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\dlc-models\iteration-0\ReachingAug30-trainset95shuffle1\train\snapshot-300
Analyzing data...


55it [00:03, 17.45it/s]


Done and results stored for snapshot:  snapshot-300


  testerrorpcutoff = np.nanmean(RMSEpcutoff.iloc[testIndices].values.flatten())


Results for 300  training iterations: 95 1 train error: 77.31 pixels. Test error: 99.12  pixels.
With pcutoff of 0.4  train error: 10.49 pixels. Test error: nan pixels
Thereby, the errors are given by the average distances between the labels by DLC and the scorer.
Plotting...
The network is evaluated and the results are stored in the subdirectory 'evaluation_results'.
If it generalizes well, choose the best model for prediction and update the config file with the appropriate index for the 'snapshotindex'.
Use the function 'analyze_video' to make predictions on new videos.
Otherwise consider retraining the network (see DeepLabCut workflow Fig 2)


**NOTE: depending on your setup, you may sometimes get some "matplotlib errors", but these are not important.**

Now you can go check out the images. Given the limited input data and the short training time (~20 min), the network is not expected to track well, so don't be alarmed. This is just to get you familiar with the workflow...

## Analyzing videos
This function extracts the pose from videos using a trained network. The user can choose the trained network: by default the most recent snapshot is used to analyze the videos, but the user can also specify the snapshot index via the variable **snapshotindex** in the **config.yaml** file.
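The snapshot selection can be pictured as indexing into a list of snapshots ordered by training iteration. The sketch below assumes that snapshots are named `snapshot-<iterations>` (as in the log output of this notebook) and that an index of -1 selects the most recent one; `pick_snapshot` is a hypothetical helper, not part of DeepLabCut.

```python
def pick_snapshot(snapshot_names, snapshotindex=-1):
    """Pick a snapshot name by index after sorting by iteration number.

    Assumption: names look like 'snapshot-<iterations>'.
    """
    ordered = sorted(snapshot_names, key=lambda name: int(name.rsplit("-", 1)[1]))
    return ordered[snapshotindex]


# Made-up snapshot names for illustration.
names = ["snapshot-900", "snapshot-300", "snapshot-1200", "snapshot-600"]
print(pick_snapshot(names))     # most recent
print(pick_snapshot(names, 0))  # earliest
```

Sorting by the numeric suffix matters here: a plain string sort would place `snapshot-1200` before `snapshot-300`.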

The results are stored in an HDF5 (.h5) file in the same directory where the video resides. The pose array (pose vs. frame index) can also be exported as a csv file (set flag to...), as the call below does with `save_as_csv=1`.
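Once exported, the csv file can be read with standard tools. The sketch below uses only the standard library and assumes the file has three header rows (`scorer`, `bodyparts`, `coords`) followed by one row per frame; the sample data is made up, and `read_pose_csv` is a hypothetical helper rather than a DeepLabCut function.

```python
import csv
import io


def read_pose_csv(handle):
    """Parse a pose csv into {bodypart: [(x, y, likelihood), ...]}.

    Assumed layout: header rows 'scorer', 'bodyparts', 'coords', then one
    row per frame whose first column is the frame index.
    """
    rows = list(csv.reader(handle))
    bodyparts = rows[1][1:]
    coords = rows[2][1:]
    poses = {}
    for row in rows[3:]:
        values = [float(v) for v in row[1:]]
        per_frame = {}
        for part, coord, value in zip(bodyparts, coords, values):
            per_frame.setdefault(part, {})[coord] = value
        for part, entry in per_frame.items():
            poses.setdefault(part, []).append(
                (entry["x"], entry["y"], entry["likelihood"])
            )
    return poses


# Tiny made-up sample in the assumed layout (not real output).
sample = io.StringIO(
    "scorer,demo,demo,demo,demo,demo,demo\n"
    "bodyparts,Hand,Hand,Hand,Tongue,Tongue,Tongue\n"
    "coords,x,y,likelihood,x,y,likelihood\n"
    "0,10.0,20.0,0.9,30.0,40.0,0.2\n"
    "1,11.0,21.0,0.8,31.0,41.0,0.3\n"
)
poses = read_pose_csv(sample)
print(poses["Hand"][0])
```

For a real file, pass an open file handle instead of the `io.StringIO` object.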

In [7]:
# Set the video path:
# The video can be the one you trained with or new videos that look similar, i.e. same experiments, etc.
# You can add individual videos, OR just a folder - it will skip videos that have already been analyzed.

videofile_path = os.path.join(os.getcwd(), 'Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi')

In [13]:
print("Start Analyzing the video!")
deeplabcut.analyze_videos(path_config_file,[videofile_path], save_as_csv=1)
# this video takes ~ 8 min to analyze with a CPU

Start Analyzing the video!


Config:
{'all_joints': [[0], [1], [2], [3], [4]],
 'all_joints_names': ['Hand', 'Finger1', 'Tongue', 'Joystick1', 'Joystick2'],
 'batch_size': 4,
 'bottomheight': 400,
 'crop': True,
 'crop_pad': 0,
 'cropratio': 0.4,
 'dataset': 'training-datasets\\iteration-0\\UnaugmentedDataSet_ReachingAug30\\Reaching_Mackenzie95shuffle1.mat',
 'dataset_type': 'default',
 'deterministic': False,
 'display_iters': 1000,
 'fg_fraction': 0.25,
 'global_scale': 0.8,
 'init_weights': 'C:\\Users\\LabAdmin\\.conda\\envs\\dlc-windowsGPU\\lib\\site-packages\\deeplabcut\\pose_estimation_tensorflow\\models\\pretrained\\resnet_v1_50.ckpt',
 'intermediate_supervision': False,
 'intermediate_supervision_layer': 12,
 'leftwidth': 400,
 'location_refinement': True,
 'locref_huber_loss': True,
 'locref_loss_weight': 0.05,
 'locref_stdev': 7.2801,
 'log_dir': 'log',
 'max_input_size': 1500,
 'mean_pixel': [123.68, 116.779, 103.939],
 'metadataset': 'training-datasets\\iteration-0\\UnaugmentedDataSet_ReachingAug30\\Do

Using snapshot-300 for model E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\dlc-models\iteration-0\ReachingAug30-trainset95shuffle1
num_outputs =  1
INFO:tensorflow:Restoring parameters from E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\dlc-models\iteration-0\ReachingAug30-trainset95shuffle1\train\snapshot-300
Starting to analyze %  E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi
Loading  E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi
Duration of video [s]:  8.53 , recorded with  30.0 fps!
Overall # of frames:  256  found with (before cropping) frame dimensions:  832 747
Starting to extract posture


260it [00:07, 33.07it/s]                                                                                               

Detected frames:  256


260it [00:08, 31.74it/s]


Saving results in E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\videos...
Saving csv poses!
The videos are analyzed. Now your research can truly start! 
 You can create labeled videos with 'create_labeled_video'.
If the tracking is not satisfactory for some videos, consider expanding the training set. You can use the function 'extract_outlier_frames' to extract any outlier frames!


'DeepCut_resnet50_ReachingAug30shuffle1_300'

*NOTE: Yes, this is slow on a CPU (a GPU is MUCH faster)... see https://www.biorxiv.org/content/early/2018/10/30/457242 if you are interested!*
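The numbers reported in the log above are related by simple arithmetic: the video duration is the frame count divided by the frame rate, and a rough analysis time is the frame count divided by the processing throughput. The following sketch works through this; the function names are hypothetical.

```python
def video_duration_seconds(n_frames, fps):
    """Duration of a video given its frame count and frame rate."""
    return n_frames / fps


def analysis_time_seconds(n_frames, frames_processed_per_second):
    """Rough wall-clock estimate: frames divided by processing throughput."""
    return n_frames / frames_processed_per_second


# Values taken from the log above: 256 frames at 30.0 fps, about 33 it/s.
print(round(video_duration_seconds(256, 30.0), 2))
print(round(analysis_time_seconds(256, 33.07), 1))
```

The same estimate with a much lower throughput shows why analysis on a CPU takes minutes rather than seconds.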

## Create labeled video

This function is for visualization purposes and can be used to create a video in .mp4 format with the predicted labels. This video is saved in the same directory where the (unlabeled) video resides.

Various parameters can be set with regard to the colormap and the dotsize (matplotlib is used in the backend). See the config.yaml file for how to set these.

In [9]:
deeplabcut.create_labeled_video(path_config_file,[videofile_path], draw_skeleton=True)

Starting %  E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\videos ['E:\\Users\\Phil\\DeepLabCut\\examples\\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi']
Loading  E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi and data.
False 0 832 0 747
256
Duration of video [s]:  8.53 , recorded with  30.0 fps!
Overall # of frames:  256 with cropped frame dimensions:  832 747
Generating frames and creating video.


100%|███████████████████████████████████████████████████████████████████████████████| 256/256 [00:01<00:00, 149.32it/s]


## Plot the trajectories of the analyzed videos
This function plots the trajectories of all the body parts across the entire video. Each body part is identified by a unique color. The underlying functions can easily be customized.

In [10]:
%matplotlib notebook
deeplabcut.plot_trajectories(path_config_file,[videofile_path],showfigures=True)

# These plots are interactive and can be customized (see https://matplotlib.org/)

E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi
Starting %  E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\videos ['E:\\Users\\Phil\\DeepLabCut\\examples\\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi']
E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30\videos  already exists!
Loading  E:\Users\Phil\DeepLabCut\examples\Reaching-Mackenzie-2018-08-30/videos/reachingvideo1.avi and data.


Plots created! Please check the directory "plot-poses" within the video directory


## Extract outlier frames, where the predictions are off.

This optional step allows you to add more training data when the evaluation results are poor. In such a case, the user can use the following function to extract frames where the labels are incorrectly predicted. Make sure to provide the correct value of the "iterations", as it will be used to create the unique directory where the extracted frames will be saved.

In [None]:
deeplabcut.extract_outlier_frames(path_config_file, [videofile_path], outlieralgorithm='uncertain', p_bound=.2)

In [None]:
# Note, if you have questions on parameters, remember "?" gives you answers:
# i.e. deeplabcut.extract_outlier_frames?

The user can run this iteratively, and (even) extract additional frames from the same video.
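The idea behind the `'uncertain'` criterion with `p_bound` can be sketched as follows: a frame becomes a candidate when the likelihood of at least one body part falls below the bound. This is a simplified illustration with made-up values and a hypothetical function name, covering only the candidate selection; it is not DeepLabCut's implementation.

```python
def uncertain_frames(likelihoods, p_bound=0.2):
    """Return indices of frames where any body part falls below p_bound.

    likelihoods: one list per frame, holding the likelihood of each body part.
    """
    return [
        index
        for index, frame in enumerate(likelihoods)
        if any(p < p_bound for p in frame)
    ]


# Made-up likelihoods for three frames and two body parts.
likelihoods = [[0.95, 0.90], [0.10, 0.85], [0.60, 0.15]]
print(uncertain_frames(likelihoods))
```

Raising `p_bound` flags more frames as candidates, while lowering it restricts the selection to the least confident predictions.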

## Manually correct labels

This step allows the user to correct the labels in the extracted frames. Navigate to the folder corresponding to the video from which the outlier frames were extracted (in this demo, 'reachingvideo1') and use the GUI as described in the protocol to update the labels.

In [None]:
#GUI pops up! 

%gui wx
deeplabcut.refine_labels(path_config_file)

In [None]:
# Now merge datasets (once you refined all frames)
deeplabcut.merge_datasets(path_config_file)

## Create a new iteration of training dataset, check it and train...

After refining the labels, append these frames to the original dataset to create a new iteration of the training dataset.

In [None]:
# Perhaps plot the labels to see how all the frames are annotated (including the refined ones)
deeplabcut.check_labels(path_config_file)
# if they are off, you can load them in the labeling_gui to adjust!

In [None]:
deeplabcut.create_training_dataset(path_config_file)

Now one can train the network again... (with the expanded data set)

In [None]:
deeplabcut.train_network(path_config_file)