# DeepLabCut Toolbox - Colab
https://github.com/AlexEMG/DeepLabCut

This notebook illustrates how to use the cloud to:
- create a training set
- train a network
- evaluate a network
- create simple quality check plots
- analyze novel videos!

### This notebook assumes you already have a project folder with labeled data!

This notebook demonstrates the necessary steps to use DeepLabCut for your own project.

It shows the simplest code for each step; many of the functions have additional options, so please check out the overview and the protocol paper!

Nath\*, Mathis\* et al.: Using DeepLabCut for markerless pose estimation during behavior across species. Nature Protocols, 2019.


Paper: https://www.nature.com/articles/s41596-019-0176-0

Pre-print: https://www.biorxiv.org/content/biorxiv/early/2018/11/24/476531.full.pdf


## First, go to "Runtime" -> "Change runtime type" -> select "Python3", and then select "GPU"
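After switching the runtime, it can be worth confirming that a GPU is actually attached before installing anything. The following is a minimal sketch, assuming the `nvidia-smi` tool is on the PATH of GPU runtimes; the helper name `gpu_visible` is made up for this example and is not part of DeepLabCut.

```python
import shutil
import subprocess


def gpu_visible():
    """Return True if nvidia-smi is installed and lists at least one GPU."""
    exe = shutil.which("nvidia-smi")
    if exe is None:
        # Tool not found: most likely a CPU-only runtime.
        return False
    try:
        # "nvidia-smi -L" prints one line per GPU, e.g. "GPU 0: ...".
        result = subprocess.run(
            [exe, "-L"], capture_output=True, text=True, timeout=30
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0 and "GPU" in result.stdout


print("GPU visible:", gpu_visible())
```

If this prints `False`, re-check the runtime type before starting a long training run.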


In [2]:
#(this will take a few minutes to install all the dependencies!)
!pip install deeplabcut

Collecting deeplabcut
  Downloading https://files.pythonhosted.org/packages/04/fc/b5774e22a3eeaac1e1ca5670aa2281a5c408c9abc25f51e53e3f2525aebd/deeplabcut-2.1.10.4-py3-none-any.whl (695kB)
     |████████████████████████████████| 696kB 4.2MB/s
Collecting opencv-python-headless~=3.4.9.33
  Downloading https://files.pythonhosted.org/packages/67/1c/5544e626593158c6a23599f40193464121526e45aa470001a8113e45d9b8/opencv_python_headless-3.4.9.33-cp37-cp37m-manylinux1_x86_64.whl (21.6MB)
     |████████████████████████████████| 21.6MB 138kB/s
Collecting scikit-image>=0.17
  Downloading https://files.pythonhosted.org/packages/cd/d9/d738fdb4954575fb631b9c7a2eaa59df3ab8b7f3c2fc28a37259a42d8a49/scikit_image-0.18.2-cp37-cp37m-manylinux1_x86_64.whl (29.2MB)
     |████████████████████████████████| 29.2MB 79kB/s
Collecting bayesian-optimization
  Downloading https://files.pythonhosted.org/packages/bb/7a/fd8059a3881d3ab37ac8f72f56b73937a14e8bb14a9733e68cc8b17dbe3c/bayesia

**(Be sure to click "RESTART RUNTIME" if it is displayed above before moving on!)**

In [1]:
# Select TensorFlow 1.x (the magic only switches the major version, so 1.13.1 is read as 1.x):
%tensorflow_version 1.13.1

`%tensorflow_version` only switches the major version: 1.x or 2.x.
You set: `1.13.1`. This will be interpreted as: `1.x`.


TensorFlow 1.x selected.


## Link your Google Drive (with your labeled data, or the demo data):

### First, place your project folder into your Google Drive, i.e. move the folder named "Project-YourName-TheDate" into Google Drive.

In [2]:
#Now, let's link to your Google Drive. Run this cell and follow the authorization instructions:
#(We recommend putting a copy of the GitHub repo in your Google Drive if you are using the demo "examples")

from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive


YOU WILL NEED TO EDIT THE PROJECT PATH **in the config.yaml file** TO BE SET TO YOUR GOOGLE DRIVE LINK!

Typically, this will be: /content/drive/My Drive/yourProjectFolderName
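The edit can be made by hand in any text editor. As an alternative, here is a minimal sketch that rewrites only the `project_path:` line and leaves every other line of the file untouched. It assumes `project_path` is a top-level key written on a single line; the helper name `set_project_path` and the folder name in the example call are placeholders for illustration.

```python
from pathlib import Path


def set_project_path(config_file, new_project_path):
    """Replace the top-level `project_path:` entry of a config.yaml file."""
    config_file = Path(config_file)
    lines = config_file.read_text().splitlines()
    replaced = False
    for i, line in enumerate(lines):
        # Only a key that starts at column 0 is treated as the project path.
        if line.startswith("project_path:"):
            lines[i] = "project_path: " + str(new_project_path)
            replaced = True
    if not replaced:
        raise KeyError("no project_path entry in " + str(config_file))
    config_file.write_text("\n".join(lines) + "\n")


# Example call (placeholder folder name; run it after mounting Drive):
# set_project_path(
#     "/content/drive/My Drive/yourProjectFolderName/config.yaml",
#     "/content/drive/My Drive/yourProjectFolderName",
# )
```

Editing the line as text, rather than loading and re-dumping the YAML, keeps the comments and key order of the original file.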


In [3]:
#Setup your project variables:
# PLEASE EDIT THESE:
  
ProjectFolderName = 'ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12'
VideoType = 'mp4' 

#don't edit these:
videofile_path = ['/content/drive/My Drive/'+ProjectFolderName+'/videos/'] #Enter the list of videos or folder to analyze.
videofile_path

['/content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos/']
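Before going further, it can help to confirm that the folder exists on the mounted Drive and actually contains videos of the chosen type. A minimal sketch using only the standard library follows; `list_videos` is a made-up helper, not a DeepLabCut function.

```python
from pathlib import Path


def list_videos(folder, video_type):
    """Return the sorted files in `folder` whose extension matches `video_type`."""
    folder = Path(folder)
    if not folder.is_dir():
        raise FileNotFoundError("video folder not found: " + str(folder))
    wanted = "." + video_type.lower().lstrip(".")
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == wanted
    )


# Example call with the variables defined in the cell above:
# list_videos(videofile_path[0], VideoType)
```

An empty list usually means the project folder name or the video type is misspelled.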

In [4]:
#GUIs don't work on the cloud, so label your data locally on your computer! This will suppress the GUI support
import os
os.environ["DLClight"]="True"

In [5]:
import deeplabcut

DLC loaded in light mode; you cannot use any GUI (labeling, relabeling and standalone GUI)


In [6]:
deeplabcut.__version__

'2.1.10.4'

In [7]:
#This creates a path variable that links to your google drive copy
#No need to edit this, as you set it up before: 
path_config_file = '/content/drive/My Drive/'+ProjectFolderName+'/config.yaml'
path_config_file

'/content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/config.yaml'

## Create a training dataset:
### You must do this step inside of Colab:
After running this script, the training dataset is created and saved in the project directory under the subdirectory **'training-datasets'**.

This function also creates new subdirectories under **dlc-models** and appends to the project's config.yaml file the correct paths to the training and testing pose configuration files. These files hold the parameters for training the network. An example of such a file is provided with the toolbox and is named **pose_cfg.yaml**.

Now it is time to start training the network!
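Once `create_training_dataset` has run, the generated pose configuration files can be located with a short search. This is a sketch under the assumption that they are named `pose_cfg.yaml` and sit somewhere below `dlc-models`, as described above; `find_pose_configs` is a hypothetical helper.

```python
from pathlib import Path


def find_pose_configs(project_dir):
    """Return every pose_cfg.yaml below <project_dir>/dlc-models, sorted."""
    models_dir = Path(project_dir) / "dlc-models"
    if not models_dir.is_dir():
        # Nothing has been created yet.
        return []
    return sorted(models_dir.rglob("pose_cfg.yaml"))


# Example call with the project folder used in this notebook:
# find_pose_configs('/content/drive/My Drive/' + ProjectFolderName)
```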

In [9]:
# Note: if you are using the demo data (i.e. examples/Reaching-Mackenzie-2018-08-30/), first delete the folder called dlc-models!
#Then, run this cell. There are many more options you can set here, including which network to use!
#Check the docstring for the full list of options!
deeplabcut.create_training_dataset(path_config_file, net_type='resnet_50', augmenter_type='imgaug')

Downloading a ImageNet-pretrained model from http://download.tensorflow.org/models/resnet_v1_50_2016_08_28.tar.gz....
The training dataset is successfully created. Use the function 'train_network' to start training. Happy training!


[(0.95,
  1,
  (array([ 3,  9,  4, 10, 12,  5,  6,  2, 11, 14,  8,  7, 18, 13, 17,  0, 16,
          19, 15]), array([1])))]
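The value printed above can be read as one entry per shuffle. The sketch below unpacks it under the assumption that each entry is `(training fraction, shuffle number, (training indices, test indices))`; that reading is inferred from the output and should be checked against the docstring. The arrays are copied from the output and written as plain lists.

```python
# Return value copied from the cell output above (arrays written as lists).
splits = [
    (0.95, 1, ([3, 9, 4, 10, 12, 5, 6, 2, 11, 14, 8, 7, 18, 13, 17, 0, 16,
                19, 15], [1])),
]

summary = []
for train_fraction, shuffle, (train_idx, test_idx) in splits:
    total = len(train_idx) + len(test_idx)
    # A frame should never appear in both the training and the test set.
    overlap = set(train_idx) & set(test_idx)
    summary.append((shuffle, len(train_idx), len(test_idx), len(overlap)))
    print(
        "shuffle", shuffle, ":", len(train_idx), "training and",
        len(test_idx), "test frames out of", total,
        "(requested fraction", train_fraction, ")",
    )
```

With so few test frames, the test error reported during evaluation rests on very little data, so it is worth reading with caution.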

## Start training:
This function trains the network for a specific shuffle of the training dataset. 

In [None]:
#let's also change the display and save_iters just in case Colab takes away the GPU... 
#if that happens, you can reload from a saved point. Typically, you want to train to 200,000 + iterations.
#more info and there are more things you can set: https://github.com/AlexEMG/DeepLabCut/blob/master/docs/functionDetails.md#g-train-the-network

deeplabcut.train_network(path_config_file, shuffle=1, displayiters=10,saveiters=500)

#this will run until you stop it (CTRL+C), or hit the "STOP" icon, or when it hits the end (default, 1.03M iterations).
#Whichever you choose, you will see what looks like an error message, but it's not an error - don't worry....

Config:
{'all_joints': [[0], [1], [2], [3], [4]],
 'all_joints_names': ['orelha', 'nariz', 'queixo', 'cervical', 'ombro'],
 'alpha_r': 0.02,
 'batch_size': 1,
 'clahe': True,
 'claheratio': 0.1,
 'crop_pad': 0,
 'cropratio': 0.4,
 'dataset': 'training-datasets/iteration-0/UnaugmentedDataSet_Ic-posturajul12/Ic-postura_analise-cervical95shuffle1.mat',
 'dataset_type': 'imgaug',
 'decay_steps': 30000,
 'deterministic': False,
 'display_iters': 1000,
 'edge': False,
 'emboss': {'alpha': [0.0, 1.0], 'embossratio': 0.1, 'strength': [0.5, 1.5]},
 'fg_fraction': 0.25,
 'global_scale': 0.8,
 'histeq': True,
 'histeqratio': 0.1,
 'init_weights': '/usr/local/lib/python3.7/dist-packages/deeplabcut/pose_estimation_tensorflow/models/pretrained/resnet_v1_50.ckpt',
 'intermediate_supervision': False,
 'intermediate_supervision_layer': 12,
 'location_refinement': True,
 'locref_huber_loss': True,
 'locref_loss_weight': 0.05,
 'locref_stdev': 7.2801,
 'log_dir': 'log',
 'lr_init': 0.0005,
 'max_input_si

Selecting single-animal trainer
Starting with imgaug pose-dataset loader (=default).
Batch Size is 1
Initializing ResNet
Loading ImageNet-pretrained resnet_50
Display_iters overwritten as 10
Save_iters overwritten as 500
Training parameter:
{'stride': 8.0, 'weigh_part_predictions': False, 'weigh_negatives': False, 'fg_fraction': 0.25, 'mean_pixel': [123.68, 116.779, 103.939], 'shuffle': True, 'snapshot_prefix': '/content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/dlc-models/iteration-0/Ic-posturajul12-trainset95shuffle1/train/snapshot', 'log_dir': 'log', 'global_scale': 0.8, 'location_refinement': True, 'locref_stdev': 7.2801, 'locref_loss_weight': 0.05, 'locref_huber_loss': True, 'optimizer': 'sgd', 'intermediate_supervision': False, 'intermediate_supervision_layer': 12, 'regularize': False, 'weight_decay': 0.0001, 'crop_pad': 0, 'scoremap_dir': 'test', 'batch_size': 1, 'dataset_type': 'imgaug', 'deterministic': False, 'mirror': False, 'pairwise_huber_lo

Streaming output truncated to the last 5000 lines.
iteration: 135680 loss: 0.0004 lr: 0.02
iteration: 135690 loss: 0.0007 lr: 0.02
iteration: 135700 loss: 0.0003 lr: 0.02
iteration: 135710 loss: 0.0006 lr: 0.02
iteration: 135720 loss: 0.0005 lr: 0.02
iteration: 135730 loss: 0.0003 lr: 0.02
iteration: 135740 loss: 0.0008 lr: 0.02
iteration: 135750 loss: 0.0005 lr: 0.02
iteration: 135760 loss: 0.0006 lr: 0.02
iteration: 135770 loss: 0.0005 lr: 0.02
iteration: 135780 loss: 0.0006 lr: 0.02
iteration: 135790 loss: 0.0005 lr: 0.02
iteration: 135800 loss: 0.0006 lr: 0.02
iteration: 135810 loss: 0.0003 lr: 0.02
iteration: 135820 loss: 0.0005 lr: 0.02
iteration: 135830 loss: 0.0005 lr: 0.02
iteration: 135840 loss: 0.0003 lr: 0.02
iteration: 135850 loss: 0.0007 lr: 0.02
iteration: 135860 loss: 0.0008 lr: 0.02
iteration: 135870 loss: 0.0004 lr: 0.02
iteration: 135880 loss: 0.0005 lr: 0.02
iteration: 135890 loss: 0.0006 lr: 0.02
iteration: 135900 loss: 0.0005 lr: 0.02
iterati

**When you hit "STOP" you will get a KeyboardInterrupt "error"! No worries! :)**
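The progress lines printed during training have a regular shape (`iteration: N loss: X lr: Y`), so they are easy to turn into numbers if you copy them out of the cell output. A minimal sketch follows; `parse_training_log` is a made-up helper, and the sample text is taken from the output above.

```python
import re

# Matches lines such as "iteration: 135680 loss: 0.0004 lr: 0.02".
LOG_LINE = re.compile(r"iteration: (\d+) loss: ([\d.eE+-]+) lr: ([\d.eE+-]+)")


def parse_training_log(text):
    """Return a list of (iteration, loss, learning_rate) tuples found in `text`."""
    return [
        (int(iteration), float(loss), float(lr))
        for iteration, loss, lr in LOG_LINE.findall(text)
    ]


sample = """iteration: 135680 loss: 0.0004 lr: 0.02
iteration: 135690 loss: 0.0007 lr: 0.02
iteration: 135700 loss: 0.0003 lr: 0.02"""

records = parse_training_log(sample)
mean_loss = sum(loss for _, loss, _ in records) / len(records)
print(len(records), "records, mean loss:", mean_loss)
```

Averaging the loss over a window of lines gives a steadier picture than any single line, since the per-line values jump around.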

## Start evaluating:
This function evaluates a trained model for a specific shuffle (or several shuffles), either at a particular snapshot or at all snapshots, on the data set (images),
and stores the results as a .csv file in a subdirectory under **evaluation-results**.
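After evaluation has run, the stored results can be read back without opening Drive by hand. The sketch below collects every `.csv` file below `evaluation-results` and loads its rows. The exact file names and column headers are not assumed here, and `read_evaluation_csvs` is a hypothetical helper.

```python
import csv
from pathlib import Path


def read_evaluation_csvs(project_dir):
    """Map each CSV below <project_dir>/evaluation-results to its rows."""
    root = Path(project_dir) / "evaluation-results"
    tables = {}
    if not root.is_dir():
        return tables
    for csv_path in sorted(root.rglob("*.csv")):
        with open(csv_path, newline="") as handle:
            # Each row becomes a dict keyed by the header line of the file.
            tables[csv_path] = list(csv.DictReader(handle))
    return tables


# Example call with the project folder used in this notebook:
# for path, rows in read_evaluation_csvs('/content/drive/My Drive/' + ProjectFolderName).items():
#     print(path.name, len(rows), "rows")
```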

In [8]:
%matplotlib notebook
deeplabcut.evaluate_network(path_config_file,plotting=True)

# Here you want to see a low pixel error! Of course, it can only be as good as the labeler, 
#so be sure your labels are good! (And you have trained enough ;)

Running  DLC_resnet50_Ic-posturajul12shuffle1_186500  with # of trainingiterations: 186500
Initializing ResNet


0it [00:00, ?it/s]

Analyzing data...


20it [00:23,  1.16s/it]
  0%|          | 0/20 [00:00<?, ?it/s]

Done and results stored for snapshot:  snapshot-186500
Results for 186500  training iterations: 95 1 train error: 2.36 pixels. Test error: 3.78  pixels.
With pcutoff of 0.6  train error: 2.36 pixels. Test error: 3.78 pixels
Thereby, the errors are given by the average distances between the labels by DLC and the scorer.
Plotting...

100%|██████████| 20/20 [00:06<00:00,  3.19it/s]

The network is evaluated and the results are stored in the subdirectory 'evaluation_results'.
If it generalizes well, choose the best model for prediction and update the config file with the appropriate index for the 'snapshotindex'.
Use the function 'analyze_video' to make predictions on new videos.
Otherwise consider retraining the network (see DeepLabCut workflow Fig 2)
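To pick a specific snapshot, you need its position among the saved snapshots. The sketch below rests on two assumptions that should be checked against the DeepLabCut documentation: that checkpoints are saved in the train folder as `snapshot-<iteration>.index` files, and that `snapshotindex` counts snapshots in order of increasing iteration, starting at 0. Both helper names are made up for illustration.

```python
import re
from pathlib import Path

SNAPSHOT_NAME = re.compile(r"snapshot-(\d+)\.index$")


def list_snapshot_iterations(train_dir):
    """Return the sorted iteration numbers of the snapshots in `train_dir`."""
    iterations = set()
    for path in Path(train_dir).iterdir():
        match = SNAPSHOT_NAME.match(path.name)
        if match:
            iterations.add(int(match.group(1)))
    return sorted(iterations)


def snapshot_index(train_dir, iteration):
    """Return the position of `iteration` in the sorted snapshot list."""
    # Raises ValueError if no snapshot exists for that iteration.
    return list_snapshot_iterations(train_dir).index(iteration)


# Example call (the train folder path depends on your project):
# snapshot_index(".../dlc-models/iteration-0/<model-folder>/train", 186500)
```

Sorting by the parsed number rather than by file name matters, because `snapshot-1000` would otherwise sort before `snapshot-500`.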





## There is an optional refinement step you can do outside of Colab:
- If your pixel errors are not low enough, please check out the protocol guide on how to refine your network!
- You will need to adjust the labels **outside of Colab!** We recommend coming back here afterwards to train and analyze videos.
- Please see the repo and protocol instructions on how to refine your data!

## Start Analyzing videos: 
This function analyzes the new video. The user can choose the best model from the evaluation results and specify the correct snapshot index for the variable **snapshotindex** in the **config.yaml** file. Otherwise, by default the most recent snapshot is used to analyze the video.

The results are stored in an HDF5 (.h5) file in the same directory where the video resides.
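Once the analysis has finished, the result files can be loaded for your own processing. This is a minimal sketch: it assumes the results carry the `.h5` extension and that pandas with HDF5 support (PyTables) is installed, as it is alongside DeepLabCut. The helper names are made up, and the column layout mentioned in the comment should be checked on your own file.

```python
from pathlib import Path


def find_prediction_files(video_dir):
    """Return the sorted .h5 files found directly inside `video_dir`."""
    return sorted(Path(video_dir).glob("*.h5"))


def load_predictions(h5_path):
    """Load one result file into a pandas DataFrame."""
    # Imported here so that the file search above works without pandas.
    import pandas as pd
    return pd.read_hdf(h5_path)


# Example call with the variables defined earlier in this notebook:
# files = find_prediction_files(videofile_path[0])
# df = load_predictions(files[0])
# df.head()  # expect x, y and likelihood values per body part (check on your file)
```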

In [9]:
deeplabcut.analyze_videos(path_config_file,videofile_path, videotype=VideoType)

Using snapshot-186500 for model /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/dlc-models/iteration-0/Ic-posturajul12-trainset95shuffle1
Initializing ResNet
Analyzing all the videos in the directory...
Starting to analyze %  /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos/postura.mp4
/content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos  already exists!
Loading  /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos/postura.mp4


  0%|          | 0/5124 [00:00<?, ?it/s]

Duration of video [s]:  170.78 , recorded with  30.0 fps!
Overall # of frames:  5124  found with (before cropping) frame dimensions:  640 352
Starting to extract posture


5151it [03:23, 25.29it/s]

Saving results in /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos...
The videos are analyzed. Now your research can truly start! 
 You can create labeled videos with 'create_labeled_video'
If the tracking is not satisfactory for some videos, consider expanding the training set. You can use the function 'extract_outlier_frames' to extract a few representative outlier frames.





'DLC_resnet50_Ic-posturajul12shuffle1_186500'

## Plot the trajectories of the analyzed videos:
This function plots the trajectories of all the body parts across the entire video. Each body part is identified by a unique color.

In [10]:
deeplabcut.plot_trajectories(path_config_file,videofile_path, videotype=VideoType)

Analyzing all the videos in the directory...
Loading  /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos/postura.mp4 and data.

Plots created! Please check the directory "plot-poses" within the video directory


Now you can look in the plot-poses folder and check "plot-likelihood.png". You might want to change the "pcutoff" in the config.yaml file so that only high-confidence points are plotted in the video, e.g. ~0.8 or 0.9. Check the value currently set in your config.yaml first (the evaluation output above reports a pcutoff of 0.6 for this project).
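To get a feel for what a higher cutoff does, you can count how many predictions would survive it. The sketch below uses made-up likelihood values and a strict `>` comparison; whether DeepLabCut itself compares with `>` or `>=` is not checked here, and `fraction_above_cutoff` is a hypothetical helper.

```python
def fraction_above_cutoff(likelihoods, pcutoff):
    """Return the fraction of likelihood values strictly above `pcutoff`."""
    if not likelihoods:
        raise ValueError("no likelihood values given")
    kept = sum(1 for value in likelihoods if value > pcutoff)
    return kept / len(likelihoods)


# Made-up likelihoods for one body part over five frames.
example = [0.99, 0.95, 0.85, 0.5, 0.3]
for cutoff in (0.4, 0.8, 0.9):
    print("cutoff", cutoff, "keeps", fraction_above_cutoff(example, cutoff))
```

If a cutoff of 0.9 discards a large share of the frames for a body part, that body part probably needs more or better labels rather than a stricter threshold.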

## Create labeled video:
This function is for visualization purposes and can be used to create a video in .mp4 format with the labels predicted by the network. The video is saved in the same directory as the original video.

In [11]:
deeplabcut.create_labeled_video(path_config_file,videofile_path, videotype=VideoType)

Analyzing all the videos in the directory...
/content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos  already exists!
Starting to process video: /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos/postura.mp4
Loading /content/drive/My Drive/ic-postura-cervical/Ic-postura-analise-cervical-2021-07-12/videos/postura.mp4 and data.
Duration of video [s]: 170.78, recorded with 30.0 fps!
Overall # of frames: 5124 with cropped frame dimensions: 640 352
Generating frames and creating video.


100%|██████████| 5124/5124 [00:20<00:00, 249.73it/s]
