<a href="https://colab.research.google.com/github/levin-mutai/challanges/blob/main/Copy_of_Copy_of_latest_Colab_TrainNetwork_VideoAnalysis.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# DeepLabCut Toolbox - Colab for standard (single animal) projects!
https://github.com/DeepLabCut/DeepLabCut

This notebook illustrates how to use the cloud to:
- create a training set
- train a network
- evaluate a network
- create simple quality check plots
- analyze novel videos!

### This notebook assumes you already have a project folder with labeled data!

This notebook demonstrates the necessary steps to use DeepLabCut for your own project.

This shows the simplest code for each step, but many of the functions have additional options, so please check out the overview and the protocol paper!

Nath\*, Mathis\* et al.: Using DeepLabCut for markerless pose estimation during behavior across species. Nature Protocols, 2019.


Paper: https://www.nature.com/articles/s41596-019-0176-0

Pre-print: https://www.biorxiv.org/content/biorxiv/early/2018/11/24/476531.full.pdf


## First, go to "Runtime" -> "Change runtime type" -> select "Python 3", and then select "GPU"
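Once the runtime has been switched, it can be worth confirming that a GPU is actually visible before installing anything. The following is a minimal sketch, assuming the NVIDIA driver tools (`nvidia-smi`) are present on the Colab machine; the helper name `gpu_is_visible` is made up for this example and is not part of DeepLabCut.

```python
import shutil
import subprocess


def gpu_is_visible() -> bool:
    """Return True if `nvidia-smi -L` runs successfully and lists a GPU."""
    # Assumption: a GPU runtime ships the nvidia-smi command-line tool.
    if shutil.which("nvidia-smi") is None:
        return False
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"], capture_output=True, text=True, timeout=30
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0 and "GPU" in result.stdout


print("GPU visible:", gpu_is_visible())
```

If this prints `False`, the runtime type is probably still set to CPU.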


In [1]:
#(this will take a few minutes to install all the dependencies!)
!pip install deeplabcut

Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Collecting deeplabcut
  Downloading deeplabcut-2.2.3-py3-none-any.whl (593 kB)
     |████████████████████████████████| 593 kB 6.2 MB/s
Collecting ruamel.yaml>=0.15.0
  Downloading ruamel.yaml-0.17.21-py3-none-any.whl (109 kB)
     |████████████████████████████████| 109 kB 52.2 MB/s
Collecting matplotlib>=3.3
  Downloading matplotlib-3.5.3-cp37-cp37m-manylinux_2_5_x86_64.manylinux1_x86_64.whl (11.2 MB)
     |████████████████████████████████| 11.2 MB 64.7 MB/s
Collecting filterpy>=1.4.4
  Downloading filterpy-1.4.5.zip (177 kB)
     |████████████████████████████████| 177 kB 69.7 MB/s
Collecting tensorpack>=0.11
  Downloading tensorpack-0.11-py2.py3-none-any.whl (296 kB)
     |████████████████████████████████| 296 kB 22.5 MB/s
Collecting tf-slim>=1.1.0
  Downloading tf_slim-1.1.0-py2.py3-none-any.whl (352 kB)
     |████████████████████████████████| 352 kB 68.3 MB

**(Be sure to click "RESTART RUNTIME" if it is displayed above before moving on!)**

## Link your Google Drive (with your labeled data, or the demo data):

### First, place your project folder into your Google Drive, i.e. move the folder named "Project-YourName-TheDate" into Google Drive.

In [1]:
#Now, let's link to your Google Drive. Run this cell and follow the authorization instructions:
#(We recommend putting a copy of the GitHub repo in your Google Drive if you are using the demo "examples")

from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive


YOU WILL NEED TO EDIT THE PROJECT PATH **in the config.yaml file** SO THAT IT POINTS TO YOUR GOOGLE DRIVE FOLDER!

Typically, this will be: /content/drive/My Drive/yourProjectFolderName
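If you would rather make this edit from a cell than by hand, here is a minimal sketch. It assumes the path is stored on a single top-level line starting with `project_path:` in config.yaml; the helper name `set_project_path` is hypothetical and not a DeepLabCut function.

```python
from pathlib import Path


def set_project_path(config_file: str, new_project_path: str) -> bool:
    """Rewrite the top-level `project_path:` line; return True if a line changed."""
    path = Path(config_file)
    lines = path.read_text().splitlines(keepends=True)
    changed = False
    for i, line in enumerate(lines):
        # Assumption: the key sits at the start of a line, unindented.
        if line.startswith("project_path:"):
            lines[i] = f"project_path: {new_project_path}\n"
            changed = True
    if changed:
        path.write_text("".join(lines))
    return changed


# Example call (uncomment after mounting Drive and setting your folder name):
# set_project_path('/content/drive/My Drive/yourProjectFolderName/config.yaml',
#                  '/content/drive/My Drive/yourProjectFolderName')
```

The function leaves every other line of the file untouched, so comments and formatting in config.yaml are preserved.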


In [12]:
#Set up your project variables:
# PLEASE EDIT THESE:

ProjectFolderName = 'rhino-levin'
VideoType = 'mp4' 

#don't edit these:
videofile_path = ['/content/drive/My Drive/'+ProjectFolderName+'/videos/'] #Enter the list of videos or folder to analyze.
videofile_path

['/content/drive/My Drive/rhino-levin/videos/']
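Before going further, it can help to confirm that the videos folder really contains files of the chosen type. This is a small sketch using only the standard library; `list_videos` is a hypothetical helper, not part of DeepLabCut.

```python
from pathlib import Path


def list_videos(folder: str, video_type: str) -> list:
    """Return sorted paths of files in `folder` whose extension matches `video_type`."""
    suffix = "." + video_type.lstrip(".").lower()
    return sorted(
        str(p)
        for p in Path(folder).glob("*")
        if p.is_file() and p.suffix.lower() == suffix
    )


# Example call with the variables defined in the cell above:
# list_videos(videofile_path[0], VideoType)
```

An empty list usually means the project folder name or the video extension is wrong.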

In [3]:
import deeplabcut

Loading DLC 2.2.3...
DLC loaded in light mode; you cannot use any GUI (labeling, relabeling and standalone GUI)


In [None]:
deeplabcut.__version__

'2.2.3'

In [4]:
#This creates a path variable that links to your google drive copy
#No need to edit this, as you set it up before: 
path_config_file = '/content/drive/My Drive/'+ProjectFolderName+'/config.yaml'
path_config_file

'/content/drive/My Drive/rhino-levin/config.yaml'

## Create a training dataset:
### You must do this step inside of Colab:
After running this cell, the training dataset is created and saved in the project directory under the subdirectory **'training-datasets'**.

This function also creates new subdirectories under **dlc-models** and adds the paths of the training and testing pose configuration files to the project's config.yaml file. These files hold the parameters for training the network. An example of such a file is provided with the toolbox and is named **pose_cfg.yaml**.

Now it is time to start training the network!

In [5]:
# Note: if you are using the demo data (i.e. examples/Reaching-Mackenzie-2018-08-30/), first delete the folder called dlc-models!
# Then, run this cell. There are many more options you can set here, including which network to use!
# Check the docstring for the full list of options!
deeplabcut.create_training_dataset(path_config_file, net_type='resnet_50', augmenter_type='imgaug')

Downloading a ImageNet-pretrained model from http://download.tensorflow.org/models/resnet_v1_50_2016_08_28.tar.gz....
The training dataset is successfully created. Use the function 'train_network' to start training. Happy training!


[(0.95,
  1,
  (array([ 1,  6,  8,  9, 14,  4,  2, 13, 10,  7, 11,  3,  0,  5]),
   array([12])))]
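The returned value above reads as (training fraction, shuffle index, (training image indices, test image indices)). With 15 labeled images and a training fraction of 0.95, that gives 14 training images and 1 test image. The sketch below reproduces that arithmetic with the standard library; it is an illustration only, not DeepLabCut's own splitting code, and `split_indices` and its `seed` parameter are made up for this example.

```python
import random


def split_indices(n_images: int, train_fraction: float, seed: int = 0):
    """Shuffle image indices and split them into (train, test) lists."""
    indices = list(range(n_images))
    random.Random(seed).shuffle(indices)
    # Truncate toward zero, so 15 * 0.95 = 14.25 becomes 14 training images.
    n_train = int(n_images * train_fraction)
    return indices[:n_train], indices[n_train:]


train, test = split_indices(15, 0.95)
print(len(train), len(test))  # prints "14 1"
```

With a single test image, the reported test error rests on very little data, which is worth keeping in mind when reading the evaluation results.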

## Start training:
This function trains the network for a specific shuffle of the training dataset. 

In [6]:
#Let's also set displayiters and saveiters, just in case Colab takes away the GPU...
#If that happens, you can reload from a saved point. Typically, you want to train for 200,000+ iterations.
#More info, including other parameters you can set: https://github.com/DeepLabCut/DeepLabCut/wiki/DOCSTRINGS#train_network

deeplabcut.train_network(path_config_file,shuffle=1, displayiters=500,saveiters=500,maxiters=60000)

#This will run until you stop it (CTRL+C or the "STOP" icon) or until it reaches maxiters (60,000 here; the default is 1.03M iterations).
#Whichever you choose, you will see what looks like an error message, but it's not an error - don't worry....

Config:
{'all_joints': [[0],
                [1],
                [2],
                [3],
                [4],
                [5],
                [6],
                [7],
                [8],
                [9],
                [10],
                [11],
                [12],
                [13],
                [14],
                [15],
                [16],
                [17],
                [18],
                [19],
                [20],
                [21],
                [22]],
 'all_joints_names': ['uppertail',
                      'lowertail',
                      'leftlowerbacklimb',
                      'leftbackjoint',
                      'leftbackupperlimb',
                      'lefthips',
                      'rightlowerbacklimb',
                      'rightbackjoint',
                      'rightupperbacklimb',
                      'righthips',
                      'backspine1',
                      'middle',
                      'frontspine',

Selecting single-animal trainer
Batch Size is 1




Loading ImageNet-pretrained resnet_50
Max_iters overwritten as 60000
Display_iters overwritten as 500
Save_iters overwritten as 500
Training parameter:
{'stride': 8.0, 'weigh_part_predictions': False, 'weigh_negatives': False, 'fg_fraction': 0.25, 'mean_pixel': [123.68, 116.779, 103.939], 'shuffle': True, 'snapshot_prefix': '/content/drive/My Drive/rhino-levin/dlc-models/iteration-0/rhinoNov25-trainset95shuffle1/train/snapshot', 'log_dir': 'log', 'global_scale': 0.8, 'location_refinement': True, 'locref_stdev': 7.2801, 'locref_loss_weight': 0.05, 'locref_huber_loss': True, 'optimizer': 'sgd', 'intermediate_supervision': False, 'intermediate_supervision_layer': 12, 'regularize': False, 'weight_decay': 0.0001, 'crop_pad': 0, 'scoremap_dir': 'test', 'batch_size': 1, 'dataset_type': 'imgaug', 'deterministic': False, 'mirror': False, 'pairwise_huber_loss': False, 'weigh_only_present_joints': False, 'partaffinityfield_predict': False, 'pairwise_predict': False, 'all_joints': [[0], [1], [2], 

iteration: 500 loss: 0.0484 lr: 0.005
iteration: 1000 loss: 0.0317 lr: 0.005
iteration: 1500 loss: 0.0299 lr: 0.005
iteration: 2000 loss: 0.0269 lr: 0.005
iteration: 2500 loss: 0.0254 lr: 0.005
iteration: 3000 loss: 0.0234 lr: 0.005
iteration: 3500 loss: 0.0216 lr: 0.005
iteration: 4000 loss: 0.0203 lr: 0.005
iteration: 4500 loss: 0.0187 lr: 0.005
iteration: 5000 loss: 0.0179 lr: 0.005
iteration: 5500 loss: 0.0170 lr: 0.005
iteration: 6000 loss: 0.0158 lr: 0.005
iteration: 6500 loss: 0.0153 lr: 0.005
iteration: 7000 loss: 0.0146 lr: 0.005
iteration: 7500 loss: 0.0142 lr: 0.005
iteration: 8000 loss: 0.0132 lr: 0.005
iteration: 8500 loss: 0.0129 lr: 0.005
iteration: 9000 loss: 0.0125 lr: 0.005
iteration: 9500 loss: 0.0120 lr: 0.005
iteration: 10000 loss: 0.0116 lr: 0.005
iteration: 10500 loss: 0.0145 lr: 0.02
iteration: 11000 loss: 0.0124 lr: 0.02
iteration: 11500 loss: 0.0113 lr: 0.02
iteration: 12000 loss: 0.0106 lr: 0.02
iteration: 12500 loss: 0.0099 lr: 0.02
iteration: 13000 loss: 0.

KeyboardInterrupt: ignored

**When you hit "STOP" you will get a KeyboardInterrupt "error"! No worries! :)**
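If training was interrupted, you will probably want to know which saved point is the most recent. The sketch below looks for the highest-numbered snapshot in a `train` folder. It assumes snapshots are written as `snapshot-<iteration>.index` files (alongside `.meta` and `.data` files), which matches the `snapshot_prefix` shown in the training output; `latest_snapshot` is a hypothetical helper, not a DeepLabCut function.

```python
import re
from pathlib import Path


def latest_snapshot(train_dir: str):
    """Return the path prefix of the highest-numbered snapshot, or None if there is none."""
    pattern = re.compile(r"^snapshot-(\d+)\.index$")
    best = None
    for p in Path(train_dir).glob("snapshot-*.index"):
        match = pattern.match(p.name)
        if match and (best is None or int(match.group(1)) > best):
            best = int(match.group(1))
    if best is None:
        return None
    return str(Path(train_dir) / f"snapshot-{best}")


# Example call (the folder comes from the snapshot_prefix in the training output):
# latest_snapshot('/content/drive/My Drive/rhino-levin/dlc-models/iteration-0/'
#                 'rhinoNov25-trainset95shuffle1/train')
```

As far as we understand the DeepLabCut documentation, resuming is done by setting `init_weights` in the `train/pose_cfg.yaml` file to this prefix (without a file extension) before calling `train_network` again; please check the docs for your version.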

## Start evaluating:
This function evaluates a trained model for one or more shuffles, either at a particular training state (snapshot) or at all saved states, on the data set (images),
and stores the results as a .csv file in a subdirectory under **evaluation-results**.

In [7]:
%matplotlib notebook
deeplabcut.evaluate_network(path_config_file,plotting=True)

# Here you want to see a low pixel error! Of course, it can only be as good as the labeler,
# so be sure your labels are good (and that you have trained enough ;)

Config:
{'all_joints': [[0],
                [1],
                [2],
                [3],
                [4],
                [5],
                [6],
                [7],
                [8],
                [9],
                [10],
                [11],
                [12],
                [13],
                [14],
                [15],
                [16],
                [17],
                [18],
                [19],
                [20],
                [21],
                [22]],
 'all_joints_names': ['uppertail',
                      'lowertail',
                      'leftlowerbacklimb',
                      'leftbackjoint',
                      'leftbackupperlimb',
                      'lefthips',
                      'rightlowerbacklimb',
                      'rightbackjoint',
                      'rightupperbacklimb',
                      'righthips',
                      'backspine1',
                      'middle',
                      'frontspine',

Running  DLC_resnet50_rhinoNov25shuffle1_56000  with # of training iterations: 56000
Running evaluation ...


15it [00:01,  7.73it/s]

Analysis is done and the results are stored (see evaluation-results) for snapshot:  snapshot-56000
Results for 56000  training iterations: 95 1 train error: 3.72 pixels. Test error: 28.6  pixels.
With pcutoff of 0.6  train error: 3.72 pixels. Test error: 28.6 pixels
Thereby, the errors are given by the average distances between the labels by DLC and the scorer.
Plotting...






100%|██████████| 15/15 [00:08<00:00,  1.79it/s]

The network is evaluated and the results are stored in the subdirectory 'evaluation_results'.
Please check the results, then choose the best model (snapshot) for prediction. You can update the config.yaml file with the appropriate index for the 'snapshotindex'.
Use the function 'analyze_video' to make predictions on new videos.
Otherwise, consider adding more labeled-data and retraining the network (see DeepLabCut workflow Fig 2, Nath 2019)
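The train and test errors above are described as the average distance between the labels from DLC and from the scorer. The sketch below shows that calculation for a handful of points, assuming the distance is Euclidean and that body parts without a human label are skipped; `mean_pixel_error` and the sample coordinates are made up for illustration and are not DeepLabCut's implementation.

```python
import math


def mean_pixel_error(labels, predictions):
    """Average Euclidean distance in pixels; entries with a missing label (None) are skipped."""
    distances = []
    for label, prediction in zip(labels, predictions):
        if label is None:
            continue
        distances.append(
            math.hypot(label[0] - prediction[0], label[1] - prediction[1])
        )
    if not distances:
        raise ValueError("no labeled points to compare")
    return sum(distances) / len(distances)


human = [(10.0, 10.0), (50.0, 40.0), None]
network = [(13.0, 14.0), (50.0, 40.0), (5.0, 5.0)]
print(mean_pixel_error(human, network))  # prints 2.5
```

A large gap between train and test error, like the 3.72 versus 28.6 pixels reported here, can be a hint that more labeled frames or more training are needed.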





## There is an optional refinement step you can do outside of Colab:
- If your pixel errors are not low enough, please check out the protocol guide on how to refine your network!
- You will need to adjust the labels **outside of Colab!** We recommend then coming back to Colab to train and analyze videos...
- Please see the repo and protocol instructions on how to refine your data!

## Start Analyzing videos: 
This function analyzes new videos. You can choose the best model from the evaluation results and set the matching snapshot index in the variable **snapshotindex** in the **config.yaml** file. Otherwise, by default, the most recent snapshot is used to analyze the videos.

The results are stored in an HDF5 (.h5) file in the same directory where the video resides.

In [14]:
deeplabcut.analyze_videos(path_config_file,videofile_path, videotype=VideoType)

Config:
{'all_joints': [[0],
                [1],
                [2],
                [3],
                [4],
                [5],
                [6],
                [7],
                [8],
                [9],
                [10],
                [11],
                [12],
                [13],
                [14],
                [15],
                [16],
                [17],
                [18],
                [19],
                [20],
                [21],
                [22]],
 'all_joints_names': ['uppertail',
                      'lowertail',
                      'leftlowerbacklimb',
                      'leftbackjoint',
                      'leftbackupperlimb',
                      'lefthips',
                      'rightlowerbacklimb',
                      'rightbackjoint',
                      'rightupperbacklimb',
                      'righthips',
                      'backspine1',
                      'middle',
                      'frontspine',

Using snapshot-56000 for model /content/drive/My Drive/rhino-levin/dlc-models/iteration-0/rhinoNov25-trainset95shuffle1
Analyzing all the videos in the directory...
Starting to analyze %  /content/drive/My Drive/rhino-levin/videos/kifaru.mp4
Loading  /content/drive/My Drive/rhino-levin/videos/kifaru.mp4
Duration of video [s]:  14.1 , recorded with  30.0 fps!
Overall # of frames:  423  found with (before cropping) frame dimensions:  640 352
Starting to extract posture


 99%|█████████▉| 420/423 [00:09<00:00, 46.40it/s]

Saving results in /content/drive/My Drive/rhino-levin/videos...
The videos are analyzed. Now your research can truly start! 
 You can create labeled videos with 'create_labeled_video'
If the tracking is not satisfactory for some videos, consider expanding the training set. You can use the function 'extract_outlier_frames' to extract a few representative outlier frames.





'DLC_resnet50_rhinoNov25shuffle1_56000'
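If you prefer a plain-text copy of the results, `analyze_videos` also accepts `save_as_csv=True`, which writes a .csv file next to the .h5 file. The sketch below reads such a file with the standard library. It assumes the layout has three header rows (scorer, bodyparts, coords) followed by one row per frame; `read_dlc_csv` and the sample data are made up for illustration.

```python
import csv
import io


def read_dlc_csv(handle):
    """Parse a DLC-style CSV into {bodypart: [(x, y, likelihood), ...]}."""
    reader = csv.reader(handle)
    next(reader)                   # row 1: scorer name, repeated per column
    bodyparts = next(reader)[1:]   # row 2: body part name for each column
    coords = next(reader)[1:]      # row 3: x, y or likelihood for each column
    tracks = {}
    for row in reader:
        if not row:
            continue
        frame = {}
        # Skip the first cell (frame index) and group values by body part.
        for part, coord, value in zip(bodyparts, coords, row[1:]):
            frame.setdefault(part, {})[coord] = float(value)
        for part, point in frame.items():
            tracks.setdefault(part, []).append(
                (point["x"], point["y"], point["likelihood"])
            )
    return tracks


sample = io.StringIO(
    "scorer,net,net,net\n"
    "bodyparts,uppertail,uppertail,uppertail\n"
    "coords,x,y,likelihood\n"
    "0,101.5,50.0,0.98\n"
    "1,102.0,51.5,0.35\n"
)
print(read_dlc_csv(sample)["uppertail"])
# prints [(101.5, 50.0, 0.98), (102.0, 51.5, 0.35)]
```

To use it on real output, open the generated .csv file and pass the file handle instead of the in-memory sample.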

## Plot the trajectories of the analyzed videos:
This function plots the trajectories of all the body parts across the entire video. Each body part is identified by a unique color.

In [15]:
deeplabcut.plot_trajectories(path_config_file,videofile_path, videotype=VideoType)

Analyzing all the videos in the directory...
Loading  /content/drive/My Drive/rhino-levin/videos/kifaru.mp4 and data.



Plots created! Please check the directory "plot-poses" within the video directory


Now you can look in the plot-poses folder and check "plot-likelihood.png". You might want to change "pcutoff" in the config.yaml file so that only high-confidence points are plotted in the video, e.g. ~0.8 or 0.9. The evaluation output above reports a pcutoff of 0.6 for this project.
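To get a feel for how a given cutoff would thin out the points, you can count how many likelihood values survive each candidate threshold. This is a minimal sketch with made-up likelihood values; `fraction_above` is a hypothetical helper, and whether DeepLabCut itself uses a strict or non-strict comparison is an assumption here.

```python
def fraction_above(likelihoods, pcutoff):
    """Return the fraction of likelihood values that are at least `pcutoff`."""
    if not likelihoods:
        return 0.0
    kept = sum(1 for p in likelihoods if p >= pcutoff)
    return kept / len(likelihoods)


# Made-up likelihoods for one body part across five frames.
likelihoods = [0.99, 0.95, 0.85, 0.62, 0.30]
for cutoff in (0.4, 0.6, 0.8, 0.9):
    print(cutoff, fraction_above(likelihoods, cutoff))
```

A cutoff that discards most frames for a body part suggests that the body part is poorly tracked rather than that the cutoff is too strict.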

## Create labeled video:
This function is for visualization purposes and can be used to create a video in .mp4 format with the labels predicted by the network. This video is saved in the same directory where the original video resides.

In [16]:
deeplabcut.create_labeled_video(path_config_file,videofile_path, videotype=VideoType)

Analyzing all the videos in the directory...
Starting to process video: /content/drive/My Drive/rhino-levin/videos/kifaru.mp4
Loading /content/drive/My Drive/rhino-levin/videos/kifaru.mp4 and data.
Duration of video [s]: 14.1, recorded with 30.0 fps!
Overall # of frames: 423 with cropped frame dimensions: 640 352
Generating frames and creating video.


100%|██████████| 423/423 [00:02<00:00, 168.37it/s]
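To locate the new files afterwards, you can list the labeled videos in the video folder. This sketch assumes the output file names end in `_labeled.mp4`, which is the naming we have seen with DeepLabCut 2.2 but may differ in other versions; `find_labeled_videos` is a hypothetical helper.

```python
from pathlib import Path


def find_labeled_videos(video_dir: str) -> list:
    """Return sorted paths of files in `video_dir` ending in `_labeled.mp4`."""
    # Assumption: labeled videos carry the `_labeled.mp4` suffix.
    return sorted(str(p) for p in Path(video_dir).glob("*_labeled.mp4"))


# Example call with the variable defined earlier in this notebook:
# find_labeled_videos(videofile_path[0])
```

You can then download the listed files from Google Drive to view them.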
