Skip to content

AutoGesture with Temporal Difference Convolutions (TIP'21)

License

Notifications You must be signed in to change notification settings

ZitongYu/3DCDC-NAS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AutoGesture with 3DCDC

Pytorch code for the TIP paper "Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition"

Welcome to plug and play 3DCDC in your networks

# -------- Vanilla ---------#
nn.Conv3d(3, 64, kernel_size=3, padding=1)

# -------- 3DCDC ---------#
from 3DCDC import CDC_ST, CDC_T, CDC_TR
CDC_ST(3, 64, kernel_size=3, padding=1, theta=0.6)
CDC_T(3, 64, kernel_size=3, padding=1, theta=0.6)
CDC_TR(3, 64, kernel_size=3, padding=1, theta=0.3)

Citation

If you find our project useful in your research, please consider citing:

@article{yu2021searching,
  title={Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition},
  author={Yu, Zitong and Zhou, Benjia and Wan, Jun and Wang, Pichao and Chen, Haoyu and Liu, Xin and Li, Stan Z and Zhao, Guoying},
  journal={IEEE Transactions on Image Processing (TIP)},
  year={2021}
}

Pretrained model on IsoGD

You can download the checkpoints from google drive

Visualization


Figure 1: The searched architecture from (a) the first stage NAS, and (b) the second stage NAS. The three rows in (a) represent the searched cell structure in the low, mid, and high frame branches, respectively.

Figure 2: Features visualization from C3D assembled with varied convolutions on the IsoGD dataset. With (a) RGB and (b) Depth modality inputs, the four rows represent the neural activation with 3D vanilla convolution, 3D-CDC-ST, 3D-CDC-T, and 3D-CDC-TR, respectively.

About

AutoGesture with Temporal Difference Convolutions (TIP'21)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published