Skip to content

fmthoker/skeleton-contrast

Repository files navigation

Skeleton-Contrastive 3D Action Representation Learning

arch

This repository contains the implementation of our ACM MultiMedia 2021 paper:

  • inter-skeleton contrastive pretraining
  • 3D action recognition and retrieval downstream

Link:

[PDF] [Arxiv]

Requirements

 conda create -n  skeleton_contrast python=3.7 anaconda
 conda install pytorch==1.2.0 torchvision==0.4.0 -c pytorch
 pip install tensorboard

Data prepreprocessing instructions

cd data_gen
python ntu_gendata.py

Pretrain Instruction

  • e.g Inter-skeleton contrast using seq-based and graph-based representation on cross-view NTU-60
CUDA_VISIBLE_DEVICES=0,1,2,3 python  main_moco_inter_skeleton.py \
  --lr 0.01 \
  --batch-size 64 \
  --mlp --moco-t 0.07   --moco-k 16384  --checkpoint-path ./checkpoints/ntu_60_cross_view/interskeleton_seq_based_graph_based  --schedule 351  --epochs 451  --pre-dataset ntu60 --skeleton-representation seq-based_and_graph-based --protocol cross_view

Downstream Instruction

  • 3D Action Classification (Linear evaluation )
  • e.g Train a linear classifier on seq-based query encoder features pretrained via inter-skeleton contrast
CUDA_VISIBLE_DEVICES=0,1,2,3 python action_classification.py \
  --lr 0.1 \
  --batch-size 64 \
 --pretrained  ./checkpoints/ntu_60_cross_view/interskeleton_seq_based_graph_based/checkpoint_0450.pth.tar \
  --finetune-dataset ntu60 --protocol cross_view --pretrain-skeleton-representation seq-based_and_graph-based  --finetune-skeleton-representation seq-based
  • 3D Action Retrieval (KNN evaluation)
  • Use a Knn classifier on seq-based query encoder features pretrained via inter-skeleton contrast
 CUDA_VISIBLE_DEVICES=0,1,2,3 python action_retrieval.py \
  --lr 0.1 \
  --batch-size 64 \
  --knn-neighbours 1 \
  --pretrained  ./checkpoints/ntu_60_cross_view/interskeleton_seq_based_graph_based/checkpoint_0450.pth.tar \
 --finetune-dataset ntu60 --protocol cross_view --pretrain-skeleton-representation seq-based_and_graph-based  --finetune-skeleton-representation seq-based

Pretrained Models

Results

  • 3D Action Classification Results arch

  • 3D Action Retrieval Results arch

Citation

If you use our work or code, kindly consider citing our paper:

@InProceedings{fmthoker_acmmm,
  author    = {Fida Mohammad Thoker, Hazel Doughty and Cees Snoek},
  title     = {Skeleton-Contrastive 3D Action Representation Learning},
  booktitle = {in Proceedings of the 29th ACM International Conference on Multimedia,(ACM MM), 2021},
  year      = {2021},
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages