BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos

Implementation from the paper:

Jennifer J. Sun*, Lili Karashchuk*, Amil Dravid*, Serim Ryou, Sonia Fereidooni, John C. Tuthill, Aggelos Katsaggelos, Bingni W. Brunton, Georgia Gkioxari, Ann Kennedy, Yisong Yue, Pietro Perona, BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos. In Conference on Computer Vision and Pattern Recognition (CVPR), 2023

BKinD-3D discovers keypoints in 3D from multi-view behavioral videos without any keypoint or bounding box supervision in 2D or 3D. It works on a range of organisms including rats and humans. The discovered keypoints and additional shape features are directly applicable to downstream tasks including behavior classification and pose regression!

Set up environment

We provided an environment file vol3d_env.yml for running our train & evaluation code

How to set up data for human experiments

Download Human 3.6M dataset: http://vision.imar.ro/human3.6m/description.php (Ask for permission to authors)
Extract frames from video to data/H36M
- Our implementation uses this code (https://github.com/anibali/h36m-fetch) for frame extraction
- Dataloader for h36m has to be updated if you extract frames using different code.
Run command

python train_video_multiview.py --config config/H36M.yaml

This will take 1~2 days on a single GPU.

Evaluation

Once you are done with training keypoint discovery model, evaluate 3D pose regression task on the Human 3.6M dataset using code in the evaluation folder
Run command

python evaluation/evaluate.py --config config/H36M.yaml

This will take 1~2 hours on a single GPU.

License

Please refer to our paper for details and consider citing it if you find the code useful:

@article{bkind-3d_2023,
  title = {{{BKinD-3D}}: {{Self-Supervised 3D Keypoint Discovery}} from {{Multi-View Videos}}},
  author = {Sun, Jennifer J. and Karashchuk, Lili and Dravid, Amil and Ryou, Serim and Fereidooni, Sonia and Tuthill, John and Katsaggelos, Aggelos and Brunton, Bingni W. and Gkioxari, Georgia and Kennedy, Ann and Yue, Yisong and Perona, Pietro},
  year = {2023},
  journal={arXiv preprint arXiv:2112.05121},
  doi = {10.48550/arXiv.2212.07401}
}

Our code is available under the Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
classifier		classifier
config		config
dataloader		dataloader
evaluation		evaluation
figs		figs
loss		loss
model		model
regressor		regressor
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
convert_discovered_keypoints_to_classifier.py		convert_discovered_keypoints_to_classifier.py
extract_features.py		extract_features.py
test_simplehuman36m.py		test_simplehuman36m.py
train_video_multiview.py		train_video_multiview.py
vol3d_env.yml		vol3d_env.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos

Set up environment

How to set up data for human experiments

Evaluation

License

About

Releases

Packages

Contributors 4

Languages

License

neuroethology/BKinD-3D

Folders and files

Latest commit

History

Repository files navigation

BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos

Set up environment

How to set up data for human experiments

Evaluation

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages