Multi-view Masked Contrastive Representation Learning for Endoscopic Video Analysis

This repository provides the official PyTorch implementation of the paper "Multi-view Masked Contrastive Representation Learning for Endoscopic Video Analysis".

Installation

Install the required packages with the provided environment.yaml:

cd MMCRL
conda env create -f environment.yaml
conda activate MMCRL
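
After activating the environment, a quick import check can confirm that the key packages resolved. The snippet below is a minimal sketch, not part of the repository: the helper name `missing_packages` is hypothetical, and the package list is an assumption that should be adjusted to match environment.yaml.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names from `names` that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    # Assumed package list (not taken from environment.yaml); edit as needed.
    required = ["torch", "torchvision"]
    missing = missing_packages(required)
    print("missing:", missing if missing else "none")
```

If the script reports missing packages, re-creating the conda environment is usually the simplest fix.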

Data Preparation

We use the datasets provided by Endo-FM and are grateful for their valuable work.

Weights

Pre-trained weights:

Downstream weights:

Pre-training

cd MMCRL
wget -P checkpoints/ https://github.com/kahnchana/svt/releases/download/v1.0/kinetics400_vitb_ssl.pth
bash scripts/pretrain.sh
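
Because pre-training starts from the downloaded checkpoint, it can help to confirm that the file is in place before launching the script. The following is a small sketch under the assumption that the file lands at checkpoints/kinetics400_vitb_ssl.pth, as the wget command above implies. `checkpoint_ready` is a hypothetical helper, not a function from this repository, and it only checks that the file exists and is non-empty, not that its contents are valid.

```python
from pathlib import Path


def checkpoint_ready(path, min_bytes=1):
    """Return True if `path` is a regular file of at least `min_bytes` bytes."""
    p = Path(path)
    return p.is_file() and p.stat().st_size >= min_bytes


if __name__ == "__main__":
    # Path assumed from the wget command above; run this from inside MMCRL.
    ckpt = Path("checkpoints") / "kinetics400_vitb_ssl.pth"
    status = "found" if checkpoint_ready(ckpt) else "missing or empty"
    print(f"{ckpt}: {status}")
```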

Fine-tuning

# PolypDiag (Classification)
cd MMCRL
bash scripts/eval_finetune_polypdiag.sh

# CVC (Segmentation)
cd MMCRL/TransUNet
python train.py

# KUMC (Detection)
cd MMCRL/STFT
bash script/train_stft.sh
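
The three fine-tuning commands above can also be collected in one small launcher. This is only an illustrative sketch: `TASKS` and `run_task` are hypothetical names that do not exist in the repository, the working directories are relative to the directory that contains MMCRL, and the detection entry assumes the code lives in the STFT directory. By default the helper performs a dry run and only returns the command it would execute.

```python
import subprocess

# Working directory and command for each downstream task, copied from the
# commands above. Assumption: the detection code lives in MMCRL/STFT.
TASKS = {
    "polypdiag": ("MMCRL", ["bash", "scripts/eval_finetune_polypdiag.sh"]),
    "cvc": ("MMCRL/TransUNet", ["python", "train.py"]),
    "kumc": ("MMCRL/STFT", ["bash", "script/train_stft.sh"]),
}


def run_task(name, dry_run=True):
    """Describe (dry run) or execute the fine-tuning command for `name`."""
    cwd, cmd = TASKS[name]  # raises KeyError for an unknown task name
    if dry_run:
        return f"(cd {cwd} && {' '.join(cmd)})"
    subprocess.run(cmd, cwd=cwd, check=True)
    return None
```

For example, `run_task("cvc")` returns the command string for the segmentation task without running it, while `run_task("cvc", dry_run=False)` launches the training.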

Acknowledgement

Our code is based on Endo-FM, DINO, TimeSformer, SVT, TransUNet, and STFT. We thank the authors for releasing their code.

Citation

@article{hu2024one,
  title={Multi-view Masked Contrastive Representation Learning for Endoscopic Video Analysis},
  author={Hu, Kai and Xiao, Ye and Zhang, Yuan and Gao, Xieping},
  journal={Advances in Neural Information Processing Systems},
  year={2024}
}
