PitVQA++: Vector Matrix-Low-Rank Adaptation for Open-Ended Visual Question Answering in Pituitary Surgery
[arXiv] | [Paper] | [Colab Demo]

(Links will be updated later.)
Our Open-ended PitVQA dataset comprises 25 videos of endoscopic pituitary surgery from The National Hospital for Neurology and Neurosurgery in London, United Kingdom. All videos were annotated for surgical phases, steps, instruments present, and operation notes, guided by a standardised annotation framework derived from a preceding international consensus study on pituitary surgery workflow. Annotation was performed collaboratively by two neurosurgical residents with operative pituitary experience and checked by an attending neurosurgeon.
We extracted image frames from each video at 1 fps and removed any frames that were blurred or occluded. This yielded a total of 101,803 frames, with the shortest and longest videos contributing 2,443 and 7,179 frames, respectively. We then generated frame-wise question-answer pairs for all annotation types. In total, there are 745,972 question-answer pairs across the 101,803 frames, an average of roughly 7.3 pairs per frame.
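As a rough illustration of this preprocessing step, the sketch below samples frames at 1 fps with OpenCV and drops low-sharpness frames using a variance-of-Laplacian blur score. The threshold value and the blur heuristic are our own assumptions for illustration, not the exact criteria used to build the dataset (occlusion filtering is not shown).

```python
import cv2

def extract_frames(video_path, out_dir, target_fps=1.0, blur_threshold=100.0):
    """Sample frames at ~target_fps and skip blurred ones.

    Blur is scored as the variance of the Laplacian; the threshold
    here is an illustrative assumption, not the dataset's criterion.
    """
    cap = cv2.VideoCapture(video_path)
    native_fps = cap.get(cv2.CAP_PROP_FPS)
    step = max(int(round(native_fps / target_fps)), 1)
    idx, kept = 0, 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % step == 0:
            gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
            if cv2.Laplacian(gray, cv2.CV_64F).var() >= blur_threshold:
                cv2.imwrite(f"{out_dir}/frame_{kept:05d}.png", frame)
                kept += 1
        idx += 1
    cap.release()
    return kept
```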
This work is an extension of our previous work, PitVQA. The Open-ended PitVQA dataset uses the same videos and frames as the PitVQA dataset. We will release the QA pairs for the Open-ended PitVQA dataset soon.
The dataset is derived from the close-ended PitVQA dataset (classification-style VQA) introduced in our previous MICCAI work, which can be found below:
Please download the full PitVQA dataset from the UCL RDR portal.
The original videos were obtained and preprocessed from the MICCAI PitVis challenge.
The dataset split for training and validation is as follows:
```python
train_seq = ['01', '03', '04', '05', '07', '08', '09', '10', '11', '14', '15', '16', '17', '18', '19', '20', '21', '22', '23', '25']
val_seq = ['02', '06', '12', '13', '24']
```
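A minimal sketch of how this split could be applied when assembling frame lists. The `frames/<video_id>/` directory layout is an assumption for illustration; adjust it to the released data structure.

```python
from pathlib import Path

train_seq = ['01', '03', '04', '05', '07', '08', '09', '10',
             '11', '14', '15', '16', '17', '18', '19', '20',
             '21', '22', '23', '25']
val_seq = ['02', '06', '12', '13', '24']

def frames_for(seqs, root='frames'):
    # Hypothetical layout: frames/<video_id>/*.png
    return sorted(p for s in seqs for p in Path(root, s).glob('*.png'))

train_frames = frames_for(train_seq)
val_frames = frames_for(val_seq)
```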
For the EndoVis18-VQA dataset:

```bash
python main.py --dataset=endo --epochs=50 --batch_size=64 --lr=0.0000002 --seq_length=64 \
    --mora_base_rank=8 \
    --mora_coeff 26 26 24 24 22 22 20 20 18 18 16 16 \
    --lora_rank 18 18 16 16 14 14 12 12 10 10 8 8 \
    --lora_alpha 18 18 16 16 14 14 12 12 10 10 8 8
```
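Note that `--mora_coeff`, `--lora_rank`, and `--lora_alpha` each take one value per adapted layer (12 here). As a hedged sketch, these list-valued flags could be declared with `argparse` roughly as below; the actual definitions live in the repository and may differ.

```python
import argparse

# Illustrative flag declarations; mirrors the commands above, not
# necessarily the repository's exact argument parser.
parser = argparse.ArgumentParser()
parser.add_argument('--dataset', choices=['endo', 'pit'], default='endo')
parser.add_argument('--epochs', type=int, default=50)
parser.add_argument('--batch_size', type=int, default=64)
parser.add_argument('--lr', type=float, default=2e-7)
parser.add_argument('--seq_length', type=int, default=64)
parser.add_argument('--mora_base_rank', type=int, default=8)
# One value per adapted transformer layer (12 in the commands above):
parser.add_argument('--mora_coeff', type=int, nargs='+')
parser.add_argument('--lora_rank', type=int, nargs='+')
parser.add_argument('--lora_alpha', type=int, nargs='+')
args = parser.parse_args()
```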
For the Open-ended PitVQA dataset:

```bash
python main.py --dataset=pit --epochs=50 --batch_size=64 --lr=0.0000002 --seq_length=100 \
    --mora_base_rank=8 \
    --mora_coeff 56 56 48 48 40 40 32 32 24 24 16 16 \
    --lora_rank 28 28 24 24 20 20 16 16 12 12 8 8 \
    --lora_alpha 28 28 24 24 20 20 16 16 12 12 8 8
```
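To make the per-layer rank/alpha vectors concrete, here is a minimal LoRA-style adapter sketch in PyTorch. It shows only the standard low-rank branch scaled by `alpha / rank`; the MoRA square-matrix branch that PitVQA++ combines with it is omitted, and the class and parameter names are illustrative rather than the repository's API.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA adapter around a frozen linear layer (sketch only)."""

    def __init__(self, base: nn.Linear, rank: int, alpha: int):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only the adapter is trained
        # A is initialised with small random values, B with zeros, so the
        # adapted layer starts out identical to the frozen base layer.
        self.lora_A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.lora_A.T @ self.lora_B.T)

# Per-layer ranks/alphas would mirror the --lora_rank / --lora_alpha vectors
# above, giving earlier layers larger-rank updates than later ones.
```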
The implementation of PitVQA++ builds on resources from BLIP, Hugging Face Transformers, timm, and our previous work PitVQA. We thank the original authors for open-sourcing their work.