Adaptive Visual Scene Understanding: Incremental Learning in Scene Graph Generation

Authors: Naitik Khandelwal, Xiao Liu, Mengmi Zhang

This repository contains the implementation of our CSEGG benchmark. It includes the source code for running Transformer-based SGG methods with several continual learning algorithms in every learning scenario proposed in our paper, as well as the code that generates the data for each of those scenarios.

Project Description

This project focuses on scene graph generation (SGG): analyzing an image to detect its objects and the relationships between them. Because the visual world changes over time, AI systems need to detect new objects and relate them to objects they already know. However, the field of SGG lacks continual learning methodologies. To address this gap, we introduce the Continual ScenE Graph Generation (CSEGG) dataset, which includes three learning scenarios and six evaluation metrics. We investigate how existing SGG methods perform under continual learning, specifically how well they retain previously learned object entities and relationships while learning new ones. We also explore how continual object detection enhances generalization when classifying known relationships on unknown objects.
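To make the term concrete, a scene graph can be viewed as a set of (subject, relationship, object) triplets. The sketch below is only an illustration of that data structure: the `Triplet` class, the `build_scene_graph` helper, and the example labels are hypothetical and are not taken from the CSEGG code.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Triplet:
    """One edge of a scene graph: subject --relationship--> object."""
    subject: str
    relationship: str
    obj: str


def build_scene_graph(triplets):
    """Group triplets by subject into an adjacency mapping."""
    graph = defaultdict(list)
    for t in triplets:
        graph[t.subject].append((t.relationship, t.obj))
    return dict(graph)


# Hypothetical labels, chosen only for illustration.
triplets = [
    Triplet("man", "riding", "horse"),
    Triplet("man", "wearing", "hat"),
    Triplet("horse", "on", "grass"),
]
scene_graph = build_scene_graph(triplets)

for subject, edges in scene_graph.items():
    for relationship, obj in edges:
        print(f"{subject} --{relationship}--> {obj}")
```

In this picture, a continual learning setting adds new relationship labels, new object labels, or both over successive tasks, while the model is still expected to predict the triplets it learned earlier.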

Below is an illustration of all the learning scenarios in CSEGG:


Figure: CSEGG Learning Scenarios.

From left to right, the scenarios are: S1, relationship (Rel.) incremental learning (Incre.); S2, relationship and object (Rel. + Obj.) Incre.; and S3, relationship generalization (Rel. Gen.) in object Incre. In S1 and S2, example triplets from the training sets (solid lines) and test sets (dashed lines) of each task are shown. The training and test sets of the same task share a color. The objects or relationships that are new in each task are bold and underlined. In S3, a single test set (dashed gray box) is used to benchmark the relationship generalization ability of object-incremental learning models across all tasks.

Installation

Check INSTALL.md for installation instructions.

Dataset

Check DATASET.md for dataset preprocessing instructions.

Training and Evaluation

Understanding Args

Training:

--num-gpus : Number of GPUs used for training.
--start_task : Task from which to resume training.
--sgg : Activates Stage 2 in Learning Scenarios S2 and S3. (This argument is not present in Learning Scenario S1.)
--continual : Selects which CSEGG model to train.
- Learning Scenario S1: "replay_10", "ewc", "replay_100". To train "naive", omit this argument from the training command.
- Learning Scenario S2: "replay_10", "ewc", "replay_20". To train "naive", omit this argument from the training command.
- Learning Scenario S3: "replay_10". To train "naive", omit this argument from the training command.
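The rules above can be summarized in a small helper that assembles a training command and rejects combinations the list does not allow. This is a minimal sketch, not part of the repository: `CONTINUAL_CHOICES` and `build_train_command` are hypothetical names, and only the flag names, their values, and the `pods_train_S1` / `pods_train_S2` / `pods_train_S3` command names come from this README. `--start_task` is left out because its value format is not specified here.

```python
# Values copied from the argument list above; the names and structure
# of this helper are hypothetical.
CONTINUAL_CHOICES = {
    "S1": ("replay_10", "ewc", "replay_100"),
    "S2": ("replay_10", "ewc", "replay_20"),
    "S3": ("replay_10",),
}


def build_train_command(scenario, num_gpus, continual=None, sgg=False):
    """Return the training command as an argument list.

    continual=None corresponds to the "naive" model (argument omitted).
    """
    if scenario not in CONTINUAL_CHOICES:
        raise ValueError(f"unknown scenario: {scenario!r}")
    if continual is not None and continual not in CONTINUAL_CHOICES[scenario]:
        raise ValueError(f"{continual!r} is not listed for {scenario}")
    if sgg and scenario == "S1":
        raise ValueError("--sgg is not present in Learning Scenario S1")

    command = [f"pods_train_{scenario}", "--num-gpus", str(num_gpus)]
    if continual is not None:
        command += ["--continual", continual]
    if sgg:
        command += ["--sgg", "sgg"]
    return command


# Stage 2 of Scenario S2 with the "replay_10" model.
print(" ".join(build_train_command("S2", 4, continual="replay_10", sgg=True)))
# "naive" in Scenario S1: --continual is simply left out.
print(" ".join(build_train_command("S1", 4)))
```

Returning a list rather than a single string keeps each argument separate, which makes the result safe to pass to `subprocess.run` without shell quoting.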

Evaluation:

--num-gpus : Number of GPUs used for testing.

Learning Scenario S1

Learning Scenario S1 has only Stage 2 training. To train the model, run the following in the command window:

cd ~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer
pods_train_S1 --num-gpus 4 --continual "replay_10"

To evaluate, run:

cd ~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer
pods_test_S1 --num-gpus 1

Learning Scenario S2

To train the model, run the following in the command window:

#Stage 1
cd ~/CSEGG/playground/sgg/detr.res101.c5.multiscale.150e.bs16
pods_train_S2 --num-gpus 4 --continual "replay_10"

#Stage 2
cd ~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer
pods_train_S2 --num-gpus 4 --continual "replay_10" --sgg "sgg"

To evaluate, run:

cd ~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer
#Evaluation of Object Detection (Stage 1) and SGG (Stage 2) is combined
pods_test_S2 --num-gpus 1

Learning Scenario S3

To train the model, run the following in the command window:

#Stage 1
cd ~/CSEGG/playground/sgg/detr.res101.c5.multiscale.150e.bs16
pods_train_S3 --num-gpus 4 --continual "replay_10"

#Stage 2
cd ~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer
pods_train_S3 --num-gpus 4 --continual "replay_10" --sgg "sgg"

To evaluate, run:

#Evaluation of R_bbox and R@k_relation_gen
cd ~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer
pods_test_S3 --num-gpus 1
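For Learning Scenarios S2 and S3, the two training stages shown above are run in order, each from its own directory. The sketch below is one possible way to script that sequence, assuming the repository lives at `~/CSEGG` as in the commands above. `STAGE_DIRS`, `two_stage_plan`, and `run_plan` are hypothetical helpers and not part of CSEGG; the directories, command names, and flags are copied from this README. By default it only prints the commands.

```python
import os
import subprocess

# Directories copied from the S2/S3 training commands above.
STAGE_DIRS = {
    1: "~/CSEGG/playground/sgg/detr.res101.c5.multiscale.150e.bs16",
    2: "~/CSEGG/playground/sgg/detr.res101.c5.one_stage_rel_tfmer",
}


def two_stage_plan(scenario, num_gpus=4, continual="replay_10"):
    """Return [(working_dir, argv), ...] for Stage 1 followed by Stage 2."""
    if scenario not in ("S2", "S3"):
        raise ValueError("only S2 and S3 have two training stages")
    base = [f"pods_train_{scenario}", "--num-gpus", str(num_gpus)]
    if continual is not None:  # None means "naive": the flag is omitted
        base += ["--continual", continual]
    return [
        (STAGE_DIRS[1], list(base)),
        (STAGE_DIRS[2], base + ["--sgg", "sgg"]),
    ]


def run_plan(plan, dry_run=True):
    """Print each step; execute it only when dry_run is False."""
    for working_dir, argv in plan:
        print(f"cd {working_dir} && {' '.join(argv)}")
        if not dry_run:
            # check=True stops the sequence if a stage fails.
            subprocess.run(argv, cwd=os.path.expanduser(working_dir), check=True)


plan = two_stage_plan("S3")
run_plan(plan)  # dry run: prints the commands without executing them
```

Calling `run_plan(plan, dry_run=False)` would actually launch training, so it should only be used once installation and dataset preprocessing are complete.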

Acknowledgment

This repository borrows code from the following scene graph benchmarking frameworks: Scene Graph Benchmark, developed by KaihuaTang, and PySGG and SGTR, both developed by Rongjie Li.
