[multimodal]Upgrade object detection backend to mmdet 3.0 #3188

FANGAreNotGnu · 2023-05-02T00:03:18Z

Change MMDetection backend to 3.0.

Changes:

Internal data structures
Pipeline logics
Yolox Configs
Mosaic/mixup tranform logics
Multigpu support logics
Basically everything

Benefits:

Able to use latest work like DINO, RTMDet, Detic, etc.
Have better data structure now (DataContainer in 2.0 is not as good), and also aligns better with open vocabulary detection's needs.
Will be stable and do not need big refactors for a long period.

Future works:

Benchmarking
Update Presets
Update examples and tutorials
Refine tests

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

review-notebook-app · 2023-05-03T20:27:38Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

tonyhoo · 2023-05-05T18:49:51Z

Thanks for updating the version. Can you provide some benchmark results and attach them in the description? Thanks

github-actions · 2023-05-11T07:34:27Z

Job PR-3188-7cd1dc2 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/7cd1dc2/index.html

FANGAreNotGnu · 2023-05-11T19:46:13Z

Thanks for updating the version. Can you provide some benchmark results and attach them in the description? Thanks

Just completed the basic changes. I'll run a small benchmark after working on a PR in autogluon-bench. And will do a complete benchmarking after upgrading presets to more recent models (RTMDet/DINO).

github-actions · 2023-05-11T23:04:40Z

Job PR-3188-50bedc8 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/50bedc8/index.html

github-actions · 2023-05-13T00:58:29Z

Job PR-3188-70feb9d is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/70feb9d/index.html

github-actions · 2023-05-25T22:14:55Z

Job PR-3188-78157d1 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/78157d1/index.html

github-actions · 2023-05-25T22:24:08Z

Job PR-3188-0635abd is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/0635abd/index.html

CI/docker/full_install_image.sh

...utorials/multimodal/object_detection/finetune/detection_high_performance_finetune_coco.ipynb

multimodal/setup.py

multimodal/src/autogluon/multimodal/configs/pretrain/detection/yolox/yolox_l_8xb8-300e_coco.py

multimodal/src/autogluon/multimodal/data/process_mmlab/process_mmdet.py

zhiqiangdon · 2023-05-25T22:04:09Z

multimodal/src/autogluon/multimodal/models/mmdet_image.py

+            return rets
+        elif mode == "predict":
+            # for detailed data structure, see https://github.com/open-mmlab/mmdetection/blob/main/mmdet/structures/det_data_sample.py
+            return [{BBOX: ret.pred_instances, LABEL: ret.gt_instances} for ret in rets]


Why does predict mode require groundtruth labels?

model(mode="predict") is used for both predicting and evaluation. It's easier to include the ground truth here than merging ground truth with result in lit module / predictor. Here gt_instances will be None if there is no label available.

The evaluation is done after prediction. Why do we need labels in the prediction?

multimodal/src/autogluon/multimodal/predictor.py

multimodal/src/autogluon/multimodal/utils/environment.py

multimodal/src/autogluon/multimodal/configs/model/fusion_mlp_image_text_tabular.yaml

github-actions · 2023-05-25T22:57:54Z

Job PR-3188-0c266ee is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/0c266ee/index.html

github-actions · 2023-05-31T00:32:38Z

Job PR-3188-c00a1e1 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/c00a1e1/index.html

github-actions · 2023-05-31T01:29:19Z

Job PR-3188-f54327f is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/f54327f/index.html

multimodal/src/autogluon/multimodal/data/process_mmlab/process_mmlab_base.py

multimodal/src/autogluon/multimodal/utils/environment.py

multimodal/src/autogluon/multimodal/models/mmdet_image.py

github-actions · 2023-06-01T18:47:05Z

Job PR-3188-5990923 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/5990923/index.html

github-actions · 2023-06-01T20:59:17Z

Job PR-3188-e2ee022 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-3188/e2ee022/index.html

zhiqiangdon

LGTM. Thanks for the upgrade.

FANGAreNotGnu added 5 commits May 1, 2023 19:45

validation finished, loss to be modified

4834e09

basic code completed (train/val/test). yolov3 passed.

bd2a160

fix batch sizze

d4d4e0d

merge and modify test

69af2d1

change pip install version for mmdet and mmcv

1e54fb4

remove mmocr dependency since it's conflicts with mmcv 2.0

68e2e9a

FANGAreNotGnu added 4 commits May 5, 2023 20:40

fix yolov3 ckpt name

c51939b

Merge https://github.com/autogluon/autogluon into upgrade_mmdet_to3

d9887aa

fix output format for detection

3f94394

update yolox

a753d25

FANGAreNotGnu added the model list checked You have updated the model list after modifying multimodal unit tests/docs label May 10, 2023

FANGAreNotGnu added 3 commits May 10, 2023 22:16

black

24d926e

lint and model list

02e55a0

fixes

7cd1dc2

FANGAreNotGnu changed the title ~~[WIP][multimodal]Upgrade object detection backend to mmdet 3.0~~ [multimodal]Upgrade object detection backend to mmdet 3.0 May 11, 2023

fix docs

50bedc8

remove detection finetuning tutorials

70feb9d

FANGAreNotGnu requested review from zhiqiangdon, yongxinw and tonyhoo May 15, 2023 19:41

FANGAreNotGnu added 2 commits May 24, 2023 21:45

fix loss by adding sum to loss tensor

71f429f

merge

d7e99b5

FANGAreNotGnu force-pushed the upgrade_mmdet_to3 branch from 282961f to d7e99b5 Compare May 24, 2023 21:52

fix typo

61f8f6d

FANGAreNotGnu added 3 commits May 25, 2023 20:04

remove deprecated codes in mmcv.py

f43099d

lint

51ca022

../../../multimodal/src/autogluon/multimodal/models/mmdet_image.py

0c266ee

zhiqiangdon reviewed May 25, 2023

View reviewed changes

FANGAreNotGnu added 7 commits May 30, 2023 20:08

solve comments

c7dea99

solve comments on error msg

f63d754

lint

38b565c

fix circular import

91a1050

fix circular import

bd85591

fix circular import

c00a1e1

fix typo

f54327f

FANGAreNotGnu requested a review from zhiqiangdon May 30, 2023 22:56

zhiqiangdon reviewed May 31, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/data/process_mmlab/process_mmlab_base.py Outdated Show resolved Hide resolved

zhiqiangdon reviewed May 31, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/data/process_mmlab/process_mmlab_base.py Outdated Show resolved Hide resolved

zhiqiangdon reviewed May 31, 2023

View reviewed changes

FANGAreNotGnu added 6 commits May 31, 2023 20:53

refine mmlab package warning logic

6d2abd2

lint

517d048

refactor mmlab err msg

7aa6cd8

update msg for mmocr

275afdf

fix typo

5990923

Merge https://github.com/autogluon/autogluon into upgrade_mmdet_to3

e2ee022

zhiqiangdon approved these changes Jun 1, 2023

View reviewed changes

FANGAreNotGnu merged commit cccad2f into autogluon:master Jun 1, 2023
29 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[multimodal]Upgrade object detection backend to mmdet 3.0 #3188

[multimodal]Upgrade object detection backend to mmdet 3.0 #3188

FANGAreNotGnu commented May 2, 2023 •

edited

review-notebook-app bot commented May 3, 2023

tonyhoo commented May 5, 2023

github-actions bot commented May 11, 2023

FANGAreNotGnu commented May 11, 2023

github-actions bot commented May 11, 2023

github-actions bot commented May 13, 2023

github-actions bot commented May 25, 2023

github-actions bot commented May 25, 2023

zhiqiangdon May 25, 2023

FANGAreNotGnu May 30, 2023 •

edited

zhiqiangdon May 31, 2023

github-actions bot commented May 25, 2023

github-actions bot commented May 31, 2023

github-actions bot commented May 31, 2023

github-actions bot commented Jun 1, 2023

github-actions bot commented Jun 1, 2023

zhiqiangdon left a comment

[multimodal]Upgrade object detection backend to mmdet 3.0 #3188

[multimodal]Upgrade object detection backend to mmdet 3.0 #3188

Conversation

FANGAreNotGnu commented May 2, 2023 • edited

review-notebook-app bot commented May 3, 2023

tonyhoo commented May 5, 2023

github-actions bot commented May 11, 2023

FANGAreNotGnu commented May 11, 2023

github-actions bot commented May 11, 2023

github-actions bot commented May 13, 2023

github-actions bot commented May 25, 2023

github-actions bot commented May 25, 2023

zhiqiangdon May 25, 2023

Choose a reason for hiding this comment

FANGAreNotGnu May 30, 2023 • edited

Choose a reason for hiding this comment

zhiqiangdon May 31, 2023

Choose a reason for hiding this comment

github-actions bot commented May 25, 2023

github-actions bot commented May 31, 2023

github-actions bot commented May 31, 2023

github-actions bot commented Jun 1, 2023

github-actions bot commented Jun 1, 2023

zhiqiangdon left a comment

Choose a reason for hiding this comment

FANGAreNotGnu commented May 2, 2023 •

edited

FANGAreNotGnu May 30, 2023 •

edited