Data Augmentation for Video #1064

Closed
wenjun90 opened this issue Aug 2, 2021 · 14 comments

Comments

@wenjun90

wenjun90 commented Aug 2, 2021

Hello @dreamerlin,

Thank you and your team for your contribution.
Could I ask you a question about data augmentation, please?

How can we set up data augmentation in the config during training?

Thank you very much!

@dreamerlin
Collaborator

Ref: https://github.com/open-mmlab/mmaction2/blob/master/configs/recognition/tsn/tsn_fp16_r50_1x1x3_100e_kinetics400_rgb.py#L15-L31

You can set your data augmentation in the config like this.
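
For reference, the augmentation section of that file looks roughly like the sketch below (a paraphrase of the linked config, not a verbatim copy; the exact lines may shift between versions, so check the file itself):

    img_norm_cfg = dict(
        mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375], to_bgr=False)
    train_pipeline = [
        # TSN-style sampling: 3 segments, 1 frame each
        dict(type='SampleFrames', clip_len=1, frame_interval=1, num_clips=3),
        dict(type='RawFrameDecode'),
        dict(type='Resize', scale=(-1, 256)),
        # the augmentation steps: random multi-scale crop + horizontal flip
        dict(
            type='MultiScaleCrop',
            input_size=224,
            scales=(1, 0.875, 0.75, 0.66),
            random_crop=False,
            max_wh_scale_gap=1),
        dict(type='Resize', scale=(224, 224), keep_ratio=False),
        dict(type='Flip', flip_ratio=0.5),
        dict(type='Normalize', **img_norm_cfg),
        dict(type='FormatShape', input_format='NCHW'),
        dict(type='Collect', keys=['imgs', 'label'], meta_keys=[]),
        dict(type='ToTensor', keys=['imgs', 'label'])
    ]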

@dreamerlin
Collaborator

dreamerlin commented Aug 3, 2021

And if you find this repo helpful, you can give us a star :) !

@irvingzhang0512
Contributor

irvingzhang0512 commented Aug 3, 2021

There are 4 kinds of data augmentation for training:

  1. Native MMAction2 data augmentation pipelines here, such as flip/resize/crop/colorjitter...
  2. The third-party library Imgaug, such as RandAugment. Demos can be found in [Feature] Support Imgaug for augmentations in the data pipeline. #492 and [Improvement] Set RandAugment as Imgaug default transforms. #585
  3. The third-party library PytorchVideo, such as RandAugment/AugMix. Demos can be found in [Feature] Support Pytorchvideo Transforms #1008
  4. Mixup and Cutmix; demos can be found in [Feature] Support Mixup and Cutmix for Recognizers. #681 (a sketch follows this list)
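
To make option 4 concrete, here is a minimal sketch, assuming MMAction2 0.x config inheritance (the base config path and num_classes are placeholders): Mixup/Cutmix from #681 are enabled through the model's train_cfg as a batch "blending" op, not as a pipeline transform, while the Imgaug wrapper from #492/#585 is a pipeline step.

    # Mixup/Cutmix (PR #681): configured on the model, not in the data pipeline.
    _base_ = ['./tsm_r50_1x1x8_50e_sthv1_rgb.py']  # placeholder base config

    model = dict(
        train_cfg=dict(
            # or CutmixBlending with the same fields
            blending=dict(type='MixupBlending', num_classes=174, alpha=0.2)))

    # Imgaug (PRs #492/#585) goes into train_pipeline instead, e.g. the
    # default preset, which applies RandAugment:
    imgaug_step = dict(type='Imgaug', transforms='default')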

@irvingzhang0512
Contributor

irvingzhang0512 commented Aug 3, 2021

Besides, you can refer to the configs of tsm-r50/sthv1, which contain a lot of different data augmentation pipelines.
(screenshots of the tsm-r50/sthv1 config files)

@wenjun90
Author

wenjun90 commented Aug 3, 2021

Thank you @dreamerlin and @irvingzhang0512

Could I ask you one more question, please, @dreamerlin?
I still don't quite understand these parameters: dict(type='SampleFrames', clip_len=1, frame_interval=1, num_clips=3)

  • Does clip_len=1 mean that a clip of length 1 s is extracted randomly from the total length of the video (for example, a 10 s video)?
  • What do frame_interval=1 and num_clips=3 mean?

Thank you very much!

@irvingzhang0512
Contributor

@wenjun90 #655 (comment)
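
In short (a paraphrase of the sampling semantics explained there; note the units are frames, not seconds):

    # SampleFrames parameters:
    #   num_clips      - how many clips (segments) to sample from the video
    #   clip_len       - how many frames each clip contains
    #   frame_interval - temporal stride between adjacent frames within a clip
    # So clip_len=1, frame_interval=1, num_clips=3 is TSN-style sampling:
    # the video is split into 3 segments and 1 frame is drawn from each.
    sample_step = dict(type='SampleFrames', clip_len=1, frame_interval=1, num_clips=3)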

@wenjun90
Author

Hey @dreamerlin and @irvingzhang0512,
Could I ask you a question about the learning rate in SlowFast, please?
I do my training with 1 GPU and 8 videos/GPU, so does the learning rate need to be set to 0.01?
Because lr=0.01 for 4 GPUs x 2 videos/gpu and lr=0.08 for 16 GPUs x 4 videos/gpu.
Thank you very much.

@wenjun90
Author

Hi @irvingzhang0512,
I did training with pytorchvideo.AugMix, and the training time is longer than when I don't use augmentation. Is that normal?

@irvingzhang0512
Contributor

Yes, it's normal; data augmentation requires a lot of CPU resources.

@wenjun90
Author

Thank you @irvingzhang0512,

Could I ask you 2 questions, please?

  • How can I use the config slowfast_r152_r50_4x16x1_256e_kinetics400_rgb with the dataset type "VideoDataset"?
  • Can I use MultiScaleCrop for SlowFast, like this?
    dict(
        type='MultiScaleCrop',
        input_size=224,
        scales=(1, 0.875, 0.75, 0.66),
        random_crop=False,
        max_wh_scale_gap=1,
        num_fixed_crops=13)

Thank you very much.

@kennymckormick
Member

  1. You should (see the sketch below):
    1. set dataset_type to VideoDataset;
    2. use a data list of videos (each line is like video.mp4 label);
    3. modify the data pipeline to use DecordInit and DecordDecode.
  2. That's OK, but the performance might be inferior.
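
A hedged sketch of those three changes together (MMAction2 0.x names; the annotation-file path is a placeholder, and the sampling parameters follow the SlowFast 4x16 recipe):

    dataset_type = 'VideoDataset'
    ann_file_train = 'data/kinetics400/kinetics400_train_list_videos.txt'  # each line: "video.mp4 label"

    train_pipeline = [
        dict(type='DecordInit'),    # open the video file with decord
        dict(type='SampleFrames', clip_len=32, frame_interval=2, num_clips=1),
        dict(type='DecordDecode'),  # decode only the sampled frames
        dict(type='Resize', scale=(-1, 256)),
        dict(type='RandomResizedCrop'),
        dict(type='Resize', scale=(224, 224), keep_ratio=False),
        dict(type='Flip', flip_ratio=0.5),
        # Normalize / FormatShape / Collect / ToTensor as in the base config
    ]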

@bit-scientist
Contributor

I am finding it hard to calculate a value for the learning rate when a different number of GPUs is available. Since lr is set to 0.01 for 4 GPUs x 2 videos/gpu and lr=0.08 for 16 GPUs x 4 videos/gpu, what exact value should I choose for my lr if I have one GPU (32 GB) with batch size = 64?

@kennymckormick
Member

Since the total batch size is still 64, you can use 0.08.
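
Both answers in this thread are consistent with the usual linear scaling rule: the learning rate scales with the total batch size (GPUs × videos per GPU). A quick sanity check using the reference setting quoted above (scaled_lr is a hypothetical helper for illustration):

    # Linear scaling rule: lr is proportional to the total batch size.
    # Reference point from the SlowFast config: lr=0.01 at 4 GPUs x 2 videos/GPU.
    def scaled_lr(gpus, videos_per_gpu, base_lr=0.01, base_batch=8):
        return base_lr * (gpus * videos_per_gpu) / base_batch

    print(scaled_lr(4, 2))   # 0.01 - reference setting (4 GPUs x 2 videos/GPU)
    print(scaled_lr(16, 4))  # 0.08 - matches the other reference setting
    print(scaled_lr(1, 8))   # 0.01 - wenjun90's case (total batch is still 8)
    print(scaled_lr(1, 64))  # 0.08 - bit-scientist's case (total batch of 64)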

@WEIZHIHONG720

Is "cutout" also used as data augmentation for video? If so, what is the typical cutout ratio setting? Thanks!
