Is it difficult to train/finetune ConvNeXtv2 compared with ConvNeXtv1? #15
Comments
Thanks for noting this issue.
Thanks for your explanation. I will try these models again.
I have been trying to train ConvNeXt-V2-Tiny again following your new optimization setup. However, my results still do not improve in overall accuracy, require much more GPU memory than V1, and remain much lower than those obtained with ConvNeXt-Tiny. Can you double-check the optimization recipe on CIFAR, MNIST, etc., for instance?
Can confirm it's difficult to fine-tune. ConvNextV1-base gets me 86%-88% on my dataset within 5 epochs, while ConvNextV2-Base can't seem to get over 81% no matter how I tweak the hyperparameters.
any updates on this issue? I'm having the same problem |
@Metal079 Any updates ? |
No |
Dear authors,
I have played around with both ConvNeXt v1 and yours using the TIMM codebase on my own datasets.
Using V1 I don't struggle with training/finetuning on my datasets and am pleased with the overall performance of TIMM's variants.
However, I cannot achieve comparable performance (overall accuracy as well as compute cost, of course) using your V2 variants, regardless of which pretrained weights I use.
Can you give me any tip, trick, or treat for a set of your hyperparameters?
Thanks in advance.
Linh