Issue training base model #19

christegho · 2019-11-26T16:57:22Z

I have been trying to train a base model for some time now.

I have had issues with the version of pytorch the code was built on. 0.3.1 would not work with CUDA versions past 8.0. But my GeGorce RTX 2080 would not work with CUDA versions below 9.0.

I managed to have the code base work with PyTorch 0.4.0 and 0.4.1, with CUDA 10.1.

I have two GPUs, each with 10986MB. I managed to have the base training run for many epochs, but then my whole machine would shut down all of the sudden, through the training. I suspect this is because of my RAM.

I did have to reduce the batch size and subdivisions, to get the training to start.

But this is all to say that I am not able to get a base model, and I am wondering if there is anyone who has a model to share?

I will commit my code for PyTorch >= 0.4.0 soon, on my fork, but it would be so nice to have weights I could use.

XinyiYS · 2019-12-02T09:05:45Z

You could try my trained base model: https://drive.google.com/open?id=1CSVFhfOHmRlbUsMu_eyBCvBWn_06a9zH

christegho · 2019-12-02T09:10:24Z

Hi Michael, Thanks for sharing your trained base model. This is very helpful!

…

On Mon, Dec 2, 2019 at 9:05 AM Michael ***@***.***> wrote: You could try my trained base model: https://drive.google.com/open?id=1CSVFhfOHmRlbUsMu_eyBCvBWn_06a9zH — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#19?email_source=notifications&email_token=ABZDIBMBRYNOKPJWGIPOQN3QWTFWVA5CNFSM4JR275VKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEFSYIVY#issuecomment-560301143>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABZDIBM7OX3ZGMAF6SV6DI3QWTFWVANCNFSM4JR275VA> .

XinyiYS · 2019-12-02T09:15:41Z

Thanks for sharing your trained base model. This is very helpful!

No problem, Chris. Give it a go. I didn't change any setting, it should give decent results on the base classes.

HuangLian126 · 2020-07-20T08:26:21Z

@christegho Hi, I try to train the base with torch 1.2.0 , torchvision 0.4.0 and CUDA 10.1. However, I get this error:

File "/home/hl/hl/Fewshot_Detection-master/region_loss.py", line 330, in forward
pred_boxes[0] = x.data + grid_x
RuntimeError: The size of tensor a (13) must match the size of tensor b (38870) at non-singleton dimension 3

The shape of x is torch[46,5,13,13], and the shape of x is torch[38870]. How do you fix this error?

Fly-dream12 · 2021-03-08T07:15:11Z

Have you solved it ? @ HuangLian126

li-yanling · 2021-06-30T02:48:22Z

@XinyiYS @christegho Could you please share your base model? The google drive link has expired. Many thanks!

XinyiYS · 2021-06-30T03:46:19Z

@XinyiYS @christegho Could you please share your base model? The google drive link has expired. Many thanks!

Hi yanling, sorry that I have removed the model from my google drive due to storage limit. Somehow I don't have a local backup of it. Apologies. Perhaps see if Chris would be able to provide a copy.

li-yanling · 2021-06-30T03:50:18Z

@XinyiYS @christegho Could you please share your base model? The google drive link has expired. Many thanks!

Hi yanling, sorry that I have removed the model from my google drive due to storage limit. Somehow I don't have a local backup of it. Apologies. Perhaps see if Chris would be able to provide a copy.
Hi Xinyi, thanks for your reply:)

XinyiYS mentioned this issue Dec 2, 2019

Hi, few shot tuning #11

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue training base model #19

Issue training base model #19

christegho commented Nov 26, 2019 •

edited

Loading

XinyiYS commented Dec 2, 2019

christegho commented Dec 2, 2019 via email

XinyiYS commented Dec 2, 2019

HuangLian126 commented Jul 20, 2020

Fly-dream12 commented Mar 8, 2021

li-yanling commented Jun 30, 2021

XinyiYS commented Jun 30, 2021

li-yanling commented Jun 30, 2021

Issue training base model #19

Issue training base model #19

Comments

christegho commented Nov 26, 2019 • edited Loading

XinyiYS commented Dec 2, 2019

christegho commented Dec 2, 2019 via email

XinyiYS commented Dec 2, 2019

HuangLian126 commented Jul 20, 2020

Fly-dream12 commented Mar 8, 2021

li-yanling commented Jun 30, 2021

XinyiYS commented Jun 30, 2021

li-yanling commented Jun 30, 2021

christegho commented Nov 26, 2019 •

edited

Loading