Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about some details #20

Open
dlyldxwl opened this issue Apr 12, 2018 · 17 comments
Open

about some details #20

dlyldxwl opened this issue Apr 12, 2018 · 17 comments

Comments

@dlyldxwl
Copy link

thanks for your job!
now i want to finetune the 135000 caffemodel by using myself dataest,and because of network ,i can't download ActivityNet dataest videos,so i don't know dataest format
i really hope you can help me.thx again!

@huijuan88
Copy link
Collaborator

huijuan88 commented Apr 12, 2018 via email

@dlyldxwl
Copy link
Author

thank you for you reply.
I fiuntune 135000 caffemodel, now the network can run. but I don't understand the information of Terminal output.

Accuracy: 0.984375
I0413 09:36:22.818658 4436 accuracy_layer.cpp:101] Class 0 accuracy : 1
I0413 09:36:22.818661 4436 accuracy_layer.cpp:101] Class 1 accuracy : 0.666667
TRAIN
I0413 09:36:22.826498 4436 solver.cpp:228] Iteration 63, loss = 0.973929
I0413 09:36:22.826509 4436 solver.cpp:244] Train net output #0: accuarcy = 0.714286
I0413 09:36:22.826514 4436 solver.cpp:244] Train net output #1: loss_cls = 0.531705 (* 1 = 0.531705 loss)
I0413 09:36:22.826519 4436 solver.cpp:244] Train net output #2: loss_twin = 0.38329 (* 1 = 0.38329 loss)
I0413 09:36:22.826520 4436 solver.cpp:244] Train net output #3: rpn_accuarcy = 0.984375
I0413 09:36:22.826522 4436 solver.cpp:244] Train net output #4: rpn_accuarcy_class = 1
I0413 09:36:22.826524 4436 solver.cpp:244] Train net output #5: rpn_accuarcy_class = 0.666667
I0413 09:36:22.826527 4436 solver.cpp:244] Train net output #6: rpn_cls_loss = 0.0271421 (* 1 = 0.0271421 loss)
I0413 09:36:22.826530 4436 solver.cpp:244] Train net output #7: rpn_loss_twin = 0.0317921 (* 1 = 0.0317921 loss)

  1. I believe the Accuray is rpn_accuarcy, and does the class0 ,1 accuracy indicate that RPN has the accuracy foreground and background classification ?and Train net output #0: accuarcy = 0.714286 indicate the accuarcy of the R-C3D network?
    2.the learing rate of conv1a,conv2a.. are 0, Do you think these layers needs backward computation when finetuned?
    3.I don't fine the demo.py file,which means I should write the py file when I want to use a video to detect the effect of caffemodel?

I hope you can answer my questions.
Wish you a happy day, thanks again!

@dlyldxwl
Copy link
Author

dlyldxwl commented Apr 13, 2018

I want to use caffemodel to detect a video, but I can't find the demo file ,if the folder has the file ,can you tell you the file postion?
if don't have the file, which means i need write it. In test_net.py, i find the input format of network is .pkl, but i want to use a video directtly, what should i do?

@huijuan88
Copy link
Collaborator

huijuan88 commented Apr 13, 2018 via email

@dlyldxwl
Copy link
Author

dlyldxwl commented Apr 13, 2018

thank you for your reply~
and Train net output #0: accuarcy = 0.714286 indicate R-C3D action classification accuarcy is 0.71?
LENGTH: [768] indicate that network input is 768 frame picture?
my dataest is small and class number is 5, can you give me some advice in parameter settings?Or use the network original parameters directly?
I just started learning action recognition, thanks!

@dlyldxwl
Copy link
Author

@huijuan88 The demo file has been written
and my dataest is small and video length is about 10 seconds, can you give me some advice in parameter settings?Or use the network original parameters directly?

@huijuan88
Copy link
Collaborator

huijuan88 commented Apr 15, 2018 via email

@dlyldxwl
Copy link
Author

@huijuan88
my video fps is 120,so I set FPS = 100 and retrain the work,
Train net output #0: accuarcy = 0.963636 , which means the network classification accuracy is 0.96? but i test some videos, result isn't good... can you give me some advices?
and i found a error in activitynet_log_analysis.py, line 113, after call get_segments function, we should set predict_data=[],Otherwise only the detection result of the first video can be output to result.json~

@huijuan88
Copy link
Collaborator

huijuan88 commented Apr 16, 2018 via email

@dlyldxwl
Copy link
Author

@huijuan88
Yes. now i am training the network by fiuntuning 13500caffemodel.
My dataest has 192 videos and 3class, and video length is about 10s, but videos fps are 120, now i want to finetune 135000caffemodel, can you give me some advice on parameter settings? for example: FPS, batchsize,and so on.
thx!

@sijun-zhou
Copy link

sijun-zhou commented Jul 13, 2018

Hi @dlyldxwl

I am a new for action detection and very interested in this field. I use the test script R-C3D/experiments/activitynet/test_net.py to do the test.
I am using a card of 1080Ti with 11G memory, but 2.5G was used by other students, so I was only left with 8.5G memory with GPU. But when I run the test_net.py script in ActivityNet , only loaded one 1 video's frams(768 images), but out of memory at the step:
blobs_out = net.forward(**forward_kwargs)
"""
F0713 15:08:15.452706 22317 syncedmem.cpp:56] Check failed: error == cudaSuccess (2 vs. 0) out of memory
*** Check failure stack trace: ***
Aborted (core dumped)
"""

I reduce the 768 images to 160 images. It is working fine with me with 8.5G memory left. But if I use 768 images nearly 5 times larger. So I guess I need 40G to 50G GPU memories. And it is difficult to run on pycaffe with multiple GPUs. Could you plz help me! I am a new to action detection. Really appreciated!

so could you plz tell me what is your GPU type and how many GPUs have you used when testing and training this code?
Thanks in advance!

@dlyldxwl
Copy link
Author

@sijun-zhou
hi , I am also a new for action detection.
Because of a competition, i trained this model, after that, i didn't work on it.
I could tell you my computer configuration, 12G memory and TITAN X(pascal).
if you memory isn't enough, i reckon you could use a small batch size or resolution, certainly, it is also work to fix more layers, but which can lead to a bad result.
Above are my suggestion, I can't ensure it can work well.
Finally, I wish you success in your experiment~

@sijun-zhou
Copy link

@dlyldxwl
Thanks very much! Really appreciate your reply. I'll look into detail of my problem! :)
BTW, you only use 1 TITAN X card?

Thanks,
Sijun

@dlyldxwl
Copy link
Author

@sijun-zhou
yes , i just have one TITANX card

@sijun-zhou
Copy link

@dlyldxwl
Thank you very much. I'll look into detail of my problem!

@YanYan0716
Copy link

@dlyldxwl i am a new for deep learning, so could you tell me how to use the caffemodel to detect a video just as you said, thanks a lot, best wish for you

@viswalal
Copy link

@dlyldxwl , could you please share the demo file you have written?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants