Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The problem of the precomputed optical flow #3

Closed
cctgem opened this issue Apr 11, 2021 · 9 comments
Closed

The problem of the precomputed optical flow #3

cctgem opened this issue Apr 11, 2021 · 9 comments
Labels
good first issue Good for newcomers

Comments

@cctgem
Copy link

cctgem commented Apr 11, 2021

感谢开源。有一个问题想请教。
光流如果是precompute的话,应该是frame by frame的,即frame_step == 1。
当训练到后期的时候会动态增加frame_step,即可能输入第10,20,30帧数据送网络,frame_step==10,对应的光流输入的是10->20,20->30,但是此时实际输入的光流是19->20,29->30。
不确定是否您做了对应的处理我没看到相应的代码,还是我理解错了离线做光流数据的方式。

@hzxie hzxie changed the title optical flow problem The problem of the precomputed optical flow Apr 11, 2021
@hzxie hzxie added the good first issue Good for newcomers label Apr 11, 2021
@hzxie
Copy link
Owner

hzxie commented Apr 11, 2021

Yes. You're right.
Actually, we recognize this issue and replace the precomputed optical flow with the real-time computed optical flow by FlowNet-CSS2, the results are almost the same for the DAVIS dataset.
One possible reason is that the motion of the objects is small in the DAVIS dataset and the padding of the bounding box increases the error tolerance of the precomputed optical flow.

Actually, for YouTube-VOS, we don't use the precomputed optical flow and replace it with the real-time computed optical flow by RAFT.

@hzxie hzxie closed this as completed Apr 11, 2021
@cctgem
Copy link
Author

cctgem commented Apr 11, 2021

Thanks for your reply. It seems that i need to update the code in train.py if i want to train from scratch.

@MaxChanger
Copy link

Why not use FlowNet-CSS2 on YouTube-VOS, but choose RAFT? @hzxie
Thanks.

@hzxie
Copy link
Owner

hzxie commented Apr 11, 2021

@MaxChanger
Good question. Because RAFT performs better than FlowNet-CSS2 when estimating the optical flow in YouTube-VOS.

@MaxChanger
Copy link

Can I understand it as FlowNet-CSS2 performs better than RAFT on DAVIS?But why?

@hzxie
Copy link
Owner

hzxie commented Apr 11, 2021

Yes, you are right. I am also confused about it.

@MaxChanger
Copy link

MaxChanger commented Apr 11, 2021

Well, it seems to be consistent with what I understand, and there is no theoretical explanation, only that the experimental effect is better. Very extensive experimental comparison. 👍👍
Thank you once again. @hzxie

@cctgem
Copy link
Author

cctgem commented Apr 11, 2021

Do you plan to release the model train on YouTube vos dataset?@hzxie

@hzxie
Copy link
Owner

hzxie commented Apr 11, 2021

@cctgem
Yes.
The model for YouTube-VOS was deleted by accident when I left SenseTime and needed to retrain again.
However, I am busy writing my Ph.D. thesis. Therefore, the model may be available later.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

3 participants