Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Understanding the output npy array #23

Open
Anirudh58 opened this issue Jul 18, 2019 · 1 comment
Open

Understanding the output npy array #23

Anirudh58 opened this issue Jul 18, 2019 · 1 comment

Comments

@Anirudh58
Copy link

Feeding 2 images of resolution 240x320 is generating a .npy array of shape (320, 240, 2) containing positive and negative values. I want to feed these as features for an action modeling problem. I want to get a better idea of what these values actually mean.
Is it the distance moved by that particular pixel in the next frame? or coordinate?
Would really appreciate if someone could give me clarity.
Thanks, in advance.

@varununayak
Copy link

I believe that the HXW dimensions are preserved (and not transposed).

For each pixel, optical flow is parameterized as follows [1]: one channel to specify the orientation (direction of motion in pixel space) and the other for the magnitude of motion.

[1] https://people.csail.mit.edu/celiu/OpticalFlow/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants