You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
in the documentation you say that when prediction 3d coordinates, the X and Y are "aligned to input image".
Could you please explain this in detail? Does it mean that the model outputs something in the range [-1, 1] or [0, 1] and you rescale it based on the image size?
If I wanted to rescale them in the range [-1, 1], should I just divide them by the image width and height respectively? Or do you perform a square cropping (e.g., 256x256) and one should divide the output accordingly?
Also, does [0, 0] correspond to the top left corner of the image?
The text was updated successfully, but these errors were encountered:
Hello,
in the documentation you say that when prediction 3d coordinates, the X and Y are "aligned to input image".
Could you please explain this in detail? Does it mean that the model outputs something in the range [-1, 1] or [0, 1] and you rescale it based on the image size?
If I wanted to rescale them in the range [-1, 1], should I just divide them by the image width and height respectively? Or do you perform a square cropping (e.g., 256x256) and one should divide the output accordingly?
Also, does [0, 0] correspond to the top left corner of the image?
The text was updated successfully, but these errors were encountered: