Output format and rescaling #22

tfederico · 2021-09-16T13:36:05Z

Hello,

in the documentation you say that when prediction 3d coordinates, the X and Y are "aligned to input image".
Could you please explain this in detail? Does it mean that the model outputs something in the range [-1, 1] or [0, 1] and you rescale it based on the image size?

If I wanted to rescale them in the range [-1, 1], should I just divide them by the image width and height respectively? Or do you perform a square cropping (e.g., 256x256) and one should divide the output accordingly?

Also, does [0, 0] correspond to the top left corner of the image?

tfederico · 2021-09-16T13:53:24Z

sorry, wrong repo

tfederico closed this as completed Sep 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output format and rescaling #22

Output format and rescaling #22

tfederico commented Sep 16, 2021

tfederico commented Sep 16, 2021

Output format and rescaling #22

Output format and rescaling #22

Comments

tfederico commented Sep 16, 2021

tfederico commented Sep 16, 2021