[Question] The processing method on bounding box of VG and RefCOCO are different? #822

Closed
Richar-Du opened this issue Nov 19, 2023 · 4 comments

Comments

@Richar-Du

Richar-Du commented Nov 19, 2023

Question

I have seen issue #606, and I agree that the VG bounding boxes are processed as described in #606 (comment). However, this processing does not seem to work for RefCOCO. The following is a RefCOCO image processed by expand2square:
image

And the following is the same image without processing by expand2square:
image
It seems that the second one is correct.

So I wonder if the processing methods used in RefCOCO and VG are different.
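
For reference, the two conventions being compared can be sketched as below. This is only a minimal sketch, assuming that expand2square pastes the image in the center of a square canvas whose side equals the longer edge. The function names `normalize_box` and `normalize_box_after_square_padding` and the example box are hypothetical and are not taken from the repository or from either dataset.

```python
def normalize_box(box, width, height):
    """Normalize a pixel-space [x1, y1, x2, y2] box by the original image size."""
    x1, y1, x2, y2 = box
    return [x1 / width, y1 / height, x2 / width, y2 / height]


def normalize_box_after_square_padding(box, width, height):
    """Normalize the same box relative to a centered square canvas.

    Assumption: the image sits in the middle of a square whose side is the
    longer edge, with integer paste offsets, so coordinates along the
    shorter axis are shifted by the padding and rescaled by the new side.
    """
    x1, y1, x2, y2 = box
    side = max(width, height)
    pad_x = (side - width) // 2   # non-zero only for portrait images
    pad_y = (side - height) // 2  # non-zero only for landscape images
    return [
        (x1 + pad_x) / side,
        (y1 + pad_y) / side,
        (x2 + pad_x) / side,
        (y2 + pad_y) / side,
    ]


# Hypothetical box on a 640x480 (landscape) image.
box = [100, 50, 300, 200]
print(normalize_box(box, 640, 480))
print(normalize_box_after_square_padding(box, 640, 480))
```

On a non-square image the two results differ only along the padded axis, so drawing a box with the wrong convention shifts and stretches it in that direction, which would be consistent with one of the two images above looking misaligned.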

@Richar-Du
Author

I misunderstood it. Sorry for the interruption.

@NorthSummer

Hi, I still haven't got it; the normalized [xa, ya, xb, yb] seems correct for VG data after padding, but is not correct for coco data after padding, right?

@boyugou

boyugou commented May 18, 2024

> Hi, I still haven't got it; the normalized [xa, ya, xb, yb] seems correct for VG data after padding, but is not correct for coco data after padding, right?

Is that true? I also don't understand whether the normalized coordinates in llava665k are adjusted for padding or not.

@lzy37ld

lzy37ld commented May 18, 2024

Same
