Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Datasets did not contain all the data #16

Closed
windstormer opened this issue Oct 7, 2021 · 2 comments
Closed

Datasets did not contain all the data #16

windstormer opened this issue Oct 7, 2021 · 2 comments

Comments

@windstormer
Copy link

Hi,
According to your paper, the IU X-Ray dataset contains 5226, 748, and 1496 images on train, val, and test, respectively.
However, the provided dataset you had published contains only 2069, 296, and 590 images, written in the annotation.json.
Is the shared dataset you used for training and testing?
If not, could you share your splitted dataset to me?
MIMIC-CXR dataset also had a similar problem, with 270790, 2130, and 3858, for the splitted dataset, which did not match the number present in your paper.

image

@tangyuxing
Copy link

Hi,

The number of images in the json file is indeed inconsistent with Table 1 of the paper.

In the json file, [2069, 296, 590] are the number of studies, while each study has one or multiple (normal 2: frontal and lateral view) x-ray images. The number of images in the json file is 5910 in total.

@zhjohnchan
Copy link
Contributor

Dear @tangyuxing and @windstormer,

Thanks for your attention to our paper! The statistics shown in the Table are from their original paper and the data provided is the pre-preprocessed one.

Best,
Zhihong

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants