Skip to content

First look at dataset

Valentyn1997 edited this page Jan 29, 2020 · 5 revisions

First look at dataset

Report of unusual cases - notebook

Goal project

  • Detect anomalies in hand xray images
  • Should work on unlabeled data → can't use classification task
  • First step: to detect obvious anomalies like metal plates (e.g.)

Example_obvious anomalies

Context dataset

Data is a subset of this dataset: MURA. "Study is manually labeled by ra- diologists as either normal or abnormal. To evaluate models robustly and to get an estimate of radiologist performance, we collect additional labels from six board- certified Stanford radiologists on the test set, consisting of 207 musculoskeletal studies. On this test set, the majority vote of a group of three radiologists serves as gold standard" (see abstract).

Overview over dataset

Images per person and study

Distribution_img_per_person_study

The range of amount of image ranges from 1 to 5 per study. One person can have more than one study. Most of the studies consist of 2-3 images.

Distribution label

label_distribution

More than 1400 studies are normal. About 500 studies are not normal.

Possible problems

  • The images have a frame around the hand → can influence the model

patient00008

  • The frame can vary

patient00050 patient00135 patient01853

  • Background color is not fixed

patient00256 patient00370 patient00551 patient5463

  • Noisy

patient00262

  • Text labels, sometimes more than one

patient00485

  • Arrow in the image

patient10147 patient10839

  • Different size

image1

Unusual cases

1. Different contrast levels/colours

patient00218 patient00218_3

2. Not whole hand in a picture / part of other body part in the picture

patient00457_3 patient02186 patient02699 patient02919

3. Different odd objects

patient00483 patient00608 patient00725 is classified as normal patient00831 patient03475

4. Quality of images is different

patient00485_3 patient01344 patient02465

Angles are different

patient00565_1 patient00934

Double hands

patient00588 patient03761

For the same view point positions are different

patient00648_2 patient00604_3 patient00608

Only parts of a hand

patient00218

Wired position

patient00376 patient10149

Part of other hand in the image

patient03947

2 images next to another in the same scan

patient09769

5. Misclassification?

patient02015 patient02560 patient10754