lq, hq data split #2

Closed

Nimbus1997 opened this issue Dec 6, 2022 · 17 comments
@Nimbus1997

Hello, I came again 😁

I was looking at your paper and have a question about how you split the LQ and HQ data. As mentioned in your paper and your GitHub repo, you split the HQ and LQ data by the EyeQ/MCF-Net quality grade: LQ for "usable" and HQ for "good".
But in the table in the paper, the Original FIQA is not zero, which means some pictures are graded as "good".
How is this possible?
Did you use the quality level from "EyeQ/data/Label_EyeQ_train.csv"? I did, and found that some images labeled "usable" are predicted as "good" or "reject" when I test them with MCF-Net.


@QtacierP
Owner

QtacierP commented Dec 6, 2022

Aha, there must be some problems/misunderstandings. Firstly, the data split follows the ground-truth labels (the CSV file you mentioned), not the MCF-Net predictions. Moreover, it would be surprising if no data were classified wrongly, since the accuracy of MCF-Net on the test set is not 100% (it is around 80%~90%, and "usable" is the most challenging grade). Make sure you are testing on the EyeQ test set, and then calculate the ACC on this dataset.
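A minimal sketch of that check, not the authors' actual script: the column names ("image", "quality"), the 0/1/2 grade encoding, and the predictions file name are assumptions, so adjust them to your setup.

```python
import pandas as pd

# Ground-truth grades from EyeQ (assumed encoding: 0 = Good, 1 = Usable, 2 = Reject).
labels = pd.read_csv("EyeQ/data/Label_EyeQ_test.csv")

# Hypothetical CSV holding your MCF-Net predictions, one row per image.
preds = pd.read_csv("mcfnet_predictions.csv")

merged = labels.merge(preds, on="image", suffixes=("_gt", "_pred"))
acc = (merged["quality_gt"] == merged["quality_pred"]).mean()
print(f"overall ACC: {acc:.3f}")

# Per-grade accuracy: "Usable" is expected to be the hardest grade.
for grade, name in [(0, "Good"), (1, "Usable"), (2, "Reject")]:
    subset = merged[merged["quality_gt"] == grade]
    print(name, (subset["quality_pred"] == grade).mean())
```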

@QtacierP
Owner

QtacierP commented Dec 6, 2022

[Screenshot: MCF-Net performance table]
The performance of MCF-Net is shown above. You can see that none of these models achieves 100% ACC. Moreover, classifying "Usable" is the most difficult case, since it is the most ambiguous level, so the performance on "usable" should be lower than the average performance.

@QtacierP
Owner

QtacierP commented Dec 6, 2022

Here are some suggestions for this problem:
(1) First, calculate the ACC on the EyeQ test dataset and compare the results with the original paper. I guess there may be a performance gap between your implementation and the official implementation.
(2) Second, check the weights in the network; make sure you have loaded the correct weights from the pre-trained checkpoint.
(3) Third, check the pre-processing (especially the normalization); make sure it matches the original MCF-Net.
(4) Lastly, FIQA only considers Good vs. Not Good, so the "Reject" grade is not involved. Use torch.argmax() to get the predicted label, count the number of samples graded as "Good" (call it X), and let Y be the total number of samples. FIQA is simply X/Y (see the sketch below).
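A minimal sketch of (4), assuming `logits` is the MCF-Net output of shape (N, 3) and that class index 0 corresponds to "Good" (adjust the index to your checkpoint's class ordering):

```python
import torch

def fiqa(logits: torch.Tensor, good_index: int = 0) -> float:
    """Fraction of images whose predicted grade is "Good" (X / Y)."""
    preds = torch.argmax(logits, dim=1)        # predicted quality grade per image
    x = (preds == good_index).sum().item()     # X: images predicted as "Good"
    y = logits.shape[0]                        # Y: total number of images
    return x / y
```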
I hope these suggestions can help you :)

@Nimbus1997
Author

Nimbus1997 commented Dec 6, 2022

Aha, so the quality labels in "EyeQ/data/Label_EyeQ_train.csv" and "EyeQ/data/Label_EyeQ_test.csv" are the ground truth, not the output of MCF-Net! Thanks a million.

How did you split your data into train/val/test? Your README says "Split the dataset into train/val/test according to the EyePACS challenge", but in the Kaggle challenge I only found a train/test separation 😔

@QtacierP
Owner

QtacierP commented Dec 7, 2022

We followed the dataset split used for the DR classification task in my teammate's work (https://arxiv.org/pdf/2110.14160.pdf). I think you can simply hold out 20% of the training data as the validation set, which should also be fine.
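One way to do such a hold-out (not the authors' exact split): keep 20% of the EyeQ training images as validation, stratified by the quality label so the good/usable ratio is preserved. The file path and the "quality" column name are assumptions.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Assumed file path and column name; adjust to the actual EyeQ CSV header.
df = pd.read_csv("EyeQ/data/Label_EyeQ_train.csv")

train_df, val_df = train_test_split(
    df, test_size=0.2, stratify=df["quality"], random_state=42)

train_df.to_csv("train_split.csv", index=False)
val_df.to_csv("val_split.csv", index=False)
```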

@Nimbus1997
Author

Aha I see
Thanks a million 😘

@Nimbus1997
Author

Hello again :)

I have more questions about the data split you made. I read the teammate's work you mentioned (https://arxiv.org/pdf/2110.14160.pdf), but found that the total number of images differs from the EyeQ data: 88,702 in EyePACS vs. 23,252 in EyeQ when using only "usable" images for low quality.

So could you share how you split your data (a file-name list for each of train/val/test)? If you can, I will give you my email!
Or could you please tell me how many images are in each train/val/test set (low and high quality separately)?

Thanks a lot as always, you are a huge help to me.

@Nimbus1997 Nimbus1997 reopened this Feb 24, 2023
@QtacierP
Owner

We followed the train/test split provided by the official EyeQ dataset, which can be found at https://github.com/HzFu/EyeQ/tree/master/data. As there was no validation set available, we created one by splitting the training set. We have now uploaded the data split to the repository, so you can access it. :)

@QtacierP
Owner

The label "1" is "good", while label "0" is "usable".
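A small sketch for reading the uploaded split files under that convention; the file path and the "image"/"label" column names are hypothetical, so check the actual CSV header.

```python
import pandas as pd

# Hypothetical path and column names for one of the uploaded split files.
split = pd.read_csv("data/train.csv")

hq = split[split["label"] == 1]["image"]   # label 1 -> "good"   (high quality)
lq = split[split["label"] == 0]["image"]   # label 0 -> "usable" (low quality)
print(len(hq), "HQ images,", len(lq), "LQ images")
```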

@Nimbus1997
Author

Nimbus1997 commented Feb 24, 2023

Oh wow..! 🥺🥺
Thank you so much again

@Nimbus1997
Author

Nimbus1997 commented Mar 4, 2023

Hello, I found that the labels in the CSV file you uploaded are slightly different from the EyeQ version (https://github.com/HzFu/EyeQ).
EyeQ version:
[Screenshot: EyeQ label counts]

Your version:
Good (HQ) = 12,905
Usable (LQ) = 10,347

But the total (good + usable) number is the same (23,252).

Could you please tell me how you got the labels?

@Nimbus1997 Nimbus1997 reopened this Mar 4, 2023
@QtacierP
Owner

QtacierP commented Mar 4, 2023

I used the labels from EyeQ V1, but EyeQ has been updated to V2 now. You can check this branch https://github.com/HzFu/EyeQ/tree/95c63a743a68b1665d7ecb1e050a2d5b4f0f3408 for more details on V1.
I think the V2 version may provide more accurate annotations for image quality assessment :)

@Nimbus1997
Author

Aha I see thanks!

@QtacierP
Owner

QtacierP commented Mar 4, 2023

I apologize for any confusion. Upon reviewing my workspace, I discovered that the number of "good" images is 16818, as compared to 16817 in EyeQ. Additionally, the number of "usable" images is 6436, versus 6435 in EyeQ. It appears that there is only a one-image discrepancy between versions 1 and 2. As the CSV file I uploaded is only utilized for the public split, I will investigate whether there are any issues with these files.
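A hedged sketch for comparing the two label sources when investigating such a discrepancy; the file names and column names below are assumptions, not the authors' actual scripts.

```python
import pandas as pd

# Grade counts in the EyeQ ground-truth files (assumed encoding: 0 = Good, 1 = Usable, 2 = Reject).
eyeq = pd.concat([pd.read_csv("EyeQ/data/Label_EyeQ_train.csv"),
                  pd.read_csv("EyeQ/data/Label_EyeQ_test.csv")])
print(eyeq["quality"].value_counts())

# Grade counts in the uploaded split files (1 = good, 0 = usable); file names are hypothetical.
split = pd.concat([pd.read_csv(p) for p in
                   ["data/train.csv", "data/val.csv", "data/test.csv"]])
print(split["label"].value_counts())
```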

@QtacierP
Owner

QtacierP commented Mar 4, 2023

[Screenshot: label counts printed in the terminal]
After reviewing the code and the counts in the uploaded CSV files, everything appears to be in order. However, it may be prudent to double-check the CSV files to ensure that they are accurate.

@QtacierP
Owner

QtacierP commented Mar 4, 2023

The "bad" label shown in the bash represents the "usable" grades in EyeQ.

@Nimbus1997
Author

I am sorry, it was my mistake.
I had miscounted your dataset; when I counted again, it is the same as the EyeQ set.
