-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Dear Authors,
First of all, thank you for your impressive work on
While my reproduction on ResNet18 closely matches the reported results in Table 1, I have observed a significant performance gap when evaluating on ResNet50 and ResNet101. This discrepancy persists even when I evaluate the distilled images provided by you.
Here are the detailed comparisons:
| Model | Paper Reported (Table 1) | My Eval on Author's Data | My Reproduction (Re-generated) |
|---|---|---|---|
| ResNet18 | 65.0 ± 0.7% | 66.8% | 64.8% (Matches well) |
| ResNet50 | 67.5 ± 0.8% | 62.8% | 59.8% |
| ResNet101 | 66.3 ± 1.3% | 55.2% | 58.4% |
Observations:
ResNet18 Alignment: The reproduction for ResNet18 is very successful, which suggests the environment and generation pipeline are likely correct.
Evaluation Gap: The fact that evaluating the provided distilled data locally yields significantly lower results for ResNet50 (-4.7%) and ResNet101 (-11.1%) suggests the issue might be related to the evaluation protocol rather than the distillation process itself.
Questions:
Could you please tell me what points should be noted during the evaluation on Resnet50/101?
Thank you for your time and help!