Neural-Relation-Graph

Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data", published at NeurIPS'23

Requirements

PyTorch 1.11 and timm 0.5.4

conda create --name relation python=3.8.8
conda activate relation

pip install -r requirements.txt
# [CUDA install example] pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 --extra-index-url https://download.pytorch.org/whl/cu113

Set IMGNET_DIR in ./imagenet/data.py to contain ImageNet train and val directories.
We provide main experiment checkpoints, including data features via Google Drive. To download, install gdown:

   pip uninstall --yes gdown
   pip install gdown -U --no-cache-dir

Notes

The default save directory is ./results. You can specify a different directory by using the --cache_dir option (for both download and detection).
For detection, you can reduce memory usage by half using half-precision with --dtype float16, with a marginal performance drop. Also, using smaller --chunk (e.g., 50) reduces memory usage, while leads to increased computation time.
We train MAE-Large for 50 epochs following official codes.
For ResNet50, we use checkpoint provided by timm.
To run experiments with reduced dataset size, set a hop size for sampling by --hop [int].

Label error detection

ImageNet with synthetic label error (8%)

Download model, features, and noisy labels (6.3GB):

python download.py -n mae_large_noise0.08_49

To conduct detection, run

python detect.py -n mae_large_noise0.08_49 --pow 4

The required GPU Memory is approximately 14GB. You can reduce memory usage by half using half-precision with --dtype float16, with a marginal performance drop. Also, using smaller --chunk (e.g., 50) reduces memory usage, while leads to increased computation time.

ImageNet validation set cleaning

Download model and features (6.4GB):

python download.py -n mae_large_49

To conduct detection, run

python detect_val.py -n mae_large_49 --pow 4

OOD detection

Download OOD datasets following this link.
Set OOD_DIR in ./imagenet/data.py (to contain dtd, iNaturalist, Places, SUN folders).
Download model and features (6.4GB for MAE-Large / 11GB for ResNet50):

python download.py -n [mae_large_49/resnet50]

To conduct OOD detection, run

python detect_ood.py -n [mae_large_49/resnet50] --pow 1

The required GPU Memory is approximately 14GB for MAE-Large and 18GB for ResNet50. You can reduce memory usage by half using half-precision with --dtype float16, with a marginal performance drop. Also, using smaller --chunk (e.g., 50) reduces memory usage while leading to increased computation time.

Language and speech datasets

Check ./language and ./speech

Applying our method to custom datasets

Prepare data features and probability vectors.
Update self._load_feat and self._load_noisy_label functions in detect.py for label error and detect_ood.py for OOD.
Run the updated Python scripts.
- Tune --pow, a temperature for the kernel function (suggestion: [1, 4, 6, 8]).

Citation

@article{kim2023neural,
  title={Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data},
  author={Kim, Jang-Hyun and Yun, Sangdoo and Song, Hyun Oh},
  journal={Advances in Neural Information Processing Systems},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
figure		figure
imagenet		imagenet
language		language
models		models
speech		speech
.gitignore		.gitignore
License		License
README.md		README.md
argument.py		argument.py
detect.py		detect.py
detect_ood.py		detect_ood.py
detect_val.py		detect_val.py
download.py		download.py
feature.py		feature.py
metric.py		metric.py
relation.py		relation.py
requirements.txt		requirements.txt

License

snu-mllab/Neural-Relation-Graph

Folders and files

Latest commit

History

Repository files navigation

Neural-Relation-Graph

Requirements

Notes

Label error detection

ImageNet with synthetic label error (8%)

ImageNet validation set cleaning

OOD detection

Language and speech datasets

Applying our method to custom datasets

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages