PACTran Metrics

This is the code repository for the paper: PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks (https://arxiv.org/abs/2203.05126).

This is not an officially supported Google product.

Introduction

PACTran is a theoretically grounded family of metrics for pretrained model selection and transferability measurement. The family is derived from the optimal PAC-Bayesian bound under the transfer learning setting. It contains three metric instantiations: PACTran-Dirichlet, PACTran-Gamma, PACTran-Gaussian. Our experiments showed that PACTran-Gaussian is a more consistent and effective transferability measure compared to other existing selection methods.

How to use

In the following, we provide the instructions of using the metrics (while using the caltech101 task in the Visual Task Adaptation Benchmark (VTAB) [1] and the Sup-100% checkpoint as an example). Run the following scripts in the parent dir of pactran_metrics.

Prerequisites:
- Tensorflow
- Tensorflow-Hub
- Tensorflow-datasets
- Numpy
- Scipy
- Scikit-learn
For penultimate layer feature prediction, run

python -m pactran_metrics.adapt_and_eval \
--hub_module 'https://tfhub.dev/vtab/sup-100/1'  \
--hub_module_signature default \
--finetune_layer default \
--work_dir /tmp/all_features/sup-100/caltech101/ \
--dataset 'caltech101' \
--batch_size 512 \
--batch_size_eval 512 \
--initial_learning_rate 0.001 \
--decay_steps 1500,3000,4500 \
--max_steps 1 \
--run_adaptation=False \
--run_evaluation=False \
--run_prediction=True \
--linear_eval=False

For whole network finetuning, run

python -m pactran_metrics.adapt_and_eval \
--hub_module 'https://tfhub.dev/vtab/sup-100/1'  \
--hub_module_signature default \
--finetune_layer default \
--work_dir /tmp/all_models/sup-100/caltech101/ \
--dataset 'caltech101' \
--batch_size 512 \
--batch_size_eval 512 \
--initial_learning_rate 0.001 \
--decay_steps 1500,3000,4500 \
--max_steps 5000 \
--run_adaptation \
--use_anil False

For top-layer only finetuning, run

python -m pactran_metrics.anil_classifier \
--hub_module="https://tfhub.dev/vtab/sup-100/1" \
--dataset="caltech101" \
--work_dir=/tmp/all_results/caltech101 \
--feature_dir=/tmp/all_features

For metrics estimation, run

python -m pactran_metrics.compute_metrics \
--hub_module="https://tfhub.dev/vtab/sup-100/1" \
--dataset="caltech101" \
--work_dir=/tmp/all_results/caltech101 \
--feature_dir=/tmp/all_features \
--num_examples=2

All available datasets (in ./data folder):
- "caltech101"
- "oxford_flowers102"
- "patch_camelyon"
- "sun397"
- "dmlab"
- "cifar"
- "ddsm"
- "oxford_iiit_pet"
- "smallnorb"
All available model URLs:

Reference:

[1] Zhai, X., Puigcerver, J., Kolesnikov, A., Ruyssen, P., Riquelme, C., Lucic, M., Djolonga, J., Pinto, A.S., Neumann, M., Dosovitskiy, A., et al.: A large-scale study of representation learning with the visual task adaptation benchmark. arXiv preprint arXiv:1910.04867 (2019)

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
data		data
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
adapt_and_eval.py		adapt_and_eval.py
anil_classifier.py		anil_classifier.py
compute_correlation.py		compute_correlation.py
compute_metrics.py		compute_metrics.py
config.py		config.py
data_loader.py		data_loader.py
data_loader_test.py		data_loader_test.py
loop.py		loop.py
loop_test.py		loop_test.py
model.py		model.py
model_test.py		model_test.py
registered_modulers.py		registered_modulers.py
registry.py		registry.py
registry_test.py		registry_test.py
requirements.txt		requirements.txt
test_utils.py		test_utils.py
trainer.py		trainer.py
trainer_test.py		trainer_test.py

License

google-research/pactran_metrics

Folders and files

Latest commit

History

Repository files navigation

PACTran Metrics

Introduction

How to use

Reference:

About

Resources

License

Stars

Watchers

Forks

Languages