Horama is a compact library designed for Feature Visualization experiments, initially providing the implementation code for the research paper Maco.
This repository also introduces various feature visualization methods, including a reimagined approach to the remarkable work by the Clarity team and an implementation of Feature Accentuation by Hamblin et al. For an official reproduction of Distill's work, complete with comprehensive notebooks, we highly recommend Lucent. However, Horama emphasizes experimentation and is not an official reproduction of any other paper aside from Maco within PyTorch.
Horama requires Python 3.6 or newer and several dependencies, including Numpy. It supports both Tensorflow and Torch. Installation is straightforward with Pypi:
pip install horama
With Horama installed, you can dive into feature visualization. The API is designed to be intuitive across both Tensorflow and Pytorch frameworks, requiring only a few hyperparameters to get started.
Example usage:
import torch
import timm
from horama import maco, fourier, plot_maco
%config InlineBackend.figure_format = 'retina'
model = timm.create_model('resnet18', pretrained=True).cuda().eval()
objective = lambda images: torch.mean(model(images)[:, 1])
image1, alpha1 = maco(objective)
plot_maco(image1, alpha1)
plt.show()
image2, alpha2 = fourier(objective)
plot_maco(image2, alpha2)
plt.show()
Complete API Guide Horama's API includes the following primary functions:
maco(objective_function,
total_steps=1000,
learning_rate=1.0,
image_size=1000,
model_input_size=224,
noise=0.1,
values_range=(-2.5, 2.5),
crops_per_iteration=6,
box_size=(0.20, 0.25),
device='cuda')
fourier(objective_function,
decay_power=1.5,
total_steps=1000,
learning_rate=1.0,
image_size=1000,
model_input_size=224,
noise=0.1,
values_range=(-2.5, 2.5),
crops_per_iteration=6,
box_size=(0.20, 0.25),
device='cuda')
When optimizing, it's crucial to fine-tune the hyperparameters. Parameters like the decay spectrum in the Fourier method significantly impact the visual output, controlling the energy distribution across frequencies. Additionally, adjust the values_range to match your model's preprocessing requirements, and ensure model_input_size matches the expected input size of your model.
@article{fel2023maco,
title={Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained Optimization},
author={Thomas, Fel and Thibaut, Boissin and Victor, Boutin and Agustin, Picard and Paul, Novello and Julien, Colin and Drew, Linsley and Tom, Rousseau and Rémi, Cadène and Laurent, Gardes and Thomas, Serre},
journal={Advances in Neural Information Processing Systems (NeurIPS)},
year={2023},
}
For a simpler and maintained implementation of the code for TensorFlow and the other feature visualization methods used in the paper, refer to the Xplique toolbox. Additionally, we have created a website called the LENS Project, which features the 1000 classes of ImageNet.
For code faithful to the original work of the Clarity team, we highly recommend Lucent.
- Thomas Fel - thomas_fel@brown.edu, PhD Student, Brown University & DEEL (ANITI)