This repository contains the code for the paper "Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-shot Learning with Hyperspherical Embeddings", CVPR 2023.
**Abstract:** Distance-based classification is frequently used in transductive few-shot learning (FSL). However, due to the high dimensionality of image representations, FSL classifiers are prone to suffer from the hubness problem, where a few points (hubs) appear frequently in the nearest-neighbor lists of many other points. Hubness negatively impacts distance-based classification when hubs from one class appear often among the nearest neighbors of points from another class, degrading the classifier's performance. To address the hubness problem in FSL, we first prove that hubness can be eliminated by distributing representations uniformly on the hypersphere. We then propose two new approaches to embed representations on the hypersphere, which we prove optimize a tradeoff between uniformity and local similarity preservation, reducing hubness while retaining class structure. Our experiments show that the proposed methods reduce hubness and significantly improve transductive FSL accuracy for a wide range of classifiers.
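For intuition, hubness is commonly quantified via the skewness of the k-occurrence distribution, i.e. how often each point appears among the k nearest neighbors of the other points. The following sketch is illustrative only (not part of this repo, and the helper name is ours); it also shows the uniformity argument above in action, since l2-normalizing an isotropic Gaussian sample places it uniformly on the hypersphere:

```python
# Illustration: measure hubness as the skewness of the k-occurrence distribution.
import numpy as np
from scipy.stats import skew
from sklearn.neighbors import NearestNeighbors

def k_occurrence_skewness(features: np.ndarray, k: int = 10) -> float:
    """Skewness of the k-occurrence counts; larger values mean stronger hubness."""
    # k + 1 neighbors because each point is returned as its own nearest neighbor.
    nn = NearestNeighbors(n_neighbors=k + 1).fit(features)
    _, idx = nn.kneighbors(features)
    # Count how often each point appears in the others' neighbor lists (drop self).
    counts = np.bincount(idx[:, 1:].ravel(), minlength=len(features))
    return float(skew(counts))

rng = np.random.default_rng(0)
x = rng.normal(size=(1000, 512))  # high-dimensional features: hubs emerge
print(k_occurrence_skewness(x))
print(k_occurrence_skewness(x / np.linalg.norm(x, axis=1, keepdims=True)))
```

On the raw high-dimensional sample the skewness is clearly positive (a few points dominate the neighbor lists); after normalization onto the unit sphere it drops towards zero.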
**1-shot**

| Backbone | Embedding | mini Acc | mini Score | tiered Acc | tiered Score | CUB Acc | CUB Score |
|---|---|---|---|---|---|---|---|
| ResNet18 | None | 55.74 | 0.17 | 62.61 | 0.0 | 63.78 | 0.17 |
| | L2 | 68.22 | 2.33 | 75.94 | 2.17 | 78.09 | 2.33 |
| | Cl2 | 69.56 | 2.83 | 76.97 | 3.0 | 78.26 | 2.83 |
| | ZN | 60.0 | 2.33 | 66.21 | 2.5 | 67.43 | 2.67 |
| | ReRep | 60.76 | 4.0 | 67.07 | 3.67 | 69.6 | 4.17 |
| | EASE | 69.63 | 3.67 | 77.05 | 4.0 | 78.84 | 3.67 |
| | TCPR | 69.97 | 4.0 | 77.18 | 3.33 | 78.83 | 4.0 |
| | noHub (Ours) | 72.58 | 6.83 | 79.77 | 6.83 | 81.91 | 6.83 |
| | noHub-S (Ours) | 73.64 | 7.67 | 80.6 | 7.67 | 83.1 | 7.67 |
| WideRes28-10 | None | 63.59 | 1.0 | 71.29 | 0.83 | 79.23 | 1.17 |
| | L2 | 74.3 | 3.0 | 76.19 | 2.67 | 88.61 | 3.5 |
| | Cl2 | 71.32 | 1.33 | 75.17 | 2.0 | 88.52 | 3.33 |
| | ZN | 64.27 | 2.5 | 65.64 | 2.5 | 76.0 | 1.5 |
| | ReRep | 65.51 | 3.0 | 71.83 | 3.17 | 83.1 | 3.5 |
| | EASE | 74.95 | 4.33 | 76.59 | 3.67 | 88.51 | 3.5 |
| | TCPR | 75.64 | 4.83 | 76.51 | 4.0 | 88.22 | 2.5 |
| | noHub (Ours) | 78.22 | 7.0 | 79.76 | 7.0 | 90.25 | 5.67 |
| | noHub-S (Ours) | 79.24 | 7.67 | 80.46 | 7.67 | 90.82 | 7.67 |
**5-shot**

| Backbone | Embedding | mini Acc | mini Score | tiered Acc | tiered Score | CUB Acc | CUB Score |
|---|---|---|---|---|---|---|---|
| ResNet18 | None | 69.83 | 0.83 | 74.38 | 0.67 | 76.01 | 1.17 |
| | L2 | 81.58 | 2.33 | 86.05 | 1.83 | 88.43 | 2.83 |
| | Cl2 | 81.95 | 2.67 | 86.43 | 3.0 | 88.49 | 2.5 |
| | ZN | 71.49 | 4.0 | 75.32 | 3.83 | 76.92 | 3.5 |
| | ReRep | 70.25 | 2.5 | 74.52 | 1.83 | 76.43 | 2.5 |
| | EASE | 81.84 | 3.5 | 86.4 | 3.17 | 88.57 | 3.5 |
| | TCPR | 82.1 | 4.0 | 86.54 | 3.83 | 88.79 | 4.33 |
| | noHub (Ours) | 82.58 | 5.5 | 86.9 | 4.5 | 89.13 | 6.0 |
| | noHub-S (Ours) | 82.61 | 6.5 | 87.13 | 6.67 | 88.93 | 5.33 |
| WideRes28-10 | None | 78.77 | 1.5 | 84.1 | 1.67 | 89.49 | 1.67 |
| | L2 | 85.65 | 4.0 | 86.29 | 3.83 | 93.47 | 3.67 |
| | Cl2 | 83.14 | 1.33 | 85.47 | 1.5 | 93.49 | 4.0 |
| | ZN | 74.61 | 4.33 | 75.34 | 5.0 | 81.02 | 3.17 |
| | ReRep | 73.86 | 1.83 | 81.51 | 1.67 | 87.2 | 2.0 |
| | EASE | 85.51 | 3.5 | 86.29 | 3.33 | 93.34 | 3.5 |
| | TCPR | 86.03 | 6.0 | 86.37 | 4.0 | 93.3 | 3.0 |
| | noHub (Ours) | 86.44 | 5.67 | 87.07 | 5.5 | 93.65 | 4.17 |
| | noHub-S (Ours) | 85.95 | 5.5 | 87.05 | 5.83 | 93.76 | 5.0 |
The datasets can be downloaded by following the instructions in the *Realistic evaluation of transductive few-shot learning* repo.
After downloading the datasets, use the files in `data/split` to separate the images into the directories `data/[mini|tiered|cub]/[train|val|test]`.
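A hypothetical sketch of this step is shown below. It assumes each split file is a CSV of `filename,label` rows and that images should be grouped into per-class subdirectories; inspect the actual files in `data/split` for the exact format and adjust accordingly.

```python
# Hypothetical helper: copy downloaded images into data/<dataset>/<split>/<class>/.
# Assumes data/split/<dataset>/<split>.csv holds "filename,label" rows; verify
# against the actual split files before use.
import csv
import shutil
from pathlib import Path

def organise_split(dataset: str, split: str, images_dir: Path) -> None:
    out_root = Path("data") / dataset / split
    with open(Path("data/split") / dataset / f"{split}.csv", newline="") as f:
        reader = csv.reader(f)
        next(reader)  # skip the header row, if the file has one
        for filename, label in reader:
            class_dir = out_root / label
            class_dir.mkdir(parents=True, exist_ok=True)
            shutil.copy(images_dir / filename, class_dir / filename)

for split in ("train", "val", "test"):
    organise_split("mini", split, Path("path/to/downloaded/mini/images"))
```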
Download the checkpoints from:
- ResNet-18: *Realistic evaluation of transductive few-shot learning*
- WideRes28-10: S2M2 (*Charting the Right Manifold: Manifold Mixup for Few-shot Learning*)

and place them in the `models` directory.
When executing the evaluation script, setting `--cache_dir /path/to/cached/features` saves the computed features to the specified directory. Setting `--use_cached True` in subsequent runs then loads these cached features, making repeated evaluations with the same dataset and feature extractor much faster.
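For example (the other evaluation arguments are elided here; see the evaluation section below):

```bash
# First run: features are computed and written to the cache directory.
python evaluate.py ... --cache_dir /path/to/cached/features
# Subsequent runs: the cached features are loaded instead of recomputed.
python evaluate.py ... --cache_dir /path/to/cached/features --use_cached True
```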
Run

```bash
conda env create -f environment.yml
```

to create a conda environment with the base requirements. Then, inside the newly created environment, install the remaining dependencies with `pip`:

```bash
pip install -r requirements.txt
```
The `docker` directory contains a build script and a Dockerfile to build a Docker image with all required dependencies.
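A typical workflow might look like the following; the build script name and image tag are assumptions, so check the `docker` directory for the actual ones:

```bash
# Build the image (assumed script name; see docker/ for the real one).
./docker/build.sh
# Or build directly from the Dockerfile with an illustrative tag:
docker build -t nohub -f docker/Dockerfile .
# Run with GPU access, mounting the repository into the container.
docker run --rm -it --gpus all -v "$PWD":/workspace nohub
```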
Evaluation is performed by running

```bash
python evaluate.py --checkpoint "<path/to/feature/extractor/checkpoint.ckpt>" --n_shots "<shots>" --dataset "[mini|tiered|cub]" --classifier "<classifier>" --embedding "<embedding>" "<optional arguments>"
```

where `<classifier>` and `<embedding>` are one of the implemented classifiers and embedding methods, respectively (see the tables below).
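For example, the following evaluates noHub-S with the SimpleShot classifier on 1-shot mini-ImageNet (the checkpoint filename is illustrative):

```bash
python evaluate.py \
    --checkpoint "models/resnet18.ckpt" \
    --n_shots 1 \
    --dataset mini \
    --classifier simpleshot \
    --embedding nohubs
```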
Alternatively, the arguments can be provided in a `yaml` file:

```bash
python evaluate.py -c path/to/config/file.yml "<optional arguments>"
```

See `src/config/templates` for examples of config files.
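As a sketch, such a config might mirror the command-line arguments one-to-one; the key names and checkpoint path below are assumptions, so consult the templates in `src/config/templates` for the real ones:

```yaml
# Hypothetical config; key names assumed to match the CLI flags.
checkpoint: models/wideres2810.ckpt
n_shots: 5
dataset: tiered
classifier: alpha_tim
embedding: nohub
```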
| Name | `--embedding` |
|---|---|
| None | `none` |
| L2 | `l2` |
| CL2 | `cl2` |
| ZN | `zn` |
| ReRep | `rr` |
| EASE | `ease` |
| TCPR | `tcpr` |
| noHub (Ours) | `nohub` |
| noHub-S (Ours) | `nohubs` |
| Name | `--classifier` |
|---|---|
| SimpleShot | `simpleshot` |
| LaplacianShot | `laplacianshot` |
| α-TIM | `alpha_tim` |
| iLPC | `ilpc` |
| Oblique Manifold | `om` |
| SIAMESE | `siamese` |