BenCowen/SparseCoding

About

All day, every day we unconsciously segment the world into tidy pieces.

From distinguishing and identifying elements of the physical space around us to breaking down abstract ideas into something more familiar-- decomposition on all planes of conception is fundamental to our brains.

In mathematical signal processing we use a wide variety of approaches to replicate this automatic decomposition in computer programs. There are linear techniques that simply separate components along the axes of highest variation (e.g. Principal Component Analysis), and there are nonlinear techniques that seek to parameterize these axes so as to pull out interpretable components (like Morphological Component Analysis). There are more explicit approaches, like mathematically beamforming data streams in order to isolate different source signals, and more implicit ones, like training self-supervised neural networks to encode latent axes of interest as they see fit.

Welcome to my personal PyTorch library for exploring some of these concepts (and to host unit-tested code for reproducing results from my PhD thesis).

Some tools herein:

  • dictionary learning
  • differentiable convex optimization algorithms (AKA "unrolled" learnable encoders, e.g. LISTA, LSALSA)
  • Variational Autoencoder
  • some Python wrappers for SQL

Tools I'm working on:

  • morphological component analysis tools
  • Beyond Backprop style layer-parallelized training

How to try the code

From SparseCoding base dir:

  • Unit tests: `python test_all_unit_tests.py`
  • Train a linear dictionary on the Celeb dataset: `python bin/run.py --config="experiments/train-linear-dict.yml"`

Formal Sparse Coding Background

It is often useful to represent a signal or image in terms of its basic building blocks. For example, a smiley face can be efficiently described as "a circle, two dots, and a curve". At least, that is more efficient than "pixel 1: value 0.1; pixel 2: value 1" and so on for thousands or millions of pixels. This is a toy example of "sparse representation": if we have a dictionary of shapes and curves, we can often describe an image as a weighted sum of those dictionary elements. The fewer dictionary atoms used, the more efficient, or sparse, the representation is. Given a list of dictionary atoms, we can write down the corresponding list of weights (or coefficients); this list is a vector called the code. Codes are specific to dictionaries, and when they are mostly zeros, we call them sparse.
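To make the weighted-sum picture concrete, here is a minimal PyTorch sketch (all names here are illustrative, not this repo's API): a signal assembled from just three dictionary atoms.

```python
import torch

n_pixels, n_atoms = 64, 256          # signal dimension, dictionary size
A = torch.randn(n_pixels, n_atoms)   # dictionary: each column is one atom

# A sparse code: mostly zeros, with three active coefficients.
z = torch.zeros(n_atoms)
z[[3, 17, 42]] = torch.tensor([0.5, -1.2, 0.8])

x = A @ z                            # the signal is a weighted sum of 3 atoms
print(f"{int((z != 0).sum())} of {n_atoms} coefficients are nonzero")
```

Storing `z` (three index/weight pairs) is far cheaper than storing every entry of `x`, which is the whole point of a sparse code.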

Sparse coding is the problem of generating a dictionary and a set of corresponding codes for a dataset. The idea is that, since the codes share a common "language" via the dictionary, the dataset can be represented more efficiently than in its original form. You can also inspect which dictionary atoms are most important, which circles back to the signal decomposition discussed above.

This repository provides some tools and classes for various sparse coding experiments. As of now, the focus is on learning a linear dictionary (e.g. for vectors, including vectorized image patches) from data. The training process yields a dictionary-- i.e. a matrix whose columns are the dictionary elements-- which can be used along with a sparse code to represent a signal.

[Figure: learned dictionary atoms] CIFAR-, ASIRRA-, and Fashion-MNIST-based atoms, with patch sizes 10x10, 16x16, and 10x10, respectively.

This procedure was originally described in "Emergence of simple-cell receptive field properties by learning a sparse code for natural images" by Olshausen and Field, Nature, 381:607–609, 1996. It was famously used in "Learning Fast Approximations of Sparse Coding" (Gregor and LeCun, 2010), which has inspired many more recent papers.

We train by minimizing, with respect to the matrix/dictionary/decoder $A$:

$$\mathcal{L}(A) = \sum_{p=1}^{P} \frac{1}{2} \left\lVert x_p - A z_p^* \right\rVert_2^2 + \alpha \left\lVert z_p^* \right\rVert_1,$$

where $\alpha$ is a scalar parameter that balances sparsity with reconstruction error, $A$ is the dictionary, $x_p$ is the $p$-th training data sample, and $z_p^*$ is its corresponding optimal sparse code.
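As a quick sketch, this objective is only a few lines of PyTorch (a hypothetical helper under the notation above, not necessarily how this repo organizes it):

```python
import torch

def dictionary_loss(A, x_batch, z_batch, alpha):
    """Sparse-coding objective, summed over a batch:
    sum_p 0.5 * ||x_p - A z_p||^2 + alpha * ||z_p||_1."""
    recon = z_batch @ A.T                             # row p is A @ z_p
    recon_err = 0.5 * ((x_batch - recon) ** 2).sum()  # reconstruction error
    sparsity = alpha * z_batch.abs().sum()            # L1 sparsity penalty
    return recon_err + sparsity
```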

What do we mean by an optimal sparse code? And why would we optimize an L1 term that does not include $A$ (and hence contributes a zero subgradient with respect to $A$)? The procedure is as follows.

  1. Select a batch of image patches (or whatever training data): $\{x_p\}_{p=1}^{P}$.
  2. Compute an optimal code for each $x_p$. How? Fix $A$. With $A$ fixed, $\mathcal{L}$ is convex with respect to $z$! So we compute the argument-minimum with respect to $z$ to obtain an optimal code. We call $z_p^* = \arg\min_z \frac{1}{2}\lVert x_p - Az\rVert_2^2 + \alpha\lVert z\rVert_1$ the optimal code of $x_p$, given the current dictionary. In this repo we compute optimal codes using an algorithm called FISTA (see the sketch after this list). Note: $z_p^*$ depends on $A$, but it does NOT depend on the algorithm used to encode $x_p$, since the problem is convex with a unique solution.
  3. Next, we un-fix $A$, compute the gradient of $\mathcal{L}$ with respect to $A$, and perform backpropagation using the batch.
  4. Re-normalize the columns of $A$.
  5. Go back to Step 1 and pull out a fresh batch, unless $A$ has converged.
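Step 2 above relies on FISTA. For reference, here is a textbook FISTA sketch for the lasso subproblem in the notation above (an illustration, not necessarily this repo's encoder):

```python
import torch

def soft_threshold(v, thresh):
    """Proximal operator of the L1 norm."""
    return torch.sign(v) * torch.clamp(v.abs() - thresh, min=0.0)

def fista(A, x, alpha, n_iters=100):
    """Minimize 0.5*||x - A z||^2 + alpha*||z||_1 via FISTA."""
    L = torch.linalg.matrix_norm(A, ord=2) ** 2    # Lipschitz constant of the gradient
    z = torch.zeros(A.shape[1])
    y, t = z.clone(), 1.0
    for _ in range(n_iters):
        grad = A.T @ (A @ y - x)                   # gradient of the smooth term at y
        z_next = soft_threshold(y - grad / L, alpha / L)
        t_next = (1 + (1 + 4 * t**2) ** 0.5) / 2   # momentum schedule
        y = z_next + ((t - 1) / t_next) * (z_next - z)
        z, t = z_next, t_next
    return z
```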

In summary, we do not couple the problems of sparse coding (producing codes) and training a decoder (a.k.a. dictionary). Rather, we alternate between them.
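Putting steps 1-5 together, the alternation might look like the following sketch, reusing the hypothetical `fista` and `dictionary_loss` helpers from above (again illustrative, not this repo's actual training loop):

```python
import torch

def train_dictionary(data_loader, n_pixels, n_atoms, alpha, lr=0.1, epochs=10):
    """Alternate between sparse coding (FISTA) and gradient steps on the dictionary."""
    A = torch.randn(n_pixels, n_atoms, requires_grad=True)
    for _ in range(epochs):
        for x_batch in data_loader:                   # step 1: fresh batch
            with torch.no_grad():                     # step 2: codes with A fixed
                z_batch = torch.stack([fista(A, x, alpha) for x in x_batch])
            loss = dictionary_loss(A, x_batch, z_batch, alpha)
            loss.backward()                           # step 3: gradient w.r.t. A only
            with torch.no_grad():
                A -= lr * A.grad
                A.grad = None
                A /= A.norm(dim=0, keepdim=True)      # step 4: re-normalize columns
    return A                                          # step 5: loop until converged
```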

After successful optimization, the following should hold:

$$x_p \approx A z_p^*, \qquad \text{for } p = 1, \dots, P.$$

In other words, the sparse vector $z_p^*$ multiplied with the (learned) dictionary $A$ provides an efficient approximation to the signal $x_p$.
