
Neural Filters

This repo contains experiments with a blend of classical and neural-network-based filtering algorithms.

Task

  • Regression of coordinates from a sequence of 1d images, as a toy proxy for more sophisticated state estimation.
  • Labels are corrupted by Gaussian noise as well as salt-and-pepper noise.
  • Input images are generated from the corrupted (!) labels by painting a Gaussian, with mean corresponding to the coordinate signal, into the image.
  • The coordinate signal is synthetic, realized by a concatenation of constant, sinusoidal, and triangular waveforms with subsequent smoothing. (A rough sketch of this pipeline follows the list.)
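A minimal sketch of this data generation, assuming illustrative constants and a single sinusoidal segment; the function `make_sample` and all its parameters are hypothetical, not the repo's actual code:

```python
import numpy as np

def make_sample(T=256, width=64, sigma_label=0.02, sp_prob=0.05):
    """Generate (images, corrupted labels, clean signal) for one sequence."""
    rng = np.random.default_rng()
    t = np.linspace(0.0, 4.0 * np.pi, T)
    signal = 0.5 + 0.3 * np.sin(t)                     # one sinusoidal segment
    labels = signal + rng.normal(0.0, sigma_label, T)  # Gaussian label noise
    outliers = rng.random(T) < sp_prob                 # salt-and-pepper outliers
    labels[outliers] = rng.random(int(outliers.sum()))
    # paint a Gaussian bump centered at the (corrupted) label into each image
    xs = np.linspace(0.0, 1.0, width)
    images = np.exp(-0.5 * ((xs[None, :] - labels[:, None]) / 0.05) ** 2)
    return images, labels, signal
```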

Below is a visualization of the data. The red line shows the uncorrupted signal.

[Data vis]

Questions

  • Can "smoothness" penalties yield better predictions? In particular, can they suppress noise and ignore outliers?
  • How do models with a classical structure, like the Kalman filter, perform versus LSTMs?
  • Can the models be trained end-to-end?

Not covered:

  • How does the smoothness penalty compare to pre-filtering the labels?

These questions have been answered in the literature for various use cases, but I wanted to run my own experiments.

The idea behind the smoothness penalty can be extended to enforcing arbitrary differential constraints on the predicted signal f, i.e. g(f(x), df/dx(x), d^2f/dx^2(x), ...) = 0. In some examples here, this is realized with a penalty on the "acceleration" d^2f/dx^2(x) that limits its magnitude to below a threshold. It is also possible (but not shown here) to penalize the amplitudes of selected frequency components, since the Fourier transform that yields the amplitudes is differentiable.
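For instance, the acceleration penalty could look like the following sketch, using a finite-difference second derivative and a hinge at the threshold; the name `smoothness_penalty` and the exact thresholded form are assumptions, not the repo's exact loss:

```python
import torch

def smoothness_penalty(y, threshold=0.0):
    """Penalize the discrete second derivative ("acceleration") of a
    predicted signal y of shape (batch, time); magnitudes below
    `threshold` incur no cost."""
    accel = y[..., 2:] - 2.0 * y[..., 1:-1] + y[..., :-2]
    return torch.relu(accel.abs() - threshold).pow(2).mean()

# combined objective, schematically:
# loss = mse(pred, labels) + lam * smoothness_penalty(pred, threshold=0.1)
```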

Code

  • LSTM-Denoise.ipynb: LSTM based filters
  • KF-Denoise.ipynb: Kalman filters
  • PF-Denoise.ipynb: Particle filters

The rest is boilerplate and utility.

Dependencies: PyTorch, NumPy, Matplotlib

Examples

The blue line shows the prediction. The models are trained with an L2 loss on the coordinate prediction.

Single frame regression

A linear layer attached to a small convolutional backbone directly regresses the coordinate.

[Single frame regression]
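A plausible sketch of such a regressor, assuming an arbitrary small 1d conv backbone; the class name and layer sizes are illustrative, not the repo's architecture:

```python
import torch.nn as nn

class FrameRegressor(nn.Module):
    """Regress a coordinate from a single 1d image of shape (batch, 1, width)."""
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(16, 32, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(8), nn.Flatten(),
        )
        self.head = nn.Linear(32 * 8, 1)  # linear layer on top of the backbone

    def forward(self, x):
        return self.head(self.backbone(x)).squeeze(-1)  # (batch,)
```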

Kalman filter

The observation model, based on the same backbone, outputs the mean and variance of a normal distribution. The state evolution model is the linear piecewise-constant-velocity model with noisy acceleration; its coefficients are fixed.

[Kalman filter]

The posterior variance from the observation model is also shown, but its overall scale depends on the parameters of the motion model.
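For reference, one predict-update step of such a constant-velocity Kalman filter might look like the following sketch; `kf_step` and the constants are hypothetical, with the neural observation model supplying the measurement z and its variance r:

```python
import torch

def kf_step(mean, cov, z, r, dt=1.0, q=1e-2):
    """One Kalman step for state (position, velocity); mean: (2,), cov: (2, 2)."""
    F = torch.tensor([[1.0, dt], [0.0, 1.0]])           # constant-velocity dynamics
    Q = q * torch.tensor([[dt**4 / 4, dt**3 / 2],        # piecewise white-noise
                          [dt**3 / 2, dt**2]])           # acceleration model
    H = torch.tensor([[1.0, 0.0]])                       # observe position only
    # predict
    mean = F @ mean
    cov = F @ cov @ F.T + Q
    # update with the neural observation (z, r)
    S = H @ cov @ H.T + r
    K = cov @ H.T / S
    mean = mean + K @ (z - H @ mean)
    cov = (torch.eye(2) - K @ H) @ cov
    return mean, cov
```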

LSTM with smoothness loss

An LSTM on top of the CNN backbone. In addition to the L2 loss, the second derivative of the output is heavily penalized.

[LSTM model]
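A rough sketch of this architecture, with per-frame CNN features fed through an LSTM; the class name and layer sizes are assumptions:

```python
import torch.nn as nn

class LSTMFilter(nn.Module):
    """Filter a sequence of 1d images, x of shape (batch, time, width)."""
    def __init__(self, feat_dim=32, hidden=64):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv1d(1, feat_dim, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1), nn.Flatten(),
        )
        self.lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):
        b, t, w = x.shape
        feats = self.backbone(x.reshape(b * t, 1, w)).reshape(b, t, -1)
        out, _ = self.lstm(feats)          # recurrence across time steps
        return self.head(out).squeeze(-1)  # (batch, time)
```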

Particle filter with smoothness loss

The observation model is a neural network that takes the particle state and the observation image as input and outputs a score for how well the particle matches the observation. Another neural network performs the prediction step for the particles. Noise is injected into the motion model to give it some randomness and allow the particles to diffuse. Resampling is also performed during training.
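One predict-weight-resample step could be sketched as follows, with `dynamics` and `score` standing in for the learned motion and observation networks (both names hypothetical; plain multinomial resampling is shown, not necessarily what the repo uses during training):

```python
import torch

def pf_step(particles, weights, image, dynamics, score, noise_std=0.05):
    """One particle-filter step; particles: (N, state_dim), weights: (N,)."""
    # prediction: learned motion model plus injected noise so the cloud diffuses
    particles = dynamics(particles) + noise_std * torch.randn_like(particles)
    # weighting: the observation network scores each particle against the image
    logw = weights.log() + score(particles, image)
    weights = torch.softmax(logw, dim=0)
    # resampling: multinomial draw proportional to the weights
    idx = torch.multinomial(weights, num_samples=len(weights), replacement=True)
    particles = particles[idx]
    weights = torch.full_like(weights, 1.0 / len(weights))
    return particles, weights
```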

Note that the model cannot really shine here because the input is relatively unambiguous. Particle filter models are effective for representing multi modal state distributions as might arise if for instance the input image had two peaks and the model could not immediately decide which one corresponds to the true state.

[Particle filter model]
