# Special deep learning architectures

Here we go through some of the most innovative architectures that have had a substantial impact on AI or its application fields.

## Neural autoencoder

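One of the most influential ideas is the autoencoder structure: an encoder compresses the input into a small code (the bottleneck), and a decoder reconstructs the input from that code.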

<div>
<img src="pictures/Autoencoder_schema.png" width=300>
</div>
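
As a rough sketch of the encoder-bottleneck-decoder idea, the NumPy code below trains a purely linear autoencoder on synthetic data. Everything in it is an illustrative assumption: the synthetic data, the names `W_enc` and `W_dec`, the code size, and the learning rate. A practical autoencoder would add nonlinear activations and more layers.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (an assumption for illustration): 200 samples in 8
# dimensions that actually lie on a 2-dimensional subspace.
latent = rng.normal(size=(200, 2))
mixing = rng.normal(size=(2, 8))
x = latent @ mixing

n_in, n_code = 8, 2                                  # input size, bottleneck size
W_enc = rng.normal(scale=0.1, size=(n_in, n_code))   # encoder weights
W_dec = rng.normal(scale=0.1, size=(n_code, n_in))   # decoder weights


def encode(data):
    """Compress each sample to an n_code-dimensional code."""
    return data @ W_enc


def reconstruct(data):
    """Encode, then decode back to the input space."""
    return encode(data) @ W_dec


def mse(data):
    """Mean squared reconstruction error."""
    return float(np.mean((reconstruct(data) - data) ** 2))


loss_before = mse(x)
learning_rate = 0.01
for _ in range(2000):
    code = x @ W_enc
    err = code @ W_dec - x                      # reconstruction error
    grad_dec = code.T @ err / len(x)            # gradient w.r.t. decoder
    grad_enc = x.T @ (err @ W_dec.T) / len(x)   # gradient w.r.t. encoder
    W_dec -= learning_rate * grad_dec
    W_enc -= learning_rate * grad_enc
loss_after = mse(x)

print(f"MSE before: {loss_before:.4f}, after: {loss_after:.4f}")
```

The target of training is the input itself, so no labels are needed. The bottleneck forces the network to keep only the information required for reconstruction.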

### MLP Autoencoder

**Example:** MLP autoencoder for Fashion-MNIST

See the Colab code

 * https://colab.research.google.com/drive/1T4A68XyDcMJE9xtymo9V8y0sKnM-lNvG?usp=sharing 

### Convolutional autoencoder

"Fully convolutional neural networks" were introduced in

 * J Long, E Shelhamer, T Darrell (2015): "Fully convolutional networks for semantic segmentation". In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. [PDF](https://openaccess.thecvf.com/content_cvpr_2015/papers/Long_Fully_Convolutional_Networks_2015_CVPR_paper.pdf)

### Deconvolution

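Deconvolution is more accurately called transposed convolution: it applies the transpose of the convolution's matrix form. It restores the spatial size that a strided convolution reduced, but it is not the mathematical inverse of convolution.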

**Example:** PyTorch ConvTranspose2d

 * [Colab example](https://colab.research.google.com/drive/1YwLn1PtM6qqW290eAhb-Cx_BtJqSdGKd?usp=sharing)
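
To make the "transpose" in transposed convolution concrete, here is a small pure-Python sketch in one dimension. The helper names `conv1d` and `conv_transpose1d` are hypothetical and written for this illustration; they are not the PyTorch API, and padding and bias are left out.

```python
def conv1d(x, kernel, stride=1):
    """Valid 1-D cross-correlation (what deep learning calls convolution)."""
    k = len(kernel)
    n_out = (len(x) - k) // stride + 1
    return [sum(x[i * stride + j] * kernel[j] for j in range(k))
            for i in range(n_out)]


def conv_transpose1d(y, kernel, stride=1):
    """Transpose of conv1d: each input value scatters a scaled kernel copy."""
    k = len(kernel)
    out = [0] * ((len(y) - 1) * stride + k)
    for i, value in enumerate(y):
        for j in range(k):
            out[i * stride + j] += value * kernel[j]
    return out


def dot(a, b):
    """Inner product of two equally long lists."""
    return sum(p * q for p, q in zip(a, b))


x = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 0, -1]

y = conv1d(x, kernel, stride=2)               # length 7 -> length 3
x_up = conv_transpose1d(y, kernel, stride=2)  # length 3 -> length 7

print(y)     # [-2, -2, -2]
print(x_up)  # [-2, 0, 0, 0, 0, 0, 2]

# Transpose (adjoint) property: <conv(x), v> == <x, conv_transpose(v)>
v = [1, -1, 2]
print(dot(conv1d(x, kernel, 2), v), dot(x, conv_transpose1d(v, kernel, 2)))
```

The output `x_up` has the original length but different values from `x`, which shows that the operation recovers the shape, not the signal. In a decoder the kernel is learned, so the network can turn this upsampling into a useful reconstruction.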


### Applications

#### Image denoising

 * See [Colab example](https://colab.research.google.com/drive/1-b2mVrTgjdeiuww7ONFTneakZ-3xElWC?usp=sharing)

#### Anomaly detection

 * See [Colab](https://colab.research.google.com/drive/1hLUVaEjEKVdnLPLnukKcX1qucdggbLG0?usp=sharing)

##### Detection threshold selection

<div>
<img src="pictures/gaussian_fractiles.jpg" width="600">
</div>
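
One way to turn the Gaussian fractiles above into a detection threshold is sketched below. It assumes that reconstruction errors on normal (non-anomalous) data are roughly Gaussian; the function names, the false alarm rate, and the error values are made up for illustration and are not taken from the Colab notebook.

```python
from statistics import NormalDist, mean, stdev


def gaussian_threshold(normal_errors, false_alarm_rate=0.01):
    """Threshold that a Gaussian fit to normal errors exceeds only rarely."""
    mu = mean(normal_errors)
    sigma = stdev(normal_errors)
    # The (1 - rate) fractile of the fitted Gaussian.
    return NormalDist(mu, sigma).inv_cdf(1.0 - false_alarm_rate)


def detect(errors, threshold):
    """Flag samples whose reconstruction error exceeds the threshold."""
    return [e > threshold for e in errors]


# Hypothetical reconstruction errors measured on normal validation data.
normal_errors = [0.10, 0.12, 0.09, 0.11, 0.13, 0.10, 0.08, 0.12]
threshold = gaussian_threshold(normal_errors, false_alarm_rate=0.01)

# Hypothetical errors for two new samples: one typical, one far too large.
flags = detect([0.11, 0.30], threshold)
print(f"threshold = {threshold:.3f}, flags = {flags}")
```

A lower false alarm rate pushes the threshold further into the tail, which reduces false alarms but lets more anomalies go undetected. If the errors are clearly not Gaussian, an empirical percentile of the validation errors may be a safer choice.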

### Image segmentation

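The same idea also works for annotated images: the decoder is trained to output the annotation (a segmentation map) instead of reconstructing the input.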

<div><img src="pictures/image_segmentation_autoencoder.png" width=600></div>

See also [Nvidia Canvas](https://www.nvidia.com/en-us/studio/canvas/)

## Autoencoder-inspired architectures

### U-Net

See: https://en.wikipedia.org/wiki/U-Net

### Generative Adversarial Networks (GANs)

StyleGAN: https://en.wikipedia.org/wiki/StyleGAN

Code: https://github.com/NVlabs/stylegan

More recently, GANs have largely been superseded by [diffusion models](https://en.wikipedia.org/wiki/Diffusion_model) for image generation.

### DeepFake

See: https://www.alanzucconi.com/2018/03/14/understanding-the-technology-behind-deepfakes/

Video: https://www.youtube.com/watch?v=OmB7fmi8JwY

Code: plenty of open-source implementations are available.

### The power of vast training data and self-supervision

Frozen people: https://mannequin-depth.github.io/

## Other important architectures

### Neural radiance field (NeRF)

For 3D models

See https://en.wikipedia.org/wiki/Neural_radiance_field

More recently, often superseded by 3D Gaussian splatting (see e.g. https://repo-sam.inria.fr/fungraph/3d-gaussian-splatting/ )

## References

For the crazy new ideas, you need to follow the top AI conferences every year, for example:

 * [NeurIPS](https://neurips.cc) - The top-tier conference of AI
 * [CVPR](https://cvpr.thecvf.com) - The top-tier conference of computer vision
 * [ICASSP](https://2024.ieeeicassp.org) - The top-tier conference of audio and signal processing
 * [SIGGRAPH](https://www.siggraph.org) - The top-tier conference on computer graphics
