
Autoencoder based image compression: can the learning be quantization independent?

This repository is a Tensorflow implementation of the paper "Autoencoder based image compression: can the learning be quantization independent?", ICASSP, 2018.

ICASSP 2018 paper | Project page with visualizations

The code is tested on Linux and Windows.

Prerequisites

  • Python (code tested using Python 2.7.9 and Python 3.6.3)
  • numpy (version >= 1.11.0)
  • tensorflow (optional GPU support), see TensorflowInstallationWebPage (for Python 2.7.9, the code was tested with Tensorflow 0.11.0; for Python 3.6.3, with Tensorflow 1.4.0; the code should thus work with any Tensorflow 0.x or 1.x release, x being the subversion index)
  • cython (code tested with cython 0.25.2)
  • matplotlib (code tested with matplotlib 1.5.3)
  • pillow (code tested with pillow 3.4.2)
  • scipy (code tested with scipy 0.18.1)
  • six
  • glymur (code tested with Glymur 0.8.10), see GlymurWebPage
  • ImageMagick, see ImageMagickWebPage
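Before running any script, it can help to check which of the Python prerequisites above are importable in the current environment. The sketch below is a convenience for Python 3 only (it relies on `importlib.util.find_spec`, which Python 2.7 lacks) and is not part of the repository.

```python
# Check which prerequisites are importable, without actually importing
# them (find_spec only locates a module, it does not run its code).
import importlib.util

def is_available(module_name):
    """Return True if `module_name` can be imported in this environment."""
    return importlib.util.find_spec(module_name) is not None

if __name__ == '__main__':
    # 'PIL' is the importable name of the pillow package,
    # 'Cython' that of the cython package.
    for name in ('numpy', 'tensorflow', 'Cython', 'matplotlib',
                 'PIL', 'scipy', 'six', 'glymur'):
        status = 'found' if is_available(name) else 'MISSING'
        print('{:<12} {}'.format(name, status))
```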

Cloning the code

Clone this repository into the current folder.

git clone https://github.com/thierrydumas/autoencoder_based_image_compression.git
cd autoencoder_based_image_compression/kodak_tensorflow/

Compilation

  1. Compilation of the C++ lossless coder via Cython.
    cd lossless
    python setup.py build_ext --inplace
    cd ../
  2. Compilation of HEVC/H.265.
    • For Linux,
      cd hevc/HM-16.15/build/linux/
      make
      cd ../../../../
    • For Windows, use Visual Studio 2015 and the solution file at "hevc/HM-16.15/build/HM_vc2015.sln". For more information, see HEVCSoftwareWebPage.

Quick start: reproducing the main results of the paper

  1. Creation of the Kodak test set containing 24 luminance images.
    python creating_kodak.py
  2. Comparison of several trained autoencoders, JPEG2000, and H.265 in terms of rate-distortion on the Kodak test set.
    python reconstructing_eae_kodak.py
    After running Step 2, the folder "eae/visualization/test/checking_reconstructing/kodak/" contains the reconstructions of the Kodak luminance images, together with the rates and the PSNRs associated with their compression via the trained autoencoders, JPEG2000, and H.265.
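For reference, the PSNR values reported in Step 2 follow the standard definition for 8-bit luminance images. The function below is a generic numpy sketch of that definition, not the repository's own implementation.

```python
import numpy as np

def psnr(reference, reconstruction, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two 8-bit luminance images."""
    mse = np.mean((reference.astype(np.float64)
                   - reconstruction.astype(np.float64))**2)
    if mse == 0.0:
        # Identical images: the PSNR is infinite.
        return float('inf')
    return 10.0*np.log10(max_value**2/mse)
```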

Quick start: training an autoencoder

  1. First of all, ImageNet images must be downloaded. In our case, it is sufficient to download the ILSVRC2012 validation images, "ILSVRC2012_img_val.tar" (6.3 GB), see ImageNetDownloadWebPage. Let's say that, on your computer, the path to "ILSVRC2012_img_val.tar" is "path/to/folder_0/ILSVRC2012_img_val.tar", and that you want the unpacked images to be put into the folder "path/to/folder_1/" before the script "creating_imagenet.py" preprocesses them. The ImageNet training and validation sets of luminance images are then created via
    python creating_imagenet.py path/to/folder_1/ --path_to_tar=path/to/folder_0/ILSVRC2012_img_val.tar
  2. The training of an autoencoder on the ImageNet training set is done via the command below. 1.0 is the value of the quantization bin widths at the beginning of the training. 14000.0 is the value of the coefficient weighting the distortion term and the rate term in the objective function to be minimized over the parameters of the autoencoder. The script "training_eae_imagenet.py" makes it possible to split the entire autoencoder training into several successive parts. The last argument, 0, means that "training_eae_imagenet.py" runs the first part of the entire training. For each successive part, the last argument is incremented by 1.
    python training_eae_imagenet.py 1.0 14000.0 0
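To make the roles of the two numeric arguments concrete, here is a minimal numpy sketch of the quantities involved, assuming a uniform scalar quantizer and an empirical-entropy rate estimate. The actual training script optimizes a Tensorflow graph, so the names `uniform_quantize`, `empirical_entropy`, and `rate_distortion_objective` below are illustrative, not the repository's API.

```python
import numpy as np

def uniform_quantize(latents, delta):
    """Uniform scalar quantization with bin width `delta` (the 1.0 argument)."""
    return delta*np.round(latents/delta)

def empirical_entropy(symbols):
    """Empirical entropy, in bits per symbol, of a discrete-valued array."""
    _, counts = np.unique(symbols, return_counts=True)
    probabilities = counts/float(symbols.size)
    return -np.sum(probabilities*np.log2(probabilities))

def rate_distortion_objective(image, reconstruction, latents, delta, gamma):
    """Distortion plus a weighted rate estimate (gamma plays the role
    of the 14000.0 argument, trading off the two terms)."""
    distortion = np.mean((image - reconstruction)**2)
    rate = empirical_entropy(uniform_quantize(latents, delta))
    return distortion + gamma*rate
```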

Full functionality

The documentation "documentation_kodak/documentation_code.html" describes all the functionality of the code of the paper.

A simple example

The folder "svhn" contains a simple example that introduces the code of the paper. Its documentation is in the file "documentation_svhn/documentation_code.html". If you are already comfortable with autoencoders, this example can be skipped. Its purpose is to clarify the training of a rate-distortion optimized autoencoder: a simple rate-distortion optimized autoencoder with very few hidden units is trained on tiny images (32x32 SVHN digits).
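The core idea the SVHN example illustrates can be sketched in a few lines of numpy: an encoder, a uniform scalar quantizer between encoder and decoder, and a decoder. The weights and dimensions below are made up for illustration; the real example learns the weights by minimizing a rate-distortion objective.

```python
import numpy as np

rng = np.random.RandomState(0)

def encoder(x, w_enc):
    """Toy non-linear encoder mapping an image to a latent vector."""
    return np.tanh(x.dot(w_enc))

def quantize(y, delta=1.0):
    """Uniform scalar quantization of the latent vector."""
    return delta*np.round(y/delta)

def decoder(y_hat, w_dec):
    """Toy linear decoder mapping the quantized latents back to an image."""
    return y_hat.dot(w_dec)

# A 32x32 image flattened to 1024 components, 64 hidden units;
# the weights are random placeholders, not trained parameters.
x = rng.rand(1, 1024)
w_enc = 0.01*rng.randn(1024, 64)
w_dec = 0.01*rng.randn(64, 1024)

reconstruction = decoder(quantize(encoder(x, w_enc)), w_dec)
```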

Citing

@InProceedings{autoencoder_based_icassp2018,
  author = {Dumas, Thierry and Roumy, Aline and Guillemot, Christine},
  title = {Autoencoder based image compression: can the learning be quantization independent?},
  booktitle = {ICASSP},
  year = {2018}
}

About

Autoencoder based image compression: can the learning be quantization independent? https://arxiv.org/abs/1802.09371
