
Magnushhoie/weightless_NN_decompression

Tldr

The notebook.ipynb demonstrates the following proof of concept:

  • Encoder neural network: Compresses data.txt -> compressed.txt using a simple LSTM neural network
  • Decoder neural network: Decompresses compressed.txt directly, without transmitting any neural network weights.
  • How: Encoder and decoder are trained in a deterministic, synchronized process, each learning only from data that has already been seen (decompressed) so far. This guarantees both networks share exactly the same state over time, removing the need to store the weights externally.

Rationale

Neural network-based language models are well suited to compressing text, since they can efficiently predict the next word in a sentence. Instead of storing every word directly, we can find where each word ranks among the network's top predicted next words and store only that index.

Character counts (excluding spaces):

  • Original words: An apple a day keeps the doctor away (29 characters)
  • Neural network predicted word indices: 40 9 6 3 1 1 1 1 (9 characters)

Even this naive implementation achieves an impressive compression ratio of 9/29 ≈ 0.31.
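Here is a minimal sketch of the rank-based idea, with a deterministic toy predictor standing in for a neural network (the predictor and its ranking rule are purely illustrative, not the notebook's code):

```python
# Toy rank-based encoding: store each word's rank in the predictor's list
# of candidate next words instead of the word itself.

def predict_ranked(context, vocab):
    # Illustrative stand-in for a neural LM: any deterministic ranking works,
    # as long as encoder and decoder use the same one.
    return sorted(vocab, key=lambda w: (w == context, w))

sentence = "an apple a day keeps the doctor away".split()
vocab = sorted(set(sentence))

# Encode: replace each word by its index in the ranked prediction list.
ranks, context = [], "<s>"
for word in sentence:
    ranked = predict_ranked(context, vocab)
    ranks.append(ranked.index(word))
    context = word

# Decode: re-running the same ranking procedure recovers the words.
decoded, context = [], "<s>"
for r in ranks:
    ranked = predict_ranked(context, vocab)
    decoded.append(ranked[r])
    context = ranked[r]

assert decoded == sentence
print(ranks)  # a good predictor keeps these indices close to 0
```

With a real language model, likely continuations sit near the top of the prediction list, so most stored indices are small and take few characters.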


However, this only works if the neural network weights are already available. If we had to include them, we would likely throw away any compression gains. Unless there is a way to skip storing them entirely ...

The proof of concept below details a way to avoid storing the weights, by learning them on the go from the compressed data itself.

The idea comes from this 2019 NNCP paper, which currently holds the world record for the smallest compressed version of the Wikipedia file (~1 GB -> ~100 MB). You can read more in this Hacker News post.

Implementation details

We encode sequences of digits like "000000", "000001", etc., and store the compressed data in compressed.txt. Instead of using the index of the most likely next word, we'll be even more efficient and use an Arithmetic Compressor.
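As a rough intuition for what the arithmetic coder does (conceptual only; the notebook drives its coder with the LSTM's adaptive probabilities rather than a fixed table), each symbol narrows an interval in [0, 1) in proportion to its probability, and any number in the final interval identifies the whole sequence:

```python
from fractions import Fraction

def narrow_interval(symbols, probs):
    # Shrink [low, high) once per symbol, by that symbol's share of the interval.
    low, high = Fraction(0), Fraction(1)
    for s in symbols:
        span, cum = high - low, Fraction(0)
        for sym, p in probs.items():
            if sym == s:
                low, high = low + span * cum, low + span * (cum + p)
                break
            cum += p
    return low, high

probs = {"0": Fraction(9, 10), "1": Fraction(1, 10)}  # skewed digit probabilities compress well
low, high = narrow_interval("000001", probs)
print(low, high)  # any number inside this narrow interval encodes the whole sequence
```

The better the model's probabilities, the wider the final interval and the fewer bits needed to pin it down.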

Despite not saving the neural network's weights, we can then decompress this data, retrieving the original sequences. This is achieved by ensuring that both the encoder and decoder evolve identically during their respective processes.

Both encoder and decoder start with the same initial model. As they process the sequences, they update their models identically, ensuring synchronized evolution.
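A minimal sketch of that shared starting point, assuming PyTorch (the class name and hyperparameters are illustrative, not the notebook's exact code): every weight is set to the same constant, so encoder and decoder construct bit-identical models without ever exchanging a weight file.

```python
import torch
import torch.nn as nn

class DigitLSTM(nn.Module):
    def __init__(self, vocab_size=10, hidden=64, init_value=0.05):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.head = nn.Linear(hidden, vocab_size)
        # All weights get the same constant value, so two independently
        # constructed models start from exactly the same state.
        for p in self.parameters():
            nn.init.constant_(p, init_value)

    def forward(self, x):          # x: (batch, seq_len) of digit ids
        h, _ = self.lstm(self.embed(x))
        return self.head(h)        # (batch, seq_len, vocab_size) logits

encoder_model, decoder_model = DigitLSTM(), DigitLSTM()
assert all(torch.equal(a, b) for a, b in
           zip(encoder_model.parameters(), decoder_model.parameters()))
```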

Encoder neural network:

  • Initialize a neural network with all weights set to the same value (we need the weight updates to be deterministic)
  • (Nb: We save the first sequence without compressing it)
  • For each digit in a sequence, predict the next digit using a neural network model (learning a probability distribution)
  • Update the neural network based on the loss
  • When done with a sequence, compress the next one with an Arithmetic Compressor, using the probability distribution learned so far (see the sketch below).
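Put together, the encoder loop might look roughly like this (a sketch assuming PyTorch; arithmetic_encode is a hypothetical helper standing in for whatever arithmetic coder the notebook uses, queried with the model's per-prefix digit probabilities):

```python
import torch
import torch.nn.functional as F

def encode_file(sequences, model, arithmetic_encode, lr=0.01):
    opt = torch.optim.SGD(model.parameters(), lr=lr)  # plain SGD keeps updates deterministic
    compressed = [sequences[0]]                       # first sequence is stored as-is
    for prev_seq, next_seq in zip(sequences, sequences[1:]):
        # Train on the sequence both sides have already seen.
        x = torch.tensor([[int(c) for c in prev_seq[:-1]]])
        y = torch.tensor([[int(c) for c in prev_seq[1:]]])
        logits = model(x)
        loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)), y.reshape(-1))
        opt.zero_grad()
        loss.backward()
        opt.step()
        # Compress the next sequence with the freshly updated model.
        with torch.no_grad():
            compressed.append(arithmetic_encode(next_seq, model))
    return compressed
```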

Decoder neural network:

  • Initialize the same neural network with the same fixed value
  • (Nb: load the first, uncompressed sequence)
  • Predict each digit of a sequence using the current state of the neural network
  • Update the neural network based on the loss (mirroring the encoder state)
  • Decompress the next sequence based on the learned probability distribution.
  • Train on the now-decompressed sequence and use the updated model to decompress the next one, repeating until all sequences are decoded (see the sketch below)
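The decoder mirrors the encoder step for step (again a sketch; arithmetic_decode is the hypothetical counterpart of arithmetic_encode above):

```python
import torch
import torch.nn.functional as F

def decode_file(compressed, model, arithmetic_decode, lr=0.01):
    opt = torch.optim.SGD(model.parameters(), lr=lr)  # same optimizer settings as the encoder
    sequences = [compressed[0]]                       # first sequence was stored uncompressed
    for block in compressed[1:]:
        # Train on the most recently decompressed sequence, exactly as the encoder did.
        prev_seq = sequences[-1]
        x = torch.tensor([[int(c) for c in prev_seq[:-1]]])
        y = torch.tensor([[int(c) for c in prev_seq[1:]]])
        logits = model(x)
        loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)), y.reshape(-1))
        opt.zero_grad()
        loss.backward()
        opt.step()
        # The model now matches the encoder's state, so the next block decodes correctly.
        with torch.no_grad():
            sequences.append(arithmetic_decode(block, model))
    return sequences
```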

Read more
