Huffman-Coding

A C++ compression and decompression program based on Huffman Coding.

Introduction

This project is to design compression and decompression programs based on Huffman Coding. The idea of Huffman Coding is to minimize the weighted expected length of the code by means of assigning shorter codes to frequently-used characters and longer codes to seldom-used code.

Implementation Details

The programs can compress and decompress text consisting of 128 ASCII characters.

Compression

Get the text as input, count and store the frequencies of different characters;
Construct Priority Queue and then Huffman Tree;
Calculate Huffman Encode Table;
Encode the given text using huffman table.

Decompression

Decode the compressed text based on the huffman tree.

The algorithm serializes the encoded text as a binary/decimal sequence and the corresponding Huffman binary tree in order to be able to decode back the text in the future.

Here is an example: Frequency of each character in the given text:

Huffman codes and encoded text in a binary format, decoded text and compression ratio:

As can be seen, string is compressed to about 6% of the original string which was given as input.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
frequencies.jpg		frequencies.jpg
huffmanCodes.jpg		huffmanCodes.jpg
huffmanCoding.cpp		huffmanCoding.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Huffman-Coding

Introduction

Implementation Details

Compression

Decompression

About

Releases

Packages

Languages

Amisha2001/Huffman-Coding

Folders and files

Latest commit

History

Repository files navigation

Huffman-Coding

Introduction

Implementation Details

Compression

Decompression

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages