Skip to content
/ ICM-v1 Public

Official implementation of the paper "Scalable Image Coding for Humans and Machines Using Feature Fusion Network".

Notifications You must be signed in to change notification settings

final-0/ICM-v1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scalable Image Coding for Humans and Machines

Image Coding for Machines



This is the official implementation of the following papers.

・Image Coding for Machines with Edge Information Learning Using Segment Anything (arXiv)

Paper arXiv

・Scalable Image Coding for Humans and Machines Using Feature Fusion Network (arXiv)

Paper arXiv


Installation

  1. Clone this repository from GitHub.
git clone https://github.com/final-0/ICM-v1.git
  1. Change the current directory to the "ICM-v1" folder.
cd ICM-v1/
  1. Install the required dependencies in the current directory.
pip3 install -r requirements.txt 

Recommended Specs

For testing only, a GPU with about 11GB of memory is sufficient. (e.g. 1080ti, 2080ti)


Usage

Download model checkpoints. You can obtain "ica.pth.tar" and "icm.pth.tar" from this link. These checkpoints can be used by placing them in the "param" folder.

param ---- param_details.txt
       |-- icm.pth.tar (image compression model for Machines)
       |-- ica.pth.tar (additional information compression model)

If you want to compress images for "Machines", run the following command :

python3 coding_m.py --checkpoint param/icm.pth.tar --input image/input

Add “--real” to the command to obtain a bit-stream:

python3 coding_m.py --checkpoint param/icm.pth.tar --input image/input --real

If you want to compress images for "Humans" & "Machines", run the following command :

python3 coding_hm.py --checkpoint_m param/icm.pth.tar --checkpoint_a param/ica.pth.tar --input image/input


Reconstructed Samples

About

Official implementation of the paper "Scalable Image Coding for Humans and Machines Using Feature Fusion Network".

Topics

Resources

Stars

Watchers

Forks

Languages