Skip to content

ShinyWang33/eSL-Net

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

eSL-Net

show
Figure 1. Our eSL-Net reconstructs high-resolution, sharp and clear intensity images for event cameras by APS frames and the corresponding event sequences.

This is code for the paper Event Enhanced High-Quality Image Recovery by Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang.

You can find a pdf of the paper here. The paper has been accepted by ECCV2020. If you use of this code or the synthetic dataset, please cite the following publications:

@inproceedings{wang2020event,
  title={Event Enhanced High-Quality Image Recovery},
  author={Wang, Bishan and He, Jingwei and Yu, Lei and Xia, Gui-Song and Yang, Wen},
  booktitle={European Conference on Computer Vision},
  year={2020},
  organization={Springer}
}

Run

  • Pretrained model with SR : code/pre_trained/model_withsr_pretrained.pt

  • Pretrained model without SR : code/pre_trained/model_withoutsr_pretrained.pt

  • Model of eSL-Net with SR: code/model_sr.py

  • Model of eSL-Net without SR: code/model_withoutsr.py

  • Example files with event data: data_example

    • Data of APS frames: data_example/cups/images、 data_example/pic2/images

    • Data of events: data_example/cups/mat、 data_example/pic2/mat

      if you have new event data, you can preprocess the events referring to data_example/cups/mat and data_example/pic2/mat.

    • the path of loading input for eSL-Net: realdata_list.txt

  • Run reconstruction:

    cd code
    

    if you want to reconstruct images without SR:

    python test_realdata.py --sr=0 --model=pre_trained/model_withoutsr_pretrained.pt --num_frame=3 --output_path=realdata_dn/
    

    if you want to reconstruct images with SR:

    python test_realdata.py --sr=1 --model=pre_trained/model_withsr_pretrained.pt --num_frame=3 --output_path=realdata_sr/
    

Synthetic Dataset

This synthetic dataset is generated from high-resolution sharp images of GoPro dataset and ESIM. And the process of generating the synthetic dataset is described in detailed in our paper.

Downloads are available via Baidu Net Disk.

Type Train Validation
HR clear sharp images train_sharp_hr(password: 1e2d) val_sharp_hr(password: we5s)
LR clear sharp images train_sharp_lr(password: 5qv6) val_sharp_lr(password: fqkk)
LR noisy blurry images train_blur_lr(password: qbpb) val_blur_lr(password: ngvv)
Event sequences with noises train_esim(password: 8m73) val_esim(password: gwhk)

Contents

Introduction

With extremely high temporal resolution, event cameras have a large potential for robotics and computer vision. However, the recovering of high-quality images from event cameras is a very challenge problem, where the following issues should be addressed simultaneously.

  • Low frame-rate and blurry intensity images: The APS (Active Pixel Sensor) frames are with relatively low frame-rate. And the motion blur is inevitable when recording highly dynamic scenes.

  • High level and mixed noises: The thermal effects or unstable light environment can produce a huge amount of noisy events. Together with the noises from APS frames, the reconstruction of intensity image would fall into a mixed noises problem.

  • Low spatial-resolution: The leading commercial event cameras are typically with very low spatial-resolution. And there is a balance between the spatial-resolution and the latency

In our paper, we propose an explainable network, an event-enhanced Sparse Learning Network (eSL-Net), to recover the high-quality images from event cameras. Since events depict brightness changes, with the enhanced degeneration model by the events, the clear and sharp high-resolution latent images can be recovered from the noisy, blurry and low-resolution intensity observations. Exploiting the framework of sparse learning, the events and the low-resolution intensity observations can be jointly considered. Furthermore, without additional training process, the proposed eSL-Net can be easily extended to generate continuous frames with frame-rate as high as the events.

haha
Figure 2. Architecture of the proposed eSL-Net.

Results of Reconstruction

Qualitative Comparisons of Reconstruction on the synthetic dataset

syn_result1 syn_result2 syn_result3
Qualitative comparison of eSL-Net to EDI, CF and MR with SR method on the synthetic dataset.
Methods EDI+RCAN 4x CF+RCAN 4x MR+RCAN 4x eSL-Net 4x
PSNR(dB) 12.88 12.89 12.89 25.41
SSIM 0.4647 0.4638 0.4643 0.6727
Quantitative comparison of our outputs to EDI, CF and MR with SR method on the synthetic dataset.

Qualitative Comparisons of Reconstruction on the real dataset

real_result1 real_result2 real_result3
Qualitative comparison of eSL-Net to EDI, CF and MR with SR method on the real dataset.
real data/BRISQUE EDI+RCAN 4x CF+RCAN 4x MR+RCAN 4x eSL-Net 4x
camerashake1 55.8542 109.122 83.9851 55.6984
indoordrop 64.1578 65.8033 80.7871 62.5109
Quantitative comparison of eSL-Net to EDI, CF and MR with SR method on the real dataset by BRISQUE measure, where lower values indicate higher quality.

High frame-rate Reconstruction

In the following videos, The left side is the original APS frame by bicubic upsampling for 4 times, and the right side are the high frame rate, high resolution reconstructed results of eSL-Net.

Event Camera——DAVIS240:

camerashake
rotatevideo

Event Camera——DAVIS346:

j4
e4

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages