This is code for the paper Event Enhanced High-Quality Image Recovery by Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang.
You can find a pdf of the paper here. The paper has been accepted by ECCV2020. If you use of this code, please cite the following publications:
@inproceedings{wang2020event,
title={Event Enhanced High-Quality Image Recovery},
author={Wang, Bishan and He, Jingwei and Yu, Lei and Xia, Gui-Song and Yang, Wen},
booktitle={European Conference on Computer Vision},
year={2020},
organization={Springer}
}
-
Pretrained model : code/pretraining/model_epoch_57.pth
-
An example file with event data: data_example
-
Data of APS frames: camerashake_blurimage
-
Data of events after preprocessing: camerashake_event_frame_txt
if you have new event data, the preprocessing of events can refer to code/generate_test_txt.m
-
the path of loading input for eSL-Net: test.txt
-
-
Model of eSL-Net: model_n2.py
-
Run reconstruction:
cd code
python test_real_data.py
-
The results of reconstruction can be be viewed in the data_example/camerashake_results
With extremely high temporal resolution, event cameras have a large potential for robotics and computer vision. However, the recovering of high-quality images from event cameras is a very challenge problem, where the following issues should be addressed simultaneously.
-
Low frame-rate and blurry intensity images: The APS (Active Pixel Sensor) frames are with relatively low frame-rate. And the motion blur is inevitable when recording highly dynamic scenes.
-
High level and mixed noises: The thermal effects or unstable light environment can produce a huge amount of noisy events. Together with the noises from APS frames, the reconstruction of intensity image would fall into a mixed noises problem.
-
Low spatial-resolution: The leading commercial event cameras are typically with very low spatial-resolution. And there is a balance between the spatial-resolution and the latency
In our paper, we propose an explainable network, an event-enhanced Sparse Learning Network (eSL-Net), to recover the high-quality images from event cameras. Since events depict brightness changes, with the enhanced degeneration model by the events, the clear and sharp high-resolution latent images can be recovered from the noisy, blurry and low-resolution intensity observations. Exploiting the framework of sparse learning, the events and the low-resolution intensity observations can be jointly considered. Furthermore, without additional training process, the proposed eSL-Net can be easily extended to generate continuous frames with frame-rate as high as the events.
Methods | EDI+RCAN 4x | CF+RCAN 4x | MR+RCAN 4x | eSL-Net 4x |
---|---|---|---|---|
PSNR(dB) | 12.88 | 12.89 | 12.89 | 25.41 |
SSIM | 0.4647 | 0.4638 | 0.4643 | 0.6727 |
real data/BRISQUE | EDI+RCAN 4x | CF+RCAN 4x | MR+RCAN 4x | eSL-Net 4x |
---|---|---|---|---|
camerashake1 | 55.8542 | 109.122 | 83.9851 | 55.6984 |
indoordrop | 64.1578 | 65.8033 | 80.7871 | 62.5109 |
In the following videos, The left side is the original APS frame by bicubic upsampling for 4 times, and the right side are the high frame rate, high resolution reconstructed results of eSL-Net.
Event Camera——DAVIS240:
Event Camera——DAVIS346: