KVALD is a real-time video processing application designed to reduce glare and stabilize brightness in video streams. It uses a combination of a deep learning-based mask generator and a differentiable Kalman filter to achieve this.
The core of KVALD consists of the following components:
- Mask Generation Network: A U-Net based model that generates a probability mask to identify glare hotspots.
- Differentiable Kalman Filter: A Kalman filter that uses the generated mask to predict and update the state of the video stream, smoothing out brightness variations.
- Spatial and Temporal Smoothing: Additional smoothing algorithms to further improve the quality of the output video.
Note on Architecture: This project has been refactored to be a pure Python implementation, removing the C++ components for simplicity and faster prototyping.
To run this project, you will need Python and the dependencies listed in requirements.txt. You can install them using pip:
pip install -r requirements.txtYou can then run the main application:
python proof_of_concept.py