V3.8.2 Update #87
Merged
VisionDepth merged 19 commits into Main-Stable from Feb 11, 2026
v3.8.2 - Re-added the ONNX model that was missing from the depth inference list
Adapters for DAV3 and Video Depth Anything Depth models for integration
Video Depth Anything Backend
Config files for DAV3
model backend for integration
DAV3 Model back end
Windows Updater for latest release
### Main GUI & Workflow Improvements

- Renamed **Depth Estimation** tab to **Depth Engine** to reflect multi-backend depth processing.
- Added native DA3 and Video Depth Anything engines directly into the unified depth selector.
- Improved model list consistency so UI options always match available backends.
- Added clearer ONNX model identification in the console during load.
- Fixed mismatched slider labels and tooltips in the 3D Generator tab.
- Reworked **Encoding Settings** dialog layout for cleaner spacing and readability.
- Moved **Clip Range** controls into Processing Options with translated labels and tooltips.
- Added optional **Convergence Crosshairs** overlay in Preview GUI for faster tuning.
- Fixed File menu actions failing to trigger dialogs (Load Preset and Output Path).
- Simplified File menu by removing redundant Save/Load Settings in favor of presets.
- Integrated built-in **VisionDepth3D Updater** accessible from Help → Check Updates.
- Added confirmation prompt before launching updater for safe auto-closing behavior.
- Reduced console warning spam for cleaner runtime output.
### Depth Estimation Improvements & Fixes

- Introduced native **Depth Anything 3 (DA3)** backend with full integration into image and video workflows.
- Added native **Video Depth Anything (VDA)** backend with sequence-aware temporal inference.
- Unified DA3, VDA, ONNX, and Hugging Face models under a single depth engine pipeline.
- Normalized all depth outputs into a consistent 0–1 range for reliable blending and 3D rendering.
- Added warm-up passes for DA3 and VDA to eliminate first-frame hitching.
- Improved batching support and fallback handling for multi-frame depth inference.
- Added configurable target FPS control for VDA to reduce inference load on high-FPS sources.

#### ONNX Stability & Model Fixes

- Fixed Distill-Any-Depth ONNX models failing due to tensor shape mismatches.
- Enforced correct 518×518 inference resolution for Distill-Any-Depth models.
- Added automatic ONNX model detection and resolution enforcement.
- Switched ONNX preprocessing to aspect-ratio-preserving padding instead of stretching.
- Enabled safe ONNX Runtime graph optimizations for improved stability and performance.
- Fixed ONNX warm-up errors and broadcast failures.

#### Video Depth Handling Improvements

- Fixed letterbox (black bar) regions incorrectly affecting depth inference.
- Improved multi-frame letterbox detection to prevent flicker.
- Filled letterbox areas with neutral depth to prevent pop-out artifacts and white banding.

#### Performance Optimizations

- Removed redundant image resizing during video inference.
- Consolidated resizing into a single pass per frame.
- Enabled CUDA `channels_last` memory layout for supported Hugging Face models.
- Improved FP16 inference handling for faster CUDA performance.
- Optimized ONNX session configuration to reduce memory overhead.
- Improved batch handling to reduce per-frame processing cost.
- Reduced console warning spam for cleaner runtime output.
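
The switch from stretching to aspect-ratio-preserving padding can be illustrated with a small geometry helper. This is only a sketch under assumptions: the names `PadGeometry`, `letterbox_geometry`, and `crop_box` are hypothetical and not taken from the VisionDepth3D code, and centered padding is an assumed choice. It computes how a frame would be scaled and padded to fit the 518×518 input mentioned above, and which region of the padded result holds real image content.

```python
from typing import NamedTuple, Tuple


class PadGeometry(NamedTuple):
    new_w: int       # width of the resized frame inside the square
    new_h: int       # height of the resized frame inside the square
    pad_left: int
    pad_top: int
    pad_right: int
    pad_bottom: int


def letterbox_geometry(src_w: int, src_h: int, target: int = 518) -> PadGeometry:
    """Fit a src_w x src_h frame into a target x target square without stretching."""
    if src_w <= 0 or src_h <= 0:
        raise ValueError("source dimensions must be positive")
    # One uniform scale factor keeps the aspect ratio intact.
    scale = min(target / src_w, target / src_h)
    new_w = max(1, min(target, round(src_w * scale)))
    new_h = max(1, min(target, round(src_h * scale)))
    # Split the leftover space as evenly as possible (assumed: centered padding).
    pad_w = target - new_w
    pad_h = target - new_h
    pad_left = pad_w // 2
    pad_top = pad_h // 2
    return PadGeometry(new_w, new_h, pad_left, pad_top,
                       pad_w - pad_left, pad_h - pad_top)


def crop_box(geom: PadGeometry) -> Tuple[int, int, int, int]:
    """Return (x0, y0, x1, y1) of the real content inside the padded square."""
    return (geom.pad_left, geom.pad_top,
            geom.pad_left + geom.new_w, geom.pad_top + geom.new_h)
```

With a helper like this, the depth output can be cropped back to `crop_box(...)` before resizing to the source resolution, so the padded borders never reach the 3D renderer.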
- Added optional **Convergence Crosshairs overlay** to the Preview GUI for faster and more precise convergence tuning.
- Significantly improved real-time Preview GUI smoothness by resetting render state between sessions to prevent drift and jitter.
- Eliminated “settling” artifacts at the start of previews by reinitializing depth normalization and convergence trackers per render.
- Improved floating window behavior during the first frames of preview playback for more stable stereo alignment.
- Increased live preview FPS by reducing GPU memory churn and reusing persistent buffers.
- Reduced preview stutter caused by warm-up spikes and redundant tensor allocations.
- Improved frame pacing for smoother SBS output during live preview.
- Enhanced stability when mixing screen capture with GPU depth inference.
- Major real-time performance boost across GPUs, with live 3D preview running approximately **40 to 70 percent faster** depending on resolution and hardware.
- Eliminated frequent GPU memory reallocations by introducing persistent CUDA buffers for depth inference and stereo rendering.
- Smoother live depth updates through optimized GPU tensor reuse and reduced CPU to GPU transfer overhead.
- Added independent **Depth FPS control**, allowing depth inference to run at a lower rate than preview rendering for better responsiveness and stability.
- Reduced temporal jitter in live depth maps using improved EMA smoothing while preserving depth responsiveness.
- Minimized preview hitching caused by first-frame warm-up and inference spikes.
- Improved frame pacing for more consistent SBS output in live mode.
- Increased stability when combining screen capture with GPU depth inference.
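
To make the Depth FPS control and EMA smoothing items more concrete, here is a minimal sketch of one way they could work together. It is an assumption, not the project's implementation: `DepthRateLimiter` and `ema_update` are hypothetical names, depth maps are represented as flat lists of floats for simplicity, and the default `alpha` is an arbitrary illustrative value.

```python
from typing import List, Optional, Sequence


class DepthRateLimiter:
    """Decide on which preview frames depth inference should actually run."""

    def __init__(self, preview_fps: float, depth_fps: float) -> None:
        if preview_fps <= 0 or depth_fps <= 0:
            raise ValueError("frame rates must be positive")
        # Fraction of a depth update earned per preview frame (capped at 1).
        self.step = min(depth_fps, preview_fps) / preview_fps
        # Start with full credit so the very first frame gets a depth map.
        self.credit = 1.0

    def should_infer(self) -> bool:
        run = self.credit >= 1.0
        if run:
            self.credit -= 1.0
        self.credit += self.step
        return run


def ema_update(prev: Optional[Sequence[float]],
               new: Sequence[float],
               alpha: float = 0.3) -> List[float]:
    """Blend a new depth map into the running average.

    alpha close to 1.0 follows new frames quickly; small alpha smooths harder.
    """
    if prev is None:
        return list(new)  # nothing to blend with yet
    return [alpha * n + (1.0 - alpha) * p for p, n in zip(prev, new)]
```

In a preview loop, frames where `should_infer()` returns `False` would simply reuse the last smoothed depth map, which keeps rendering at the preview rate while inference runs less often.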
- Significantly smoother offline and real-time 3D rendering by fully resetting internal render state at the start of each render session.
- Eliminated temporal drift, convergence carry-over, and accumulated smoothing artifacts between consecutive renders.
- Improved depth range calibration per clip with fresh percentile normalization for more consistent parallax response.
- Stabilized floating window behavior and convergence transitions during the first frames of each render.
- Increased real-time preview FPS and reduced jitter across long renders.
- Fixed output sizing across all 3D modes including:
  - VR formats
  - Passive Interlaced displays
  - Single-eye exports
- Corrected floating window calculations to operate per-eye instead of full SBS width.
- Added safety resizing to guarantee final encoded frames always match target output resolution.
- Added optional **Convergence Crosshairs overlay** in the Preview GUI for faster and more precise tuning.
- Cleaned up UI inconsistencies:
  - Foreground and Background shift labels now match their actual sliders
  - Tooltips correctly reflect each control’s function
- Reworked Encoding Settings layout for better readability and workflow.
- Moved Clip Range controls into Processing Options for a cleaner main interface.
- Fixed File menu actions:
  - Preset loading now works correctly from the dropdown
  - Output path dialog now opens properly from both menu and hotkey
- Removed redundant Save/Load Settings in favor of streamlined Preset workflow
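
The per-clip percentile normalization and per-session state reset described above could look roughly like the following. This is a hedged sketch only: `DepthRangeTracker`, `percentile`, the 2nd/98th percentile defaults, and the choice to calibrate on the first frame after a reset are all assumptions made for illustration, not details confirmed by this pull request.

```python
from typing import List, Optional, Sequence


def percentile(sorted_vals: Sequence[float], q: float) -> float:
    """Linearly interpolated percentile (q in 0..100) of an already sorted sequence."""
    if not sorted_vals:
        raise ValueError("empty input")
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1.0 - frac) + sorted_vals[hi] * frac


class DepthRangeTracker:
    """Holds the depth range for one render session (hypothetical helper)."""

    def __init__(self, low_q: float = 2.0, high_q: float = 98.0) -> None:
        self.low_q = low_q
        self.high_q = high_q
        self.lo: Optional[float] = None
        self.hi: Optional[float] = None

    def reset(self) -> None:
        """Call at the start of each render so no range carries over between clips."""
        self.lo = None
        self.hi = None

    def normalize(self, depth: Sequence[float]) -> List[float]:
        if self.lo is None or self.hi is None:
            vals = sorted(depth)
            self.lo = percentile(vals, self.low_q)
            self.hi = percentile(vals, self.high_q)
        span = self.hi - self.lo
        if span <= 0:
            # Flat frame: emit neutral depth and recalibrate on the next frame.
            self.reset()
            return [0.5] * len(depth)
        # Map into 0..1 and clamp outliers beyond the calibrated percentiles.
        return [min(1.0, max(0.0, (d - self.lo) / span)) for d in depth]
```

Calling `reset()` before each render is what prevents a range calibrated on one clip from skewing parallax on the next.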
Updated changelog for VisionDepth3D v3.8.2 with performance improvements, new depth engines, and various fixes.
Added new dependencies to requirements.txt for additional functionality.
VisionDepth3D v3.8.2 Major Engine & Performance Update
This pull request introduces one of the largest upgrades to VisionDepth3D to date, focusing on:
- New Depth Engines
- Depth Inference Fixes & Quality Improvements
- Massive 3D Rendering Performance Gains
- Encoding & Pipeline Reliability
- UI & Workflow Improvements
Summary
This update significantly improves:
and introduces next-generation depth backends for both image and video processing to VD3D.