V3.8.2 Update by VisionDepth · Pull Request #87 · VisionDepth/VisionDepth3D

VisionDepth · 2026-02-11T20:30:56Z

VisionDepth3D v3.8.2 Major Engine & Performance Update

This pull request introduces one of the largest upgrades to VisionDepth3D to date, focusing on:

New Depth Engines

Native Depth Anything 3 (DA3) backend with unified image & video support
Native Video Depth Anything (VDA) temporal depth backend for sequence-aware video inference

Depth Inference Fixes & Quality Improvements

Fixed Distill-Any-Depth ONNX model failures and enforced correct inference resolution
Improved aspect-ratio preserving preprocessing for ONNX models
Normalized all depth outputs into a consistent 0–1 range
Stabilized letterbox handling to prevent black bar depth artifacts

Massive 3D Rendering Performance Gains

Full render-state reset per session (no drift, jitter, or convergence carry-over)
Faster real-time preview and smoother offline rendering
Improved floating window stability and per-eye geometry handling

Encoding & Pipeline Reliability

Fixed FFmpeg/OpenCV codec routing issues
Enabled fast FFmpeg piping for reliable high-quality output

UI & Workflow Improvements

Renamed Depth Estimation tab to Depth Engine
Cleaner Encoding Settings layout
Fixed File menu actions (preset loading, output path dialog)
Integrated built-in VisionDepth3D Updater

Summary

This update significantly improves:

Depth accuracy
Real-time performance
Render stability
Encoding reliability
Workflow polish

and introduces next-generation depth backends for both image and video processing to VD3D

v3.8.2 - Readded ONNX model from Depth inference list that was missing

Adapters for DAV3 and Video Depth Anything Depth models for integration

Video Depth Anything Backend

Config files for DAV3

model backend for integration

DAV3 Model back end

Windows Updater for latest release

### Main GUI & Workflow Improvements - Renamed **Depth Estimation** tab to **Depth Engine** to reflect multi-backend depth processing. - Added native DA3 and Video Depth Anything engines directly into the unified depth selector. - Improved model list consistency so UI options always match available backends. - Added clearer ONNX model identification in the console during load. - Fixed mismatched slider labels and tooltips in the 3D Generator tab. - Reworked **Encoding Settings** dialog layout for cleaner spacing and readability. - Moved **Clip Range** controls into Processing Options with translated labels and tooltips. - Added optional **Convergence Crosshairs** overlay in Preview GUI for faster tuning. - Fixed File menu actions failing to trigger dialogs (Load Preset and Output Path). - Simplified File menu by removing redundant Save/Load Settings in favor of presets. - Integrated built-in **VisionDepth3D Updater** accessible from Help → Check Updates. - Added confirmation prompt before launching updater for safe auto-closing behavior. - Reduced console warning spam for cleaner runtime output.

### Depth Estimation Improvements & Fixes - Introduced native **Depth Anything 3 (DA3)** backend with full integration into image and video workflows. - Added native **Video Depth Anything (VDA)** backend with sequence-aware temporal inference. - Unified DA3, VDA, ONNX, and Hugging Face models under a single depth engine pipeline. - Normalized all depth outputs into a consistent 0–1 range for reliable blending and 3D rendering. - Added warm-up passes for DA3 and VDA to eliminate first-frame hitching. - Improved batching support and fallback handling for multi-frame depth inference. - Added configurable target FPS control for VDA to reduce inference load on high-FPS sources. #### ONNX Stability & Model Fixes - Fixed Distill-Any-Depth ONNX models failing due to tensor shape mismatches. - Enforced correct 518×518 inference resolution for Distill-Any-Depth models. - Added automatic ONNX model detection and resolution enforcement. - Switched ONNX preprocessing to aspect-ratio-preserving padding instead of stretching. - Enabled safe ONNX Runtime graph optimizations for improved stability and performance. - Fixed ONNX warm-up errors and broadcast failures. #### Video Depth Handling Improvements - Fixed letterbox (black bar) regions incorrectly affecting depth inference. - Improved multi-frame letterbox detection to prevent flicker. - Filled letterbox areas with neutral depth to prevent pop-out artifacts and white banding. #### Performance Optimizations - Removed redundant image resizing during video inference. - Consolidated resizing into a single pass per frame. - Enabled CUDA `channels_last` memory layout for supported Hugging Face models. - Improved FP16 inference handling for faster CUDA performance. - Optimized ONNX session configuration to reduce memory overhead. - Improved batch handling to reduce per-frame processing cost. - Reduced console warning spam for cleaner runtime output.

- Added optional **Convergence Crosshairs overlay** to the Preview GUI for faster and more precise convergence tuning. - Significantly improved real-time Preview GUI smoothness by resetting render state between sessions to prevent drift and jitter. - Eliminated “settling” artifacts at the start of previews by reinitializing depth normalization and convergence trackers per render. - Improved floating window behavior during the first frames of preview playback for more stable stereo alignment. - Increased live preview FPS by reducing GPU memory churn and persistent buffer reuse. - Reduced preview stutter caused by warm-up spikes and redundant tensor allocations. - Improved frame pacing for smoother SBS output during live preview. - Enhanced stability when mixing screen capture with GPU depth inference.

- Major real-time performance boost across GPUs, with live 3D preview running approximately **40 to 70 percent faster** depending on resolution and hardware. - Eliminated frequent GPU memory reallocations by introducing persistent CUDA buffers for depth inference and stereo rendering. - Smoother live depth updates through optimized GPU tensor reuse and reduced CPU to GPU transfer overhead. - Added independent **Depth FPS control**, allowing depth inference to run at a lower rate than preview rendering for better responsiveness and stability. - Reduced temporal jitter in live depth maps using improved EMA smoothing while preserving depth responsiveness. - Minimized preview hitching caused by first-frame warm-up and inference spikes. - Improved frame pacing for more consistent SBS output in live mode. - Increased stability when combining screen capture with GPU depth inference.

- Significantly smoother offline and real-time 3D rendering by fully resetting internal render state at the start of each render session. - Eliminated temporal drift, convergence carry-over, and accumulated smoothing artifacts between consecutive renders. - Improved depth range calibration per clip with fresh percentile normalization for more consistent parallax response. - Stabilized floating window behavior and convergence transitions during the first frames of each render. - Increased real-time preview FPS and reduced jitter across long renders. - Fixed output sizing across all 3D modes including: - VR formats - Passive Interlaced displays - Single-eye exports - Corrected floating window calculations to operate per-eye instead of full SBS width. - Added safety resizing to guarantee final encoded frames always match target output resolution. - Added optional **Convergence Crosshairs overlay** in the Preview GUI for faster and more precise tuning. - Cleaned up UI inconsistencies: - Foreground and Background shift labels now match their actual sliders - Tooltips correctly reflect each control’s function - Reworked Encoding Settings layout for better readability and workflow. - Moved Clip Range controls into Processing Options for a cleaner main interface. - Fixed File menu actions: - Preset loading now works correctly from the dropdown - Output path dialog now opens properly from both menu and hotkey - Removed redundant Save/Load Settings in favor of streamlined Preset workflow

Updated changelog for VisionDepth3D v3.8.2 with performance improvements, new depth engines, and various fixes.

Added new dependencies to requirements.txt for additional functionality.

old

VisionDepth added 19 commits January 21, 2026 15:19

Update Model List

a66255c

v3.8.2 - Readded ONNX model from Depth inference list that was missing

Add files via upload

861d021

Adapters for DAV3 and Video Depth Anything Depth models for integration

Add files via upload

3400f49

Video Depth Anything Backend

Add files via upload

00477b3

Add files via upload

2370d93

Config files for DAV3

Add files via upload

918d444

model backend for integration

Add files via upload

b47f5cb

DAV3 Model back end

Add files via upload

1d8ab72

Windows Updater for latest release

Revise changelog for VisionDepth3D v3.8.2 release

6b8f5b7

Updated changelog for VisionDepth3D v3.8.2 with performance improvements, new depth engines, and various fixes.

Update requirements.txt with new packages

5851734

Added new dependencies to requirements.txt for additional functionality.

Update copyright year in LICENSE.txt

b936625

Delete presets/Best3DSettings.json

7851e9c

old

Delete presets/balanced_depth.json

24ec5d0

old

Add files via upload

40b0f41

VisionDepth merged commit 31dcfde into Main-Stable Feb 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

V3.8.2 Update#87

V3.8.2 Update#87
VisionDepth merged 19 commits intoMain-Stablefrom
v3.8.2

VisionDepth commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

VisionDepth commented Feb 11, 2026

VisionDepth3D v3.8.2 Major Engine & Performance Update

New Depth Engines

Depth Inference Fixes & Quality Improvements

Massive 3D Rendering Performance Gains

Encoding & Pipeline Reliability

UI & Workflow Improvements

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant