V3.8#81
Merged
VisionDepth merged 16 commits intoMain-Stablefrom Dec 18, 2025
Merged
Conversation
- Added buttons for encoder settings and processing options. - Implemented multi-language support and tooltips for new dialog boxes. - Adjusted preview image window size and video info layout to prevent window overflow. - 3D tab columns now stack correctly when resizing the window on smaller screens.
- Reworked Auto Crop Black Bars to use first-frame detection with cached crop reuse. - Prevents per-frame crop jitter and depth/frame misalignment. - Improves stability for cinema content with subtle letterboxing. - Keep Audio checkbox now respects the user-selected output container instead of forcing MP4.
### Depth Models - Fixed ONNX model loading: - Distill-Any-Depth (inference resolution 518×518, batch size 8) - Video Depth Anything (inference resolution 512×288, batch size 8) - Implemented LBM depth model (development version). Thanks to Aether for the implementation fix. - Removed depth models from the dropdown that returned no `d_type`. - Fixed Hugging Face model downloads and caching so zoo models consistently save inside the app `weights/` directory (no more extra `.cache` downloads). - Updated Transformers image processor loading to prefer `use_fast=True` when available (with automatic fallback when unsupported). ### Depth Backend - Implemented temporal smoothing in the depth pipeline to reduce flicker and improve temporal stability of depth map output. - Packaged VisionDepth3D.exe with Distill-Any-Depth (ONNX), Video Depth Anything (ONNX), and Depth Anything v2 Giant weights.
- Changed entry box to be easier to input - Console loading text changed
- changed processor output text to match rest of system - Added SSResNet Model support for future implementation
- Standardized startup console messages to clearly reflect which subsystems are initializing (Torch, depth estimation, upscaler, external 3D pipeline, language, settings).
- Changed size of frame to smaller size so Drop down doesnt get cropped to small
Updated changelog for version 3.8, detailing new features, improvements, and fixes across various tabs including Depth Estimation and 3D Render. Emphasized stability, model compatibility, and UI enhancements.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.