Currently, the VMR comparison succeeds even when there are non-baselined issues. This behavior can be problematic from a user experience (UX) perspective, as it causes the build to show as "green" despite the presence of issues.
To improve the accuracy and clarity of the build status, we should adjust the comparison logic to fail when non-baselined issues are detected.