Metal Graphics Backend #6385

stenzek · 2018-02-17T08:43:43Z

Off-topic comments not related to testing and feedback will be strictly moderated.

This branch introduces (yet another) backend to Dolphin: Metal. It is largely a work in progress, and several features are missing. The main motivation for developing this backend was to ensure that the new VideoCommon interface design provides sufficient capabilities to move the majority of the logic currently in the backends to common code.

Thus, the Metal backend has the abstract framebuffer and pipeline branches as prerequisites. All functionality is built using these primitives, avoiding any Metal-specific code where possible, outside of the derived abstract classes themselves.

Rather than mixing Objective-C and C++, I used a C++ Metal wrapper library (mtlpp). In my opinion this improves code clarity, and it also has the benefit of handling object lifetimes, removing the need to sprinkle release and retain calls all over the source.
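To illustrate what "handling object lifetimes" buys, here is a minimal sketch of the general RAII pattern such a wrapper can use. It is not mtlpp's actual implementation: `ExampleObject`, `ExampleRetain`, `ExampleRelease` and `ExampleHandle` are hypothetical names, and the fake object only counts references instead of freeing itself.

```cpp
#include <utility>

// Hypothetical reference-counted object standing in for an Objective-C object.
// A real object would destroy itself when the count reaches zero; this one
// only tracks the count so the behaviour can be observed.
struct ExampleObject
{
  int ref_count = 1;  // the creator owns one reference
};

inline void ExampleRetain(ExampleObject* obj) { ++obj->ref_count; }
inline void ExampleRelease(ExampleObject* obj) { --obj->ref_count; }

// Hypothetical RAII handle: copies retain, destruction releases.
class ExampleHandle
{
public:
  // Adopts the reference the caller already owns (no extra retain).
  explicit ExampleHandle(ExampleObject* obj) : m_obj(obj) {}

  // Copying shares the object, so take another reference.
  ExampleHandle(const ExampleHandle& other) : m_obj(other.m_obj)
  {
    if (m_obj)
      ExampleRetain(m_obj);
  }

  // Moving transfers the reference and leaves the source empty.
  ExampleHandle(ExampleHandle&& other) noexcept : m_obj(std::exchange(other.m_obj, nullptr)) {}

  // Copy-and-swap covers both copy and move assignment.
  ExampleHandle& operator=(ExampleHandle other) noexcept
  {
    std::swap(m_obj, other.m_obj);
    return *this;
  }

  ~ExampleHandle()
  {
    if (m_obj)
      ExampleRelease(m_obj);
  }

  ExampleObject* get() const { return m_obj; }

private:
  ExampleObject* m_obj;
};
```

With a handle like this, backend code can pass objects around by value and never call retain or release directly.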

These two things (the prerequisite branches and the bundled mtlpp library) are why the diff line count is huge. The backend itself is only around 5k lines. I'm not expecting anyone to review it just yet, at least not until the prerequisites are merged.

The changes to shadergen are rather intrusive; in some ways the code we were generating did not fit well with the Metal Shading Language. Perhaps there is a better way to support all the languages, aside from using an external translator.

Also, because MSL is a heavier (C++-based) language, shader compilation "stutter" is likely worse than with OpenGL. Ubershaders and UID caches will be the solution here, but these will be implemented in common code rather than in the backend, so until that happens the backend will not support ubershaders.

Rough TODO list is as follows:

microbug · 2018-02-26T19:23:21Z

A more-or-less drop-in solution (as far as I know) has just become available: https://www.anandtech.com/show/12465/khronos-group-extends-vulkan-portability-with-opensource.

Since we use the common pipelines here and draw vertices if a batch is currently being built by the vertex loader, we end up trampling over its pointer, as we share the buffer with the loader, and it has not been unmapped yet. Force a pipeline flush to avoid this.

stenzek · 2018-03-03T14:35:40Z

The line count has grown even further, since this now depends on 6 related branches. The actual backend itself is only +5000 lines.

I've also switched from generating MSL directly to generating GLSL, then SPIR-V, and using SPIRV-Cross to translate these shaders to MSL. This means that we no longer need to significantly modify shadergen, where the earlier changes were quite intrusive.

This means the Metal backend now supports ubershaders, although performance could be terrible; I haven't really benchmarked it. Oh, and you should be able to get framerates above 60fps now. I've implemented pseudo-triple-buffering; it's not ideal, but it seems to work fine.

stenzek · 2018-03-03T14:38:24Z

@microbug I suspect that going via MoltenVK will be slower than this backend, as there is a considerable difference in the API versus Vulkan, which requires translation. But feel free to make a PR integrating MoltenVK and prove me wrong ;)

I did try it for laughs a while back, and I couldn't get the swap chain functioning. It's entirely possible I did something wrong, though. Also, you won't be able to get a higher frame rate than the screen refresh rate with MoltenVK.

microbug · 2018-03-04T00:05:24Z

@stenzek I'm nowhere near experienced enough to make a PR integrating MoltenVK. Just thought it might be useful, but a (faster) native Metal backend is of course preferable.

pizuz · 2018-08-01T14:55:42Z

Just gave this build a spin. Looks quite stable.

Twilight Princess fails to render the mini-map and therefore runs at full speed. Also glitches galore (water overlay effects being opaque, sense mode producing a white screen).
Mario ~~Galaxy~~ Sunshine runs quite well, aside from lots of flickering and glitching textures.
Wind Waker runs well aside from a few depth issues.

EDIT: Some more testing revealed some performance differences in Twilight Princess regarding the Hyrule Field Slowdowns (Post-Lanayru Spring):

OpenGL: 15 VPS
MoltenVK (10.0.17): 25 VPS
Metal: 35-45 VPS (glitches aside)

Interestingly, Metal and Vulkan cap GPU usage at about 70% during those phases, while OpenGL produces close to 100% GPU usage. All four physical CPU cores get about 30% load each (Hyperthreading doesn't seem to be active, apparently).

Setup: 3.1 GHz Haswell i7 and GeForce 750M in a late 2013 iMac running macOS 10.13.6

So, basically yes, MoltenVK is slower than pure Metal (dunno the impact of the graphics not rendering properly). Will test it against Windows, if I get to it.

MrGcGamer · 2018-08-06T23:02:42Z

When will we see the Metal backend get implemented in Dolphin? Or is something like a beta already out? (I have a mid-2010 MacBook Pro, so Wind Waker runs terribly with OpenGL and I don’t know how to improve the performance, and then I heard a Metal backend is in the works.)

lukearnould · 2018-08-06T23:04:49Z

This pull request has been indefinitely postponed, but stenzek is working on a Vulkan backend for macOS using MoltenVK that should offer much improved performance compared to OpenGL.

MrGcGamer · 2018-08-06T23:12:57Z

Do you know how long it’s probably going to take until it’s finished?

lukearnould · 2018-08-06T23:16:49Z

@MrGcGamer It's making good progress, you can follow it here: #7039

MrGcGamer · 2018-08-06T23:32:44Z

One little question left: is my Mac supported by MoltenVK (GeForce 320M)?

lukearnould · 2018-08-06T23:44:59Z

@MrGcGamer After some searching it seems your MacBook doesn't support Metal. Per this article the cutoff for Metal support is the NVIDIA GTX 400 series GPUs. Your GPU is one generation too old.

No Metal means no MoltenVK either (because that essentially wraps Vulkan to be run under Metal).

So whether a native Metal backend or a MoltenVK backend were to be implemented, either way your computer would not support it.

I'm sorry. There's nothing the Dolphin developers can do about it.

moda20 · 2018-08-12T16:57:42Z

This is probably not the best place to ask, but I am running Dolphin on High Sierra (hackintosh) and I have huge lag even with native resolution and minimum graphics settings. Will the Metal graphics backend help fix this? Also, I still can't see it when choosing backends in the config.

lukearnould · 2018-08-12T17:18:32Z

@moda20 The Metal backend isn’t completed yet; that’s why you can’t select it. Even if you downloaded a build of this pull request, it probably wouldn’t be usable due to bugs.

If the Metal backend is completed, it will give much improved performance. However, this pull request is indefinitely postponed, so I’d be watching this other pull request that’s working to implement a Vulkan backend on macOS using MoltenVK: #7039. It will still be a big performance improvement, but not quite as good as a native Metal backend.

In the meantime you could try asynchronous shaders in the ubershader graphics options. But since you’re running a hackintosh anyway, you’re best off using Dolphin on Linux or Windows. That’s the only way you’ll get decent performance anytime soon.

cesar18pena · 2018-08-13T01:50:24Z

@stenzek Do you have an idea of how long this PR is going to be postponed? I tried this PR's build with the Metal backend, tested Wind Waker and Metroid Prime 2, and could see a big improvement (really big) on Mac OS X.

This work is amazing. I hope you can complete this PR, even after Vulkan for macOS, to greatly improve gameplay on Mac.

P.S.: Are there specific training materials or documentation that you recommend for understanding and using C++ and Metal, so I can help create a better Dolphin for Mac users?

stenzek · 2018-08-13T03:13:58Z

MoltenVK is the solution for now. I simply don't have the time to maintain another backend, even if I did complete the missing features.

At some point in the future I may resurrect this branch, but for now it's dead. If someone does want to pick it up, there's an untested/rebased version in stenzek/metal.

stenzek added the WIP / do not merge label Feb 17, 2018

stenzek added 3 commits February 20, 2018 01:20

Vulkan: Use reversed depth range in viewport

9c286af

Matches OpenGL behavior.

Vulkan: Provide a more accurate method of detecting drivers/vendors

31b3462

This is needed to differentiate between the open-source Mesa drivers and their binary counterparts for Intel and AMD.

DriverDetails: Add bug to disable reversed depth range on broken drivers

0e1c0c4

stenzek added 25 commits March 1, 2018 17:31

VideoBackends: Restore the framebuffer as part of the API state

887e383

It's not often we switch out to draw to the EFB anyway.

VideoCommon: Drop references to AbstractRawTexture

e125eaa

AbstractTexture: Add property/attribute accessor helpers

4316f5f

AbstractTexture: Support multisampled abstract texture

6374a4c

AbstractTexture: Add support for depth textures/formats

2a6d9e4

VideoCommon: Add support for Abstract Framebuffers

4c24a69

OGL: Make ProgramShaderCache thread safe

286eeaf

Renderer: Add a base Initialize() method to match Shutdown()

603899e

Renderer: Add backbuffer format to base class

cc00245

OSD: Use RGBA byte order

3c180cc

VideoCommon: Add common implementation of RasterFont

71d9909

Externals: Add mtlpp Metal C++ Wrapper library

1359fbc

OGL: Fix abstract pipelines on drivers without binding layout support

a6bf90f

VKPipeline: Fix render pass and add pipeline layout fields

87374b2

AbstractPipeline: Allow setting pipeline to null

ef0da64

D3D: Make StateCache thread safe

215df11

D3D: Make NativeVertexFormat thread safe

eed35df

VKShader: Fix incorrect loading of binary shaders

c577c4a

Move shader caches to VideoCommon

38ce52d

OGL: Re-implement async shader compiling

060dcbb

OGL: Add some basic state tracking

c8bfdc8

We would want to improve the granularity here in the future, but for now, this should avoid any performance loss from switching to the VideoCommon shader cache.

Renderer: Remove now-redundant Set{Rasterization,Depth,Blending}State

04151d0

ShaderCache: Use memcmp for comparing pipeline UIDs

fd90843

As these are stored in a map, operator< will become a hot function when doing lookups, which happen every frame. std::tie generated a rather large function here with quite a few branches.
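As a rough illustration of the idea in this commit message, the sketch below orders a key with a single `memcmp` instead of a field-by-field `std::tie` comparison. `ExamplePipelineUid` and its fields are hypothetical and do not mirror the real UID structure. The approach assumes the struct has no padding bytes, which the `static_assert` checks.

```cpp
#include <cstdint>
#include <cstring>
#include <map>
#include <type_traits>

// Hypothetical stand-in for a pipeline UID; the real structure differs.
struct ExamplePipelineUid
{
  std::uint32_t vertex_format;
  std::uint32_t shader_id;
  std::uint32_t blend_state;
  std::uint32_t depth_state;
};

// memcmp only gives a consistent ordering if every byte is meaningful,
// so require a trivially copyable type with no padding.
static_assert(std::is_trivially_copyable<ExamplePipelineUid>::value,
              "UID must be trivially copyable to compare with memcmp");
static_assert(sizeof(ExamplePipelineUid) == 4 * sizeof(std::uint32_t),
              "UID must not contain padding bytes");

// One call, no per-field branches. The resulting order is bytewise, not
// numeric per field, but it is still a strict weak ordering, which is
// all std::map requires.
inline bool operator<(const ExamplePipelineUid& lhs, const ExamplePipelineUid& rhs)
{
  return std::memcmp(&lhs, &rhs, sizeof(ExamplePipelineUid)) < 0;
}

inline bool operator==(const ExamplePipelineUid& lhs, const ExamplePipelineUid& rhs)
{
  return std::memcmp(&lhs, &rhs, sizeof(ExamplePipelineUid)) == 0;
}
```

A `std::map<ExamplePipelineUid, T>` then picks up this `operator<` through the default `std::less` comparator.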

VideoConfig: Collapse ubershader configuration fields to a single value

e86c4b8

stenzek added 8 commits March 4, 2018 00:09

ShaderCache: Implement background shader compilation

ba70677

This enables shaders to be compiled while the game is starting, instead of blocking startup. If a shader is needed before it is compiled, emulation will block.
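The behaviour described here (compile in the background, block only when a shader is needed before it is ready) can be sketched with `std::async` and `std::shared_future`. This is a simplified illustration under assumed names, not the ShaderCache implementation: `ExampleShader`, `ExampleCompile` and `ExampleShaderCache` are hypothetical, and the "compile" step is a placeholder.

```cpp
#include <future>
#include <map>
#include <string>
#include <utility>

// Hypothetical compiled-shader type; a real backend would hold an API object.
struct ExampleShader
{
  std::string binary;
};

// Placeholder for an expensive compile step.
inline ExampleShader ExampleCompile(const std::string& source)
{
  return ExampleShader{"compiled:" + source};
}

class ExampleShaderCache
{
public:
  // Starts compiling on another thread and returns immediately,
  // so startup is not blocked.
  void QueueCompile(int id, std::string source)
  {
    m_shaders[id] =
        std::async(std::launch::async, ExampleCompile, std::move(source)).share();
  }

  // Returns the shader for a queued id. If the compile has not finished
  // yet, this call blocks until it has. Throws std::out_of_range if the
  // id was never queued.
  const ExampleShader& Get(int id) { return m_shaders.at(id).get(); }

private:
  std::map<int, std::shared_future<ExampleShader>> m_shaders;
};
```

A `shared_future` is used so that `Get` can be called repeatedly for the same shader; a plain `std::future` only allows one `get()`.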

ShaderCache: Add EFB copy, blit, and clear shaders to cache

ca1fe03

Externals: Add SPIRV-Cross

48454c7

Merge remote-tracking branch 'origin/vulkan-reversed-depth' into metal-base

8064460

Merge remote-tracking branch 'origin/videocommon-more-shaders' into metal-base

ffb64fb

Merge remote-tracking branch 'origin/videocommon-font' into metal-base

0ced836

Merge remote-tracking branch 'origin/externals-spirv-cross' into metal-base

222c285

Merge remote-tracking branch 'origin/externals-mtlpp' into metal-base

5007e5f

stenzek added 5 commits March 5, 2018 02:13

CMake: Change OSX deployment target to 10.13

1f51523

This doesn't mean the resulting binaries will require 10.13 to run, however, it enables us to use the new Metal functions introduced in 10.13.

VideoCommon: Add Metal as an "API Type" for shader generation

605a3cb

VideoCommon: Support emitting Metal-style GLSL

3a91846

Add experimental Metal graphics backend

9cb4c5d

Temporarily force macOS deployment target to 10.13

edc9ed8

Works around the cached value in the buildbots of 10.9

dolphin-emu deleted a comment from christ776 Apr 22, 2018

stenzek closed this Aug 13, 2018
