Swap minhook for Microsoft Detours #12964

LeonarddeR · 2021-10-20T05:56:18Z

Link to issue number:

Related to #8420

Summary of the issue:

NVDA has been using minhook from the start to hook several Windows APIs, particularly related to the display model. However, the version of minhook currently in use is very old. I tried to update it once, but that failed and was reverted in #8456.

Description of how this pull request fixes the issue:

Microsoft has long had its own hooking library, Detours, but it used to be closed source. Now that it is open source, it offers us the following benefits:

- Support for hooking on ARM64, which should make the display model work.
- It is well documented and much more widely used and proven than minhook.
- Its code is more recent than minhook's.

Testing strategy:

Apart from testing in Alpha, this can be tested by checking whether screen review mode still works in several apps, including Windows Explorer, Notepad and other Win32 apps. Additionally, it would be good to verify that #8420 does not return. I can't test that myself.

Test display model and general smoke test with:

ARM64: Swap minhook for Microsoft Detours #12964 (comment)
Windows 7: Swap minhook for Microsoft Detours #12964 (comment)
Windows 10
Windows 11: Swap minhook for Microsoft Detours #12964 (comment)
32bit Windows

Known issues with pull request:

None known so far.

Change log entries:

For Developers

Switched from Minhook to Microsoft Detours as a hooking library for NVDA. Hooking with this library is mainly used to aid the display model.

Code Review Checklist:

Pull Request description:
- description is up to date
- change log entries
Testing:
- Unit tests
- System (end to end) tests
- Manual testing
API is compatible with existing add-ons.
Documentation:
- User Documentation
- Developer / Technical Documentation
- Context sensitive help for GUI changes
UX of all users considered:
- Speech
- Braille
- Low Vision
- Different web browsers
- Localization in other languages / culture than English

seanbudd

This looks like a very useful change - have you tested this with an ARM64 device yet?

readme.md

LeonarddeR · 2021-10-20T07:06:43Z

Have you tested this with an ARM64 device yet?

Afraid not as I don't own such a device.

lukaszgo1 · 2021-10-20T15:48:53Z

Is this work mature enough to be used for testing? I rely pretty heavily on the display model and would like to avoid having it broken in Alpha if possible, hence my question.

LeonarddeR · 2021-10-20T17:10:13Z

I would really love it if you could test drive the try build for a bit to see how it behaves.

seanbudd · 2021-10-20T22:28:38Z

@LeonarddeR - would a signed try build be better for testing this? Happy to create one from this branch when ready.

LeonarddeR · 2021-10-21T05:59:39Z

@LeonarddeR - would a signed try build be better for testing this? Happy to create one from this branch when ready.

That would be very helpful indeed!

seanbudd · 2021-10-21T22:23:36Z

@LeonarddeR @lukaszgo1 Here is the signed try-build and appveyor build link

zstanecic · 2021-10-21T22:50:21Z

Hi! How should this affect users on W11? How can this be tested on the user side? What should be tested?

feerrenrut · 2021-10-22T03:22:51Z

General approach looks good. I'll need to do a more detailed review once successful testing is reported on arm devices. I assume this will also need testing on 32 bit and older Windows releases. A list of the intended test targets would be helpful.

LeonarddeR · 2021-10-22T05:46:48Z

@seanbudd wrote:

@LeonarddeR @lukaszgo1 Here is the signed try-build and appveyor build link

Thanks, I'll test this for some days on my Windows 11 machine.

Hi! How should this affect users on W11? How can this be tested on the user side? What should be tested?

Honestly, the intention is that users don't get affected. It is like changing an essential part of a car to something broader supported without breaking the car.

General approach looks good. I'll need to do a more detailed review once successful testing is reported on arm devices.

I'm not sure how to do this, as I don't know an ARM device user who can test. How was this done by NV Access in the past? Maybe @jcsteh?

I assume this will also need testing on 32 bit and older Windows releases. A list of the intended test targets would be helpful.

While this is strictly speaking correct, Detours, like minhook, behaves exactly the same on all versions of Windows, and hooking has never been a problem when a new major version of Windows came out. I think it is enough to get this to Alpha for broader testing as soon as ARM64 tests are performed successfully.

Having said that, something like a system test for display model behaviour could be helpful.

zstanecic · 2021-10-22T06:05:25Z

Hi Leonard and all, I am running this as a portable version, and so far I don't see any bugs on x64.

jcsteh · 2021-10-22T06:10:01Z

General approach looks good. I'll need to do a more detailed review once successful testing is reported on arm devices.

I'm not sure how to do this, as I don't know an ARM device user who can test. How was this done by NV Access in the past? Maybe @jcsteh?

I implemented ARM64 support for NVDA as part of a project at Mozilla. I do still have a Mozilla ARM64 machine floating around here which I could probably test on if it still works; I haven't used it in years. I don't know when I could get to this, though.

That said, I've never seen ARM64 API hooking as being particularly important. Display model is mostly useful in legacy apps... but it seems highly unlikely that a legacy app would still be using legacy GDI and yet be recompiled for ARM64. I only say this because if I'm unable to get to this in a reasonable time frame, I would suggest you could ship this with ARM64 support disabled (as we currently have with MinHook) and then deal with ARM64 at some future point.

jcsteh · 2021-10-22T06:12:51Z

I vaguely recall @michaelDCurran might also have an ARM64 machine? I'm not sure.

LeonarddeR · 2021-10-22T06:27:14Z

Display model is mostly useful in legacy apps... but it seems highly unlikely that a legacy app would still be using legacy GDI and yet be recompiled for ARM64.

We rely on the display model for some Inhouse Windows apps, the management console for example (see the mmc appModule).

Basically I agree that's not super important indeed.

michaelDCurran · 2021-10-22T07:00:32Z

Had to send that ARM64 prototype machine back to the company we borrowed it from about 4 years ago I'm sorry. So currently no physical ARM machines here.

zstanecic · 2021-10-22T07:43:50Z

I propose merging this to alpha. Maybe somebody has one of these devices. I have been running this since yesterday, and it works very well on W11.

lukaszgo1 · 2021-10-22T08:08:26Z

That said, I've never seen ARM64 API hooking as being particularly important. Display model is mostly useful in legacy apps... but it seems highly unlikely that a legacy app would still be using legacy GDI and yet be recompiled for ARM64.

There are still parts of MS Office, e.g. the controls used to edit custom lists in MS Excel, which cannot be used without GDI hooks. Until recently this probably worked on ARM64 because people were using the normal 32-bit versions of Office, but some time ago Microsoft released a version of Office that is recompiled for ARM64.

I only say this because if I'm unable to get to this in a reasonable time frame, I would suggest you could ship this with ARM64 support disabled (as we currently have with MinHook) and then deal with ARM64 at some future point.

The only currently known improvement from this PR is support for ARM64, so merging this without it seems pretty pointless (we don't have any problems with hooking for other architectures, after all), and therefore there is no rush with getting this merged.

Having said that, most usage of Windows for ARM64 with NVDA is probably on the new M1 Mac computers, where people virtualize it. I believe @pitermach does so. Would you be willing to do some tests for us, assuming you indeed have such a device with a Windows for ARM64 VM?

LeonarddeR · 2021-10-22T09:05:42Z

@lukaszgo1 wrote:

... there is no rush with getting this merged.

While that is true, it is API breaking at the C++ level and therefore I prefer to have it in an API breaking release. Theoretically, add-ons could use the minhook DLL bundled with NVDA for all sorts of magic things they really should avoid. Additionally, #11768 was delayed for similar reasons, though the availability of newer WinRT headers played a role there as well.

pitermach · 2021-10-22T11:39:12Z

Hi,

I tried the try build linked a few comments back inside a Windows 10 ARM virtual machine in UTM running build 21370, NVDA was running in portable mode. Screen Review appeared to work inside the NVDA menu, but was completely blank in other dialogs I looked at (run dialog, the WinVer message box, notepad, and a properties dialog for a file in explorer).

I can try updating this VM to a newer version of Windows (since this build will expire soon anyway), or I can do other tests if you need me to, like running this build installed. For the moment I don't think I can try this in Windows 11 because I'm not sure UTM supports a virtual TPM, but I can look into that if possible or point someone who has a Parallels license at this thread.

lukaszgo1 · 2021-10-22T12:13:24Z

Screen Review appeared to work inside the NVDA menu,

Just to make sure: how does screen review behave with the normal builds of NVDA when in the NVDA menu on ARM?

but was completely blank in other dialogs I looked at (run dialog, the WinVer message box, notepad, and a properties dialog for a file in explorer).

That's pretty bad and debugging this is going to be painful. It looks like QEMU can emulate ARM on x64 so that might be an option assuming it would not be unbearably slow.

jcsteh · 2021-10-22T12:25:32Z

NVDA itself is an x86 process, so screen review has always worked there. Winver, Run dialog, etc. are all native ARM64.

@LeonarddeR, I think you missed the ifdef around gdiHooks_inProcess_initialize in nvdaHelper/remote/inProcess.cpp.

josephsl · 2021-10-22T12:59:12Z

Hi, I think the most effective way to test this would be Parallels 17.1 on macOS Big Sur or later with Windows 11 build 22000 or above. Another way is asking if we can borrow a Surface Pro X from Microsoft. Thanks.

LeonarddeR · 2021-11-04T17:47:41Z

I got rid of the macro. I would like to remove apiHook_hookFunction as well, e.g. in the header:

template<typename funcType>
bool apiHook_hookFunction_safe(funcType realFunction, funcType fakeFunction, funcType* targetPointerRef);

In the cpp, swap this:

bool apiHook_hookFunction(void* realFunction, void* fakeFunction, void** targetPointerRef) {

with

template<typename funcType>
bool apiHook_hookFunction_safe(funcType realFunction, funcType fakeFunction, funcType* targetPointerRef) {

It compiles, but then fails linking. I don't understand templates well enough to fix this. Would anyone be able to help out here?

lukaszgo1 · 2021-11-07T19:18:12Z

1. How much memory your machine has

16 GB

2. Whether this is an x86 or x64 process (I assume it is safe to say X64, looking at the name of the executable).

Yes - this is a 64-bit executable

3. How much memory the executable is using at the time of error

574 408 K of ram

4. What OS you're running on the system

Windows 7 X64

5. Whether this is reproducible on another system.

I've tried on two different machines - one with 8 GB of ram running Windows 10, and the second with 6 GB of ram running Windows 7, and this issue cannot be reproduced on any of them.
I'm inclined to say that it is specific to something on my main machine, and while it still would be nice to get to the bottom of it this should not block this PR.

LeonarddeR · 2021-11-16T09:27:47Z

I'm inclined to say that it is specific to something on my main machine, and while it still would be nice to get to the bottom of it this should not block this PR.

I agree it would be helpful to go to the bottom of this. There are two things we do not know though:

Maybe Detours really can't hook. The question would be why Detours can't when minhook could, but as hooking this app doesn't make much sense in the first place, there's no real problem here.
Maybe minhook doesn't raise an error even though it can't hook correctly on your machine either.

I think it would be helpful to have this merged ASAP. If any C++ hero could address my question in #12964 (comment), that would be extremely helpful.

jcsteh · 2021-11-16T10:57:03Z

In the cpp, swap this:
bool apiHook_hookFunction(void* realFunction, void* fakeFunction, void** targetPointerRef) {
with
template<typename funcType>
bool apiHook_hookFunction_safe(funcType realFunction, funcType fakeFunction, funcType* targetPointerRef) {
It compiles, but then fails linking. I don't understand templates well enough to fix this. Would anyone be able to help out here?

Templated functions are created by the compiler when they're called, with each instantiation of the template effectively created as a separate function. When another module (e.g. gdiHooks) tries to call this, it tries to find the exported function in an external module. Since this specific version of this function was never created in apiHook, you get a linker error.

If you want to do this with just a template, you need to do it in the header file so that all modules can create the templated functions as needed.
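As a rough illustration of that split, here is a minimal sketch, assuming hypothetical names (`hookUntyped`, `hookTyped`, `realAdd`, `fakeAdd`) that are not part of NVDA or Detours. The non-template helper stands in for code that would live in a .cpp file and be compiled once; the template stands in for code that would live in the header, so every module that calls it can instantiate its own version. No real hooking is performed.

```cpp
// Would live in the .cpp file: a plain function, compiled once and exported.
// Hypothetical stand-in for the real hooking call; it only validates its inputs.
bool hookUntyped(void* realFunction, void* fakeFunction) {
	return realFunction != nullptr && fakeFunction != nullptr;
}

// Would live in the header: each caller instantiates the version it needs,
// so the linker never has to find a pre-built instantiation elsewhere.
template<typename funcType>
bool hookTyped(funcType realFunction, funcType fakeFunction, funcType* targetPointerRef) {
	if (!targetPointerRef) return false;
	// All three arguments must agree on funcType, or deduction fails at compile time.
	if (!hookUntyped(reinterpret_cast<void*>(realFunction), reinterpret_cast<void*>(fakeFunction))) {
		return false;
	}
	*targetPointerRef = realFunction;  // keep a typed way to reach the original
	return true;
}

// Two functions with matching signatures, used only to exercise the template.
int realAdd(int a, int b) { return a + b; }
int fakeAdd(int a, int b) { return a + b + 1000; }
```

Calling `hookTyped(realAdd, fakeAdd, &saved)` with `int (*saved)(int, int)` deduces `funcType` as `int (*)(int, int)` for all three arguments; passing a replacement with a different signature would be rejected by the compiler rather than by the linker.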

jcsteh · 2021-11-16T10:58:21Z

To put this another way, templates are effectively syntactic sugar. They're basically macros, just with a lot more safety.

LeonarddeR · 2021-11-16T11:19:27Z

Ah, thanks for the great explanation. I guess I'll leave it this way since moving the whole function to the header file, including calls to the logger and detours function is pretty ugly I suppose.

jcsteh · 2021-11-16T11:38:36Z

Yeah. It's perfectly reasonable to have a helper like this that's only used by a template. If this were a class, I'd say the helper should be private and the template should be public, but it isn't a class, so we can't do that.

lukaszgo1 · 2021-11-16T11:40:14Z

I agree it would be helpful to go to the bottom of this. There are two things we do not know though:

It looks like we won't be able to find out - as of today these errors are not occurring anymore across several restarts, even though I haven't done any changes to my machine in the last week.

lukaszgo1 · 2021-11-24T13:33:49Z

Is this PR planned for inclusion in 2022.1? If so perhaps it should be added to the milestone.

LeonarddeR · 2021-11-24T15:53:57Z

I think it makes sense to do so as Microsoft seems to intend to expand on their ARM64 interests.

I added the milestone so it can at least be tracked. NV Access can always decide to further delay this pr.

nvdaHelper/remote/injection.cpp

Co-authored-by: Michael Curran <mick@nvaccess.org>

feerrenrut

I think this is looking pretty good. Happy to discuss whether these changes should be in this PR or a subsequent one.

feerrenrut · 2022-01-06T05:45:37Z

nvdaHelper/remote/apiHook.h

- * a helper template used internally by apiHook_hookFunction_safe
- */
-template<typename funcType> funcType _apiHook_hookFunction_tpl(const char* moduleName, const char* functionName, funcType funcSyg, funcType fakeFunction) { return (funcType)apiHook_hookFunction(moduleName,functionName,(void*)fakeFunction); }
+bool apiHook_hookFunction(void* realFunction, void* fakeFunction, void** targetPointerRef);

 /**
 * Safely hooks a given function from a given module with a given fake function.


Let's expand on what makes this "safe", e.g. what danger does this avoid?

It seems to me that the only thing this ensures is that the three parameters have matching function signatures.
For example:

// the function to be replaced
bool testReal(int x, bool y){ return false; }

// a valid replacement
auto *real_func = testReal;
bool testFake(int x, bool y){ return real_func(x, y); }

// a broken replacement
auto *real_func2 = testReal;
bool testBroken(int x, float y){ return false; }

void doHookTest(){
	auto res = apiHook_hookFunction_safe(testReal, testFake, &real_func);
	// auto res2 = apiHook_hookFunction_safe(testReal, testBroken, &real_func2); // this line causes a compiler error
}

Note I have used auto to get the type of the real function pointer, you could instead do decltype(testReal) *real_func = testReal;. But I think that is overly verbose.
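That the two spellings yield the same pointer type can be checked at compile time. The following is a small sketch; `viaAuto` and `viaDecltype` are hypothetical names, and the body of `testReal` is a placeholder chosen only so the function returns something observable.

```cpp
#include <type_traits>

// Same signature as testReal in the example above; the body is a placeholder.
bool testReal(int x, bool y) { return x > 0 && y; }

// Both declarations produce a pointer to a function taking (int, bool) and returning bool.
auto* viaAuto = testReal;
decltype(testReal)* viaDecltype = testReal;

static_assert(std::is_same<decltype(viaAuto), decltype(viaDecltype)>::value,
	"auto* and decltype(f)* should deduce the same function pointer type");
static_assert(std::is_same<decltype(viaAuto), bool (*)(int, bool)>::value,
	"the deduced type should be a plain function pointer");
```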

nvdaHelper/remote/apiHook.h

feerrenrut · 2022-01-06T08:06:19Z

nvdaHelper/remote/apiHook.h

-
+ * @param realFunction the name of the function you wish to hook.
+ * @param fakeFunction the function you wish  to be called instead of the original one.
+ * @param targetPointerRef Pointer to the target pointer to which the detour will be attached.


I'm not sure this is accurate, this just seems to be a way to maintain access to the original function (by saving its address). Look at the implementation in nvdaHelper/remote/apiHook.cpp:52, it overrides this with the realFunction.

The same pattern is used by Detours: https://github.com/microsoft/Detours/wiki/Using-Detours

Given this, we don't really need the realFunction argument, though it is helpful for type checking, making it harder to mess up.
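The role of the saved pointer can be sketched with a toy model. This is not the Detours API and performs no real hooking: a "hookable" function is modeled here as a plain function pointer slot that callers go through, and all names (`BeepFn`, `beepSlot`, `savedRealBeep`, `toyAttach`) are hypothetical. It only shows why the replacement needs a saved pointer to reach the original.

```cpp
// Toy model only: real hooking libraries patch machine code; here callers
// simply invoke the function through a slot that can be redirected.
using BeepFn = int (*)(int);

int realBeep(int freq) { return freq; }

BeepFn beepSlot = realBeep;       // what callers invoke
BeepFn savedRealBeep = nullptr;   // plays the role of the variable behind targetPointerRef

// The replacement forwards to the original through the saved pointer.
int fakeBeep(int freq) { return savedRealBeep(freq) * 2; }

template<typename funcType>
bool toyAttach(funcType* slot, funcType fakeFunction, funcType* targetPointerRef) {
	if (!slot || !*slot || !fakeFunction || !targetPointerRef) return false;
	*targetPointerRef = *slot;  // save the address of the original first
	*slot = fakeFunction;       // then redirect callers to the replacement
	return true;
}
```

After `toyAttach(&beepSlot, fakeBeep, &savedRealBeep)`, calls through `beepSlot` reach `fakeBeep`, while `savedRealBeep` still reaches `realBeep`. Because all three arguments share one template parameter, a replacement with a mismatched signature fails to compile.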

feerrenrut · 2022-01-06T08:12:44Z

nvdaHelper/remote/gdiHooks.cpp

-	real_ScriptStringOut=apiHook_hookFunction_safe("USP10.dll",ScriptStringOut,fake_ScriptStringOut);
-	real_ScriptTextOut=apiHook_hookFunction_safe("USP10.dll",ScriptTextOut,fake_ScriptTextOut);
+	// Hook needed functions
+	apiHook_hookFunction_safe(TextOutA, hookClass_TextOut<char>::fakeFunction, &hookClass_TextOut<char>::realFunction);


The definitions of these realFunction vars are more complicated than they need to be. I think they should be initialized with the real function rather than NULL.

Co-authored-by: Reef Turner <feerrenrut@users.noreply.github.com>

LeonarddeR · 2022-01-06T16:00:21Z

@feerrenrut I'm not sure what you want me to do with the remaining comments. You're right that the safe part is meant to type check, and that's why I left the real function argument intact even though it is not strictly necessary any more.

feerrenrut · 2022-01-07T09:40:03Z

I think the doc strings for those functions should be updated to be clearer about their design (explaining what "safe" means), and also about the targetPointerRef param. Then, in a subsequent PR, let's update the definitions of the variables used for the targetPointerRef param. They can just be done with auto *savedRealFunctionTarget = realFunction;

feerrenrut

Looks good to me. Thanks @LeonarddeR

LeonarddeR added 2 commits October 19, 2021 19:30

Swap minhook for Microsoft Detours

676f430

Rename detours submodule to lower case

bead51e

seanbudd reviewed Oct 20, 2021

readme.md

Better logging here and there

a2dc347

Log a debug instead of error

d4df157

LeonarddeR changed the title ~~Proof of concept: Swap minhook for Microsoft Detours~~ Swap minhook for Microsoft Detours Oct 22, 2021

LeonarddeR marked this pull request as ready for review October 22, 2021 09:23

LeonarddeR requested a review from a team as a code owner October 22, 2021 09:23

LeonarddeR requested a review from seanbudd October 22, 2021 09:23

Get rid of the macro

2788f66

LeonarddeR added this to the 2022.1 milestone Nov 24, 2021

michaelDCurran requested changes Nov 24, 2021

nvdaHelper/remote/injection.cpp

LeonarddeR and others added 3 commits November 25, 2021 06:25

Update nvdaHelper/remote/injection.cpp

d1aeb74

Co-authored-by: Michael Curran <mick@nvaccess.org>

Merge remote-tracking branch 'origin/master' into detours

2717d94

Merge remote-tracking branch 'origin/master' into detours

60be601

feerrenrut self-assigned this Dec 14, 2021

feerrenrut suggested changes Jan 6, 2022

LeonarddeR and others added 2 commits January 6, 2022 09:17

Update nvdaHelper/remote/apiHook.h

9e7d320

Co-authored-by: Reef Turner <feerrenrut@users.noreply.github.com>

Merge remote-tracking branch 'origin/master' into detours

9bacc63

LeonarddeR added 2 commits January 7, 2022 15:04

Attempt to clarify doc string

2cb699a

Macro > template

b67c173

LeonarddeR requested review from feerrenrut and michaelDCurran January 7, 2022 14:05

feerrenrut approved these changes Jan 10, 2022

Update changes file for PR nvaccess#12964

78e0fd1

feerrenrut merged commit 1e4f869 into nvaccess:master Jan 10, 2022
