Mailsync leaks handles #388

c-lan · 2017-12-03T02:11:23Z

Mailsync processes running in the background leak massive amounts of OS handles. All of the leaked handles are handles to events that were never signaled.

Basically all operations on the account (including sync) increase the handle count by amounts varying from 1 to over 100k. Simply opening a message that Mailspring needs to download creates >1k of handles.
Log file is available here

Over time this leads to millions of open handles, which causes excessive memory usage and slight reduction of performance when opening unread messages. OS functionality is degraded - after certain number of handles, the NtQuerySystemInformation(SystemHandleInformation) API for querying all handles open in the OS starts to fail and debugging/security/anti-cheat tools cease to work correctly.

I encountered this issue with two Gmail accounts I tested.

Are there any related issues?

No

What operating system are you using?

Windows 10 x64 version 16299

What version of Mailspring are you using?

1.0.9

--

Bug?

Do you have any third-party plugins installed? If so, which ones?

No

Is the issue related to a specific email provider (Gmail, Exchange, etc.)?

Can't determine, all my accounts are on Gmail

Is the issue reproducible with a particular attachment, message, signature, etc?

Any message, any Gmail account

The text was updated successfully, but these errors were encountered:

bengotow · 2017-12-03T21:32:09Z

Hey! Thanks for the detailed report, this is really interesting. Let me see if this is also happening on Mac and Linux, it's definitely odd that it'd be windows-specific.

bengotow · 2017-12-04T01:18:10Z

Quick update: I've been able to reproduce this. Inspecting the mailsync process in windbg shows that all of these leaked event handles are created here:

0x00007ffa47cc59d4: ntdll!NtCreateEvent+0x0000000000000014
0x00000000538146e2: wow64!whNtCreateEvent+0x0000000000000062
0x0000000053816245: wow64!Wow64SystemServiceEx+0x0000000000000155
0x0000000053801c87: wow64cpu!ServiceNoTurbo+0x000000000000000b
0x000000005381bdb2: wow64!RunCpuSimulation+0x0000000000000022
0x000000005381bcc0: wow64!Wow64LdrpInitialize+0x0000000000000120
0x00007ffa47c96e91: ntdll!_LdrpInitialize+0x00000000000000dd
0x00007ffa47c96d5e: ntdll!LdrInitializeThunk+0x000000000000000e
0x0000000076f8707c: ntdll_76f10000!NtCreateEvent+0x000000000000000c
0x00000000767fccd6: KERNELBASE!CreateEventExW+0x0000000000000066
0x00000000767fcc33: KERNELBASE!CreateEventA+0x0000000000000033
0x00000000658a28c1: pthreadVC2!pthread_timechange_handler_np+0x00000000000015c4
--------------------------------------
Handle = 0x00000000000ef940 - OPEN
Thread ID = 0x0000000000000f04, Process ID = 0x00000000000007f0

It looks like this particular event is broadcast to condition_variables (which mailsync uses pretty extensively) so they can evaluate whether they should wake (https://www.sourceware.org/pthreads-win32/manual/pthread_timechange_handler_np.html).

It's unclear why these aren't getting cleaned up, and also unclear why pthread_timechange_handler_np is being called so frequently. It's docs say "To improve tolerance against operator or time service initiated system clock changes... this routine can be called by an application when it receives a WM_TIMECHANGE message from the system.", which doesn't sound like something that should be happening a zillion times.

bengotow · 2017-12-04T01:22:07Z

Interesting - this page notes that virtual machine time sync can do this (https://www.greyware.com/kb/KB2015.401.asp: "Another possible cause of unexpected clock change are time sync features of virtual machines. Be sure that the VMWare Tools/Hyper-V Integration Services Time Sync features are turned off for your VMs.")

Any chance you're running Mailspring on Windows inside a VM? (I'm using VMWare Fusion)

c-lan · 2017-12-04T01:24:28Z

No, but I have Hyper-V enabled. It always runs a hypervisor underneath my host system, but as far as I know no time sync features touch the host machine.

bengotow · 2017-12-04T01:30:29Z

Hey! Thanks for the quick reply. In my VM, I disabled Time Synchronization from the virtual machine settings and it didn't have any effect, but disabling Hyper-V in Windows (by typing bcdedit /set hypervisorlaunchtype off on an elevated command prompt) fixed the issue.

Definitely need to find a workaround for this (since Hyper-V is enabled by default I think...) but this should narrow it down a lot!

c-lan · 2017-12-04T01:35:43Z

I just tried disabling Hyper-V according your suggestion and was still able to reproduce the bug.

c-lan · 2017-12-04T01:52:05Z

Also, your htrace stacktrace is a bit misleading. pthreadVC2!pthread_timechange_handler_np+0x00000000000015c4 is in fact inside pthread_mutex_init exported function.

Edit: after a quick look I think the handles are created by the constructor named mailcore::AutoreleasePool::AutoreleasePool(void). The corresponding destructor function does not directly nor indirectly call pthread_mutex_destroy.

Edit2: not sure if this is the matching source, but could this be the culprit? The destructor does not free the mutex.
https://github.com/MailCore/mailcore2/blob/f708ce74e23b61ec6e5ae958eba0b8bcd8831a1e/src/core/basetypes/MCObject.cpp#L43

c-lan · 2017-12-04T02:50:43Z

Not sure if github sends notifications about comment edits, so let me ping: @bengotow

bengotow · 2017-12-04T04:17:23Z

Hey! Ahh good catch—that MCObject lock is definitely suspicious. Just read it over and it seems like there should be a call to pthread_mutex_destroy(&mLock); in the object destructor. I'm going to insert it and see if things look better.

bengotow · 2017-12-04T04:40:21Z

I recompiled the app with that additional line and it 1) doesn't crash and 2) prevents the handle count from increasing indefinitely. (I also turned Hyper-V back on and confirmed that I see the 20k+ open handles without the change.)

I think that's a wrap! Really glad we got to the bottom of this. We've had some reports of Mailspring causing Windows to "hang for an extended period of time" when the computer wakes from sleep, and I think it was caused by this issue (having a zillion leaked handles around when WM_TIMECHANGE is emitted).

I'm gonna PR this change into Mailcore2 and we'll see if @dinhviethoa has any thoughts.

dinhvh · 2017-12-04T04:42:17Z

Go ahead and send this PR! Thanks a lot!

bengotow · 2017-12-08T02:26:05Z

Hey! The fix for this has shipped in 1.0.10 - thanks for the help getting this narrowed down and fixed quickly.

foundry376-bot · 2021-03-08T14:48:01Z

This issue has been mentioned on Mailspring Community. There might be relevant details there:

https://community.getmailspring.com/t/event-handle-leak-in-mailsync-exe-on-windows/1022/1

bengotow added a commit that referenced this issue Dec 4, 2017

Free MCObject locks to prevent leaked handles #388

fdbffbf

bengotow added bug done-pending-release windows labels Dec 4, 2017

bengotow mentioned this issue Dec 4, 2017

[Windows] Significant performance degradation, especially when waking from sleep #35

Closed

bengotow closed this as completed Dec 8, 2017

Foundry376 locked and limited conversation to collaborators Mar 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mailsync leaks handles #388

Mailsync leaks handles #388

c-lan commented Dec 3, 2017

bengotow commented Dec 3, 2017

bengotow commented Dec 4, 2017

bengotow commented Dec 4, 2017 •

edited

Loading

c-lan commented Dec 4, 2017

bengotow commented Dec 4, 2017

c-lan commented Dec 4, 2017

c-lan commented Dec 4, 2017 •

edited

Loading

c-lan commented Dec 4, 2017

bengotow commented Dec 4, 2017

bengotow commented Dec 4, 2017

dinhvh commented Dec 4, 2017

bengotow commented Dec 8, 2017

foundry376-bot commented Mar 8, 2021

Mailsync leaks handles #388

Mailsync leaks handles #388

Comments

c-lan commented Dec 3, 2017

Are there any related issues?

What operating system are you using?

What version of Mailspring are you using?

Do you have any third-party plugins installed? If so, which ones?

Is the issue related to a specific email provider (Gmail, Exchange, etc.)?

Is the issue reproducible with a particular attachment, message, signature, etc?

bengotow commented Dec 3, 2017

bengotow commented Dec 4, 2017

bengotow commented Dec 4, 2017 • edited Loading

c-lan commented Dec 4, 2017

bengotow commented Dec 4, 2017

c-lan commented Dec 4, 2017

c-lan commented Dec 4, 2017 • edited Loading

c-lan commented Dec 4, 2017

bengotow commented Dec 4, 2017

bengotow commented Dec 4, 2017

dinhvh commented Dec 4, 2017

bengotow commented Dec 8, 2017

foundry376-bot commented Mar 8, 2021

bengotow commented Dec 4, 2017 •

edited

Loading

c-lan commented Dec 4, 2017 •

edited

Loading