Mac OS is crashing when debugger stops on a breakpoint #47

dekelev · 2020-01-13T10:51:29Z

I've encountered an issue with multiple Mac laptops running the latest Catalina OS.

The scenario is running two processes with Feathers services and feathers-distributed, then stopping the first process on a debug breakpoint. While it is paused, the second process keeps sending hello messages on the channels. This causes "socket stress" and leads to a TCP Zero-Window condition. A short while later, the OS crashes and reboots (apparently an Apple bug).

The solution that I've come up with is to run the second process as a fork of the first process and send it heartbeat messages every second. When the fork detects the absence of heartbeat messages for more than 2 seconds, it stops all channels opened by feathers-distributed, and it resumes them when a heartbeat is received again.

I'm wondering if this is a known issue, because it is easily reproduced locally with either Redis or broadcast, though the workaround applies only to Redis. With broadcast, the process fails after resuming the channels due to a closed socket.

This is the gist of stopping/resuming all channels:

const stopChannels = app => {
  stop(app.serviceSubscriber);
  stop(app.servicePublisher);

  for (const service of Object.values(app.services)) {
    stop(service.requester);
    stop(service.responder);
    stop(service.serviceEventsSubscriber);
    stop(service.serviceEventsPublisher);
  }
};

const startChannels = app => {
  start(app.serviceSubscriber);
  start(app.servicePublisher);

  for (const service of Object.values(app.services)) {
    start(service.requester);
    start(service.responder);
    start(service.serviceEventsSubscriber);
    start(service.serviceEventsPublisher);
  }
};

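// Stop discovery on a channel, skipping channels that are not set.
const stop = channel => {
  if (channel) {
    channel.discovery.stop();
  }
};

// Restart discovery on a channel, skipping channels that are not set.
const start = channel => {
  if (channel) {
    channel.discovery.start();
  }
};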

We can integrate this into the library, since the project should not manage the list of opened channels.
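The heartbeat side of the workaround described above could look roughly like the following. This is a minimal sketch, not the code actually used: `createHeartbeatWatchdog`, `onLost`, `onRestored` and the injectable `now` clock are hypothetical names introduced for illustration, and the 2-second timeout is taken from the description above.

```javascript
// Hypothetical watchdog for the forked process. All names here are
// illustrative and not part of feathers-distributed or cote.
function createHeartbeatWatchdog({ timeoutMs = 2000, onLost, onRestored, now = Date.now }) {
  let lastBeat = now();
  let paused = false;

  return {
    // Call whenever a heartbeat message arrives from the parent process.
    beat() {
      lastBeat = now();
      if (paused) {
        paused = false;
        onRestored();
      }
    },
    // Call periodically to detect a missing heartbeat.
    check() {
      if (!paused && now() - lastBeat > timeoutMs) {
        paused = true;
        onLost();
      }
    },
    isPaused: () => paused
  };
}

// Possible wiring (assumes the second process was started with
// child_process.fork, so process.on('message') and child.send are available):
//
//   // in the fork
//   const watchdog = createHeartbeatWatchdog({
//     onLost: () => stopChannels(app),
//     onRestored: () => startChannels(app)
//   });
//   process.on('message', msg => { if (msg === 'heartbeat') watchdog.beat(); });
//   setInterval(() => watchdog.check(), 500);
//
//   // in the parent
//   setInterval(() => child.send('heartbeat'), 1000);
```

When the parent is paused on a breakpoint its event loop stops, so its interval timer no longer fires and the fork sees the heartbeats go missing.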


claustres · 2020-01-13T11:10:08Z

We experienced some issues with network performance using the cote defaults, so we provide different default options in the module.

It might be interesting to have methods to stop/start event distribution, but the fact that it is restricted to Redis can be a problem. Moreover, this design decision can't be made only because a specific debugger or a specific OS version is crashing; we might need more use cases IMHO.

dekelev · 2020-01-13T11:31:06Z

Locally, I only work with Redis, so I haven't spent much time figuring out if the broadcast can benefit from a similar solution.

I'm using the default feathers-distributed options and did notice that cote was shipping with much lower defaults.

Even though the hello messages are sent every 10 seconds, the number of services we have (~150) made it unusable when stopping on a breakpoint. It usually happens within 10 to 60 seconds, and it always crashes the OS.

claustres · 2020-01-13T11:44:04Z

We also observed this kind of problem because we have about 100 services. However, it seems pretty strange to me that you experience this problem after only a few seconds. As far as I understand, each service sends a hello message every 10s by default, so with about 100 services we get 600 messages after 60s, not enough to crash an OS!

In any case, we are asking about this on the cote Slack.
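The back-of-the-envelope estimate above can be written out as a small calculation. This is only a sketch: `helloMessageCount` is a hypothetical helper, and it assumes exactly one hello message per service per interval, which the thread states as an understanding rather than a measured figure.

```javascript
// Hypothetical helper: estimates how many hello messages accumulate while
// a process is paused, assuming one hello per service per interval.
function helloMessageCount(services, elapsedSeconds, intervalSeconds = 10) {
  return services * Math.floor(elapsedSeconds / intervalSeconds);
}

console.log(helloMessageCount(100, 60)); // 600
console.log(helloMessageCount(150, 60)); // 900
```

If each service opens several channels that each send their own hello messages, the real count would be a multiple of this figure.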

dekelev · 2020-01-13T12:27:50Z

Thanks. It usually takes a minute or two to crash the OS with ~150 services (without Feathers events publishing), and for a developer, staying on a breakpoint for more than a minute is not a rare use case.

claustres · 2020-01-15T14:34:21Z

Closing in favor of #48.

dekelev changed the title ~~Mac OSX is crashing when debugger stops on a breakpoint~~ Mac OS is crashing when debugger stops on a breakpoint Jan 13, 2020

claustres closed this as completed Jan 15, 2020
