
Ruby: Provide reset mechanism for low-level resources #8798

Open
blowmage opened this issue Nov 19, 2016 · 30 comments · Fixed by #33430

Comments

@blowmage
Contributor

Please add a mechanism to be called on the Ruby GRPC client to reset low-level resources. This will be useful when the Ruby GRPC client is in a process that has been forked and resources need to be reset. This will help when Ruby GRPC is used in Rails apps using Puma or Unicorn, which regularly fork processes.

In #7951 @soltanmm-google said:

Forking processes and using gRPC across processes is not supported behavior due to very low-level resource issues. Short of having a way to introspect the kernel resources we're using and micromanaging them from some wrapped call to fork in a system-dependent way (which... just... no, heck no) this isn't tractable.

The Ruby GRPC client does not need to micromanage kernel resources or detect on its own that its process has been forked; the user can invoke the reset on process fork. All we need is a mechanism to do so.

@murgatroid99
Member

@ctiller @nicolasnoble What would we need to do to create this?

@soltanmm-google
Contributor

soltanmm-google commented Dec 13, 2016

If Ruby provides a way to just drop the module s.t. it's entirely unloaded, and if Ruby's wrapper calls grpc_shutdown deterministically on module clean-up, then that should be sufficient, and gRPC doesn't need to do anything (or really even should), right?

And that isn't something gRPC should provide as an explicit capability if the language supports it.

@jrun

jrun commented Feb 2, 2017

Is there anything I can do to help move this issue forward? This issue prevents me from using the grpc-dependent google-cloud-ruby libraries in my internal services.

@apolcyn
Contributor

apolcyn commented Feb 2, 2017

@jrun this is under discussion but there are some difficulties around it.

But getting some more specific use cases and problems would be helpful. For example, do you need "pre-fork" and "post-fork" hooks to reset an active gRPC library, or would deferred library startup be sufficient?

@jrun

jrun commented Feb 2, 2017

Thanks for following up. In our specific use case a "post-fork" callback is needed to reset active gRPC connections. Unfortunately a deferred library startup isn’t sufficient.

Our backend Ruby services use a prefork model. A post-fork callback is used to re-establish network connections (e.g. database, messaging, caching services) in the forked process. A deferred library startup isn’t sufficient because the master process often needs to access the same APIs as the child processes.

For example, the master may initially read its configuration from Cloud Datastore to determine the set of child workers to spawn. The child workers may then need to read from and write to Cloud Datastore as part of their operation.

Another example is Stackdriver Error Reporting. The master and child workers need to be able to write to the Error Reporting service.

@jrun

jrun commented Feb 2, 2017

I want to clarify one point in an effort to avoid any miscommunication. We don't need the gRPC library to be provided a "post-fork" callback. We need a method to call that re-establishes the underlying gRPC connections which will be called from our own "post-fork" callback.

@jrun

jrun commented Feb 6, 2017

@apolcyn Is there anything additional I can provide? Does the use case I provide make sense?

@apolcyn
Contributor

apolcyn commented Feb 6, 2017

@jrun thanks for the data point, this is helpful and makes sense. AFAICS, supporting the case described here will require some complicated changes to the core C library that grpc-ruby wraps. But this is important, so I'm taking a look at how feasible it is.

@Gubbi

Gubbi commented Aug 2, 2017

@apolcyn Is there a temporary workaround recommended until this is fixed at the C-library level?

@apolcyn
Contributor

apolcyn commented Aug 2, 2017

@Gubbi avoiding use of the grpc library in the parent process before forking is the best thing to do, AFAIK. Note that since the change in #10670, the library won't initialize until the first grpc object (e.g. a channel/stub or server) is created.
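A pure-Ruby sketch of that workaround: wrap stub creation so nothing gRPC-related is built until first use, and make sure first use happens only inside the forked child. `LazyStub` is an illustrative name, and the builder block stands in for creating a real stub (e.g. `Helloworld::Greeter::Stub.new(...)`), which is what would trigger gRPC initialization.

```ruby
# Defer gRPC object creation until after fork, so the parent process
# never initializes the C-core before forking.
class LazyStub
  def initialize(&builder)
    @builder = builder
    @stub = nil
  end

  # First call runs the builder (with real gRPC, this initializes the
  # library); later calls in the same process reuse the result.
  def stub
    @stub ||= @builder.call
  end
end

builds = 0
lazy = LazyStub.new { builds += 1; :fake_stub }

# Parent: no gRPC work has happened yet, so forking is safe.
pid = Process.fork { lazy.stub } # the child builds its own stub after fork
Process.wait(pid)

lazy.stub # the parent's first use builds an independent copy
lazy.stub # reused; the builder is not called again
puts builds # => 1 (the child's build happened in its own address space)
```

The key property is that each process that calls `stub` first does so in its own post-fork state, so no process inherits another's initialized gRPC internals.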

@ctiller ctiller removed their assignment Dec 13, 2017
@ebenoist

ebenoist commented Apr 4, 2018

Having an explicit reset or shutdown and start hook would make this issue much simpler to deal with. Is it not possible to stop the underlying event loop and call the grpc_init() function again? Would exposing grpc_rb_shutdown and grpc_ruby_once_init_internal allow the underlying library to be reset?

@apolcyn
Contributor

apolcyn commented Apr 5, 2018

@ebenoist something along the lines of what you described should be possible now, and it's actually something I've been meaning to do. What I'm thinking is we can expose global "before fork" and "after fork" hooks which applications are responsible for calling before and after forking, and which will themselves basically shut down and restart the grpc library. We can probably get such an API available within the next couple of releases (not the soon-to-come 1.11 release, but probably in the 1.12 or 1.13 releases).

@ebenoist

ebenoist commented Apr 5, 2018

@apolcyn Thank you so much for your prompt response. That API makes a lot of sense to me and I'd be eager to test it out for you folks. Please let me know if there is anything I can do to help.

@stale

stale bot commented Sep 5, 2019

This issue/PR has been automatically marked as stale because it has not had any update (including commits, comments, labels, milestones, etc) for 180 days. It will be closed automatically if no further update occurs in 1 day. Thank you for your contributions!

@blowmage
Contributor Author

blowmage commented Sep 5, 2019

This issue is still outstanding, AFAIK. We still want to be able to reset the Ruby GRPC client's low-level resource after a fork.

@stale stale bot removed the disposition/stale label Sep 5, 2019
@jeremywadsack

jeremywadsack commented Jan 3, 2020

This affects us as well, using Google Cloud Monitoring (Stackdriver). I am trying to write metrics from background Resque jobs. Resque forks from the main process for each job, and the jobs all fail with this:

grpc cannot be used before and after forking

Using grpc 1.25.0.

I've looked through the discussions here and I'm not sure that the "fix" in #16332 (https://github.com/grpc/grpc/pull/16332/files#diff-40f6e37e5d9670d49001d5551bc9da82R275) is correct. It checks whether the PID has changed, which happens when the process forks.

If GRPC is now lazily loaded (I think googleapis/google-cloud-ruby#2917 (comment) says that), then shouldn't loading it after each fork work without hanging?

@blowmage
Contributor Author

blowmage commented May 6, 2020

This issue is still outstanding.

@stale stale bot removed the disposition/stale label May 6, 2020
@stale

stale bot commented Aug 5, 2020

This issue/PR has been automatically marked as stale because it has not had any update (including commits, comments, labels, milestones, etc) for 30 days. It will be closed automatically if no further update occurs in 7 days. Thank you for your contributions!

@anujbiyani

Copying @blowmage 's bump from earlier:

This issue is still outstanding, AFAIK. We still want to be able to reset the Ruby GRPC client's low-level resource after a fork.

@stale stale bot removed the disposition/stale label Aug 6, 2020
@stale

stale bot commented Nov 5, 2020

This issue/PR has been automatically marked as stale because it has not had any update (including commits, comments, labels, milestones, etc) for 30 days. It will be closed automatically if no further update occurs in 7 days. Thank you for your contributions!

@blowmage
Contributor Author

blowmage commented Nov 5, 2020

This issue is still outstanding, AFAIK. We still want to be able to reset the Ruby GRPC client's low-level resource after a fork.

@stale stale bot removed the disposition/stale label Nov 5, 2020
@fabirydel

Is there any update regarding this issue? I'm using Google Cloud Logging and absolutely must send logs from both the parent and child processes. As it stands, I can't use the stackdriver gem because of this issue. Has anyone come up with a workaround?

@jrun

jrun commented Dec 22, 2020

@fabirydel If your systems have google-fluentd installed, you can configure a forward source that listens on 127.0.0.1. The fluentd out_google_cloud output plugin documents what the JSON payload should be.

@ianks

ianks commented Mar 2, 2021

We are running into this as well. Really painful.

@stale

stale bot commented Jun 2, 2021

This issue/PR has been automatically marked as stale because it has not had any update (including commits, comments, labels, milestones, etc) for 30 days. It will be closed automatically if no further update occurs in 7 days. Thank you for your contributions!

@matthewford

bump

@syed-mohsin

bump :)

@apolcyn
Contributor

apolcyn commented Jul 7, 2023

FYI there is work in progress on this in #33430 (should get into the 1.57 release)

apolcyn added a commit that referenced this issue Jul 10, 2023
Adds experimental fork support to gRPC/Ruby

Works towards #8798 (see caveats for why this wasn't marked fixed yet)
Works towards #33578 (see caveats for why this wasn't marked fixed yet)

This leverages existing `pthread_atfork` based C-core support for
forking that python/php use, but there's a bit extra involved mainly
because gRPC/Ruby has additional background threads.

New tests under `src/ruby/end2end` show example usage.

Based on #33495

Caveats:
- Bidi streams are not yet supported (bidi streams spawn background
threads which are not yet fork safe)
- Servers not supported
- Only linux supported
@apolcyn apolcyn reopened this Jul 10, 2023
@apolcyn
Contributor

apolcyn commented Jul 10, 2023

Reopening because #33430 isn't a complete fix (forking with bidi streams is not yet supported with that, in particular)
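For reference, usage of the experimental fork support from #33430 looks roughly like the following. This is a sketch based on the PR description: the hook names (`GRPC.prefork`, `GRPC.postfork_parent`, `GRPC.postfork_child`) and the `GRPC_ENABLE_FORK_SUPPORT` environment variable are taken from that work and should be treated as illustrative; the helpers degrade to a plain fork when the gem or the feature is unavailable, so the sketch still runs anywhere fork exists.

```ruby
# Experimental gRPC/Ruby fork support (grpc >= 1.57, Linux only,
# opt-in via GRPC_ENABLE_FORK_SUPPORT=1). Hook names come from #33430.
begin
  require 'grpc'
rescue LoadError
  # grpc gem unavailable; fall through and skip the hooks below
end

def grpc_fork_hooks?
  ENV['GRPC_ENABLE_FORK_SUPPORT'] == '1' &&
    defined?(GRPC) && GRPC.respond_to?(:prefork)
end

def fork_with_grpc_hooks
  GRPC.prefork if grpc_fork_hooks?          # quiesce background threads pre-fork
  pid = Process.fork do
    GRPC.postfork_child if grpc_fork_hooks? # restart gRPC state in the child
    yield
  end
  GRPC.postfork_parent if grpc_fork_hooks?  # restart gRPC state in the parent
  pid
end

pid = fork_with_grpc_hooks { puts "child #{Process.pid} may now create stubs" }
Process.wait(pid)
puts "parent #{Process.pid} may keep using its channels"
```

Per the caveats above, this should not yet be relied on with bidi streams or servers.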

mario-vimal pushed a commit to mario-vimal/grpc that referenced this issue Jul 13, 2023