
could not restore from backend restart #615

Closed
Thinkfly opened this issue Jun 15, 2016 · 9 comments

@Thinkfly

Thinkfly commented Jun 15, 2016

I use nghttpx 1.11.1 as a gRPC proxy. Sometimes when I restart a backend, nghttpx does not recover automatically; I have to restart nghttpx and then it works again. How should I configure nghttpx to solve this? Thanks.

My configuration is as follows:
backend=127.0.0.1,23100;/test.TestService/;proto=h2;no-tls;fall=1;rise=1
backend=192.168.0.18,23100;/test.TestService/;proto=h2;no-tls;fall=1;rise=1

The log is below:

15/Jun/2016:13:01:40 +0800 PID21693 [INFO] shrpx_client_handler.cc:853 [CLIENT_HANDLER:0x7f1ac1045000] Downstream address group_idx: 5
15/Jun/2016:13:01:40 +0800 PID21693 [INFO] shrpx_client_handler.cc:882 [CLIENT_HANDLER:0x7f1ac1045000] No working downstream address found
15/Jun/2016:13:01:40 +0800 PID21693 [INFO] shrpx_downstream.cc:542 [DOWNSTREAM:0x7f1ac10b0300] dconn_ is NULL
15/Jun/2016:13:01:40 +0800 PID21693 [INFO] shrpx_http2_upstream.cc:62 [UPSTREAM:0x7f1ac1049140] Stream stream_id=2741 is being closed
15/Jun/2016:13:01:40 +0800 PID21693 [INFO] shrpx_downstream.cc:160 [DOWNSTREAM:0x7f1ac10b0300] Deleting
15/Jun/2016:13:01:40 +0800 PID21693 [INFO] shrpx_downstream.cc:190 [DOWNSTREAM:0x7f1ac10b0300] Deleted

@tatsuhiro-t
Member

Thank you for reporting this issue.
Fix committed via cddb411

@Thinkfly
Author

Thinkfly commented Jun 17, 2016

@tatsuhiro-t, thank you for the help. But I tried commit cddb411, and I have 2 backends; when I restart all of the backends one by one, I still get 503. Is "fall=1;rise=1" a best practice? And how can I configure nghttpx for backend HA?

@tatsuhiro-t
Member

It works for me. I tested with 2 nghttpd instances as backends. Could you tell us the exact reproduction steps?

fall/rise is a recent addition, and I'm not sure what the best practice is. haproxy has a similar feature, so perhaps we can follow their BCP?

Specifying multiple --backend options is the answer for HA. I'm wondering why your case does not work.
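
For reference, a minimal sketch of such a configuration, reusing the backend lines quoted above; the notes on fall/rise reflect the documented semantics of those parameters and are not taken from this thread:

# fall=1: mark the backend offline after 1 consecutive failed connection attempt
# rise=1: mark it online again after 1 successful health check
backend=127.0.0.1,23100;/test.TestService/;proto=h2;no-tls;fall=1;rise=1
backend=192.168.0.18,23100;/test.TestService/;proto=h2;no-tls;fall=1;rise=1

With two (or more) such lines in the same group, nghttpx distributes requests across the backends and routes around any that are currently marked offline.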

@Thinkfly
Author

I restart the backends one by one and repeat this; on the second round of restarts, it happens. But now, when I delay a few seconds between restarts, it does not happen again. Maybe I restarted them too quickly before. Thanks.

@tatsuhiro-t
Member

If there is a moment when all backend servers are down, and a request comes in at that particular moment, nghttpx may return 503 since there are no working servers. Note that nghttpx takes some time to detect that a backend server has come back online.

@Thinkfly
Author

How long does it take to detect that the backend is back online?

@tatsuhiro-t
Member

nghttpx uses exponential backoff, and if it has reached the maximum (failed to connect to a backend server 10 times in a row), the health check interval is ~130 seconds. I'm fine with adding a new configuration option to cap the maximum health check interval.
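
As a rough illustration of that figure: assuming a doubling backoff whose base interval is inferred by working backwards from the ~130 seconds above (roughly 125 ms; this is an inference, not a value read from the nghttpx source), the interval grows like this:

# Hypothetical illustration only; the exact constants inside nghttpx may differ.
BASE = 0.125  # assumed base interval in seconds, chosen so 10 failures give ~130 s

for failures in range(1, 11):
    interval = BASE * 2 ** failures
    print(f"after {failures:2d} consecutive failures: retry in ~{interval:.1f} s")

The last line printed is about 128 s, which matches the ~130 seconds mentioned above.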

@tatsuhiro-t
Member

Added --backend-max-backoff option:

  --backend-max-backoff=<DURATION>
              Specify  maximum backoff  interval.  This  is used  when
              doing health  check against offline backend  (see "fall"
              parameter  in --backend  option).   It is  also used  to
              limit  the  maximum   interval  to  temporarily  disable
              backend  when nghttpx  failed to  connect to  it.  These
              intervals are calculated  using exponential backoff, and
              consecutive failed attempts increase the interval.  This
              option caps its maximum value.
              Default: 2m
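
For example, to cap the backoff at 10 seconds (an illustrative value, not a recommendation), the option can be set in the configuration file in the usual long-option-without-the-leading-dashes form:

backend-max-backoff=10s

or passed on the command line as --backend-max-backoff=10s.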

@tatsuhiro-t
Member

Closing since the originator reported that the issue was fixed.
We now also offer an option to cap the maximum health check interval, which makes this issue less likely to happen.
