Requests queueing if there is no backend connection #940
No well-behaving HTTP proxy sends 50x responses when there are no backend connections; instead, all of them, being TCP socket users, just naturally reduce the client request rate. Tempesta FW, in contrast, being a part of the TCP/IP stack, handles all client requests, replying either with 50x or with responses received from the backend servers. Thus, Tempesta FW must slow down the client request rate and accurately process all the requests without errors. Consequently, the counter mentioned above just makes no sense. See discussions: #488 (comment) #1012 (comment)
Browsers automatically retry safe requests on receiving the 502 error code, so the issue concerns only non-safe requests sent under too heavy load and/or with failing backends. The system administrator also has the ability to disable connection resets on the backend side. Thus, I decrease the task severity.
The tfw_sched_get_srv_conn() call in tfw_http_req_cache_cb() assumes that it's quite improbable that there is no connection to any of the backends, so if all the backends simultaneously reset their connections, Tempesta returns a 502 response code. However, recent tests show that this state is usual and must be correctly handled. Firstly, Tempesta FW must use the same rescheduling mechanism, with request eviction by timeout, as for rescheduled requests that were ever sent to a server. Secondly, a
/proc/tempesta/servers/%group_name%/perfstat
counter must be introduced. Lastly, the current rescheduling mechanism loops, trying a new server/connection for each request to be rescheduled, even though at that point it knows precisely that there are no live connections. Instead, it should wait until a request's timer elapses or a new server connection is established. In the first case the request must be deleted; in the second, the connection should be tried, but gracefully, not for all the pending requests at once.
Somewhat linked with #687 since the message queues must be adjusted.
It may make sense to implement #1454 beforehand to facilitate testing and debugging of the current task.
Probably the problem can be solved by dynamically allocating a new server connection if all the current connections are busy (#710) and treating the current static number of connections as a minimal provision.