Master h2 rxbuf padding #3661

mbgrydeland · 2021-08-04T11:16:01Z

This PR solves two H/2 related bugs:

Adds a buffer for incoming data frames between the session handling thread and the individual H/2 stream threads. This avoids having the session handling thread block while waiting for a stream thread to come around and request the incoming data, which would block most progress on the H/2 session.
Implements handling of padding bytes in incoming data frames. Previously we would ingore their presence and treat all the bytes as data bytes.

The patch set also improves on panic output for H/2.

bsdphk · 2021-08-05T07:37:14Z

What is the math behind not just using malloc buffers ?

dridi · 2021-08-05T07:45:30Z

What is the math behind not just using malloc buffers ?

My assumption is that we use mempools for the duration of the task and in this case it's more opportunistic (it has to be h2 and there needs to be a request body). It's also a form of transient storage since this is a request body.

Not sure why @mbgrydeland chose this, I only assumed.

mbgrydeland · 2021-08-05T08:38:15Z

What is the math behind not just using malloc buffers ?

It is the unbounded outside influence memory usage that is the concern, and why I wanted to tie it into the stevedores. That way the stevedore's may purge content using LRU as the result of clients asking us to buffer request body data. Each H/2 stream may ask us to buffer 64k of data, so the total sum may become significant easily.

Though this uses the Transient stevedore, which by default is unbounded. So out of the box there will be no limit.

bsdphk · 2021-08-06T08:49:41Z

I can see the point in reusing the stevedore machinery.

But re-purposing Transient is wrong: The limits I would want for this are very different from the limits, if any, I'd want for Transient.

One solution: Invent a new "magic" stevedore for this.

Other solution: Make which stevedore is used a parameter.

I probably prefer the latter.

bsdphk

This review is from a quick flexelint run

There are also a number of complaints about "if (size > cache_param->fetch_maxchunksize)" constructs mixing signed/unsigned.

bin/varnishd/storage/stevedore.c

bin/varnishd/http2/cache_http2_panic.c

mbgrydeland · 2021-08-13T09:03:18Z

FWIW we don't use "stevedore" in the varnishd manual, I would prefer "h2_rxbuf_storage".

Valid point. I will reword things accordingly.

bsdphk · 2021-08-18T06:45:36Z

I think I'm OK with this now.

mbgrydeland · 2021-08-18T12:50:53Z

Rebased, squashed and reordered for a better bisect experience

mbgrydeland · 2021-08-18T12:51:32Z

FWIW we don't use "stevedore" in the varnishd manual, I would prefer "h2_rxbuf_storage".

Valid point. I will reword things accordingly.

Renamed to "h2_rxbuf_storage"

nigoroll · 2021-08-23T17:18:06Z

I do not find time for an appropriate review at the moment and do not want to hold back this PR.

This is an API for getting an arbitrary buffer through the stevedores. The stevedore in question may then deploy LRU nuking or other measures to control resource usage.

The H/2 session thread does have a VSL buffer already set up, but the 'wrk->vsl' pointer was not set. This caused issue for e.g. LRU_NukeOne() as it wants to log. Set the buffer for the duration that the worker is dedicated as an H/2 session thread.

This implements stream data handling using a buffer between the H/2 session thread and each stream thread. This is needed to avoid head of line blocking on the session socket when a data frame is received for a stream thread that is not yet ready to receive it. The buffer used will have to be as large as the send window the peer expects at the time the stream is opened. This will typically be 65535 unless the h2_initial_window_size parameter has been changed. Stream window updates will then be issued only once data is removed from the buffer by the request body being consumed from the request handling thread, limited in size to what space is then available in the buffer.

H2 streams waiting for request body data will timeout after timeout_idle seconds if no new data on the stream is being received. This will ensure that individual H2 streams can be reaped if there is no data received from the peer.

We have a strict min at the protocol default here. This is because we don't have the 'use settings only after peer ack' in place yet. If the value is lower than the protocol default, the very first stream could get a flow control error.

This makes it easier to not have to know exactly when and how many window updates to expect in a test case.

With the new request body data handling, Varnish changes behaviour significantly wrt to stream window updates sent to the client. Window updates will only be sent once the data is consumed by the client through the request body VFP handling. Test cases that rely on receiving a window update to sync the H/2 stream needs to be adopted.

This is the test case that fails if these changes aren't in tree. Note the commented out rxwinup commands that are necessary for the proper fail mode when run without the varnishtest window update changes.

According to the spec the padding is an 8-bit field, and fields should be treated as unsigned unless otherwise specified, which it is not for any of the padding related places. Allow varnishtest to generate padding up to 255 bytes long.

This was found lacking in our H2 implementation. Previously we would have included any padding bytes in the request body. Possibly it would have caused errors if there also was a C-L present, or more likely just corrupt request bodies. If the client sends nothing but padding bytes and ends up consuming the entire stream window with no actual request bytes buffered, the request thread side of things would not send any stream window updates. Handle this corner case by sending a window update from the session thread.

This parameter allows the user to choose which storage backend / stevedore that the H/2 receive buffers are allocated from. By default it uses Transient.

dridi · 2021-08-25T09:31:46Z

FWIW, we could make it a regular string parameter with a simple change:

diff --git i/include/tbl/params.h w/include/tbl/params.h
index 2365cc418..90e6a34ee 100644
--- i/include/tbl/params.h
+++ w/include/tbl/params.h
@@ -1551,11 +1551,23 @@ PARAM_THREAD(
  * String parameters
  */
 
-#  define PARAM_STRING(nm, pv, def, ...) \
-	PARAM(, , nm, tweak_string, pv, NULL, NULL, def, NULL, __VA_ARGS__)
+#  define PARAM_STRING(nm, tw, pv, def, ...) \
+	PARAM(, , nm, tw, pv, NULL, NULL, def, NULL, __VA_ARGS__)
+
+PARAM_STRING(
+	/* name */	h2_rxbuf_storage,
+	/* tweak */	tweak_h2_rxbuf_storage,
+	/* priv */	&mgt_stv_h2_rxbuf,
+	/* def */	"Transient",
+	/* descr */
+	"The name of the storage backend that HTTP/2 receive buffers"
+	" should be allocated from.",
+	/* flags */	MUST_RESTART
+)
 
 PARAM_STRING(
 	/* name */	cc_command,
+	/* tweak */	tweak_string,
 	/* priv */	&mgt_cc_cmd,
 	/* def */	VCC_CC,
 	/* descr */
@@ -1568,6 +1580,7 @@ PARAM_STRING(
 
 PARAM_STRING(
 	/* name */	vcl_path,
+	/* tweak */	tweak_string,
 	/* priv */	&mgt_vcl_path,
 	/* def */	VARNISH_VCL_DIR,
 	/* descr */
@@ -1582,6 +1595,7 @@ PARAM_STRING(
 
 PARAM_STRING(
 	/* name */	vmod_path,
+	/* tweak */	tweak_string,
 	/* priv */	&mgt_vmod_path,
 	/* def */	VARNISH_VMOD_DIR,
 	/* descr */
@@ -1657,28 +1671,10 @@ PARAM_PCRE2(
 	" messages."
 )
 
-/*--------------------------------------------------------------------
- * Custom parameters with separate tweak function
- */
-
-#  define PARAM_CUSTOM(nm, pv, def, ...) \
-	PARAM(, , nm, tweak_ ## nm, pv, NULL, NULL, def, NULL, __VA_ARGS__)
-
-PARAM_CUSTOM(
-	/* name */	h2_rxbuf_storage,
-	/* priv */	&mgt_stv_h2_rxbuf,
-	/* def */	"Transient",
-	/* descr */
-	"The name of the storage backend that HTTP/2 receive buffers"
-	" should be allocated from.",
-	/* flags */	MUST_RESTART
-)
-
 #  undef PARAM_ALL
 #  undef PARAM_PCRE2
 #  undef PARAM_STRING
 #  undef PARAM_VCC
-#  undef PARAM_CUSTOM
 #endif /* defined(PARAM_ALL) */
 
 #undef PARAM_MEMPOOL

dridi · 2021-08-25T09:38:09Z

I suppose we could also call the tweak function "tweak_storage", it doesn't really have anything specific to the h2 rxbuf.

mbgrydeland · 2021-08-26T09:09:50Z

I'm not sure about making it a general tweak_storage parameter. It only works for this parameter because the default is "Transient", which is exempted from being matched against an actual defined storage. If the default is anything else, it will fail during the initial set-all-parameters-to-its-default-value-routine, which happens before command line options are processed.

I think it is fine to keep it as a special case tweak, and not try to generalise it.

dridi · 2021-08-26T09:16:46Z

I think you would sill have the same problem if any future parameter referred to a stevedore, let's imagine for example http_req_body_storage (edit: or shortlived_storage). The only safe default would be Transient until other storage backends are set up.

Refs #3442 Refs #3661

Ref #3661

And simply require string parameters to define their tweaks. Refs #3661

We could have shortlived_storage and req_body_storage parameters to stop requiring Transient for those special cases. Refs #3661

mbgrydeland mentioned this pull request Aug 4, 2021

Wrong turn at cache/cache_wrk.c:626: #3654

Closed

bsdphk reviewed Aug 6, 2021

View reviewed changes

bin/varnishd/storage/stevedore.c Outdated Show resolved Hide resolved

bin/varnishd/storage/stevedore.c Outdated Show resolved Hide resolved

bin/varnishd/http2/cache_http2_panic.c Show resolved Hide resolved

mbgrydeland force-pushed the master-h2-rxbuf-padding branch from 080c0cd to 8122a79 Compare August 18, 2021 12:50

mbgrydeland added 18 commits August 24, 2021 10:33

STV temp buffer API

d13d174

This is an API for getting an arbitrary buffer through the stevedores. The stevedore in question may then deploy LRU nuking or other measures to control resource usage.

Simple STV temp buffer implementation

dff8fb0

Use SML_AllocBuf with -smalloc

599b05e

Use SML_AllocBuf also for -sfile

1366ef4

Make h2_del_req take a non-const struct h2_req

6f22549

Make f00007.vtc stable

15a425e

Use timeout_idle as a timeout for request body

4426fb5

H2 streams waiting for request body data will timeout after timeout_idle seconds if no new data on the stream is being received. This will ensure that individual H2 streams can be reaped if there is no data received from the peer.

Make H/2 varnishtest ignore window updates while in rxreq/rxresp

8e8122a

This makes it easier to not have to know exactly when and how many window updates to expect in a test case.

Add H/2 stream data head of line blocking test case

0d65486

This is the test case that fails if these changes aren't in tree. Note the commented out rxwinup commands that are necessary for the proper fail mode when run without the varnishtest window update changes.

Add test case for large req body buffer use

64236f8

Improve the panic output when triggered on an H2 session

94d4c23

Panic dump H2 rxbuf

920b1e3

Panic H2 local and remote settings

74dd070

Test case that buffers fully before starting to consume the req body

563f679

mbgrydeland added 4 commits August 24, 2021 10:33

Add a test case for exhausting window by padding

65cf52f

New 'h2_rxbuf_storage' param to set rxbuf stevedore

862a62b

This parameter allows the user to choose which storage backend / stevedore that the H/2 receive buffers are allocated from. By default it uses Transient.

mbgrydeland force-pushed the master-h2-rxbuf-padding branch from 8122a79 to 862a62b Compare August 24, 2021 08:34

dridi merged commit 84e9926 into varnishcache:master Aug 30, 2021

dridi mentioned this pull request Aug 30, 2021

Frame #1 for rxresp was of type WINDOW_UPDATE (8) instead of HEADERS (1) #3442

Closed

dridi pushed a commit that referenced this pull request Aug 30, 2021

vtc_http2: Don't leak ignored window updates

b63725c

Refs #3442 Refs #3661

nigoroll mentioned this pull request Aug 30, 2021

h2 stream logging lost #3679

Open

nigoroll added a commit that referenced this pull request Aug 30, 2021

Add allocbuf/freebuf to the umem stevedore

de68152

Ref #3661

dridi mentioned this pull request Sep 1, 2021

varnishtest: Update h2 streams windows #3681

Closed

dridi added a commit that referenced this pull request Sep 3, 2021

param: Turn h2_rxbuf_storage into a string parameter

ff44f06

And simply require string parameters to define their tweaks. Refs #3661

dridi added a commit that referenced this pull request Sep 3, 2021

tweak: Rename the storage tweak to be neutral

e37c74b

We could have shortlived_storage and req_body_storage parameters to stop requiring Transient for those special cases. Refs #3661

dridi mentioned this pull request May 23, 2022

std.cache_req_body(BYTES size, BOOL partial = 0) #3798

Draft

dridi mentioned this pull request Sep 22, 2022

-srxbuf and -sTransient configuration #3854

Open

This was referenced Oct 16, 2023

vcl_vrt: Skip VCL execution if the client is gone #3998

Merged

h2: Integrate h2_rxbuf_storage (6.0) #4004

Merged

vcl_vrt: Skip VCL execution if the client is gone (6.0) #4006

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Master h2 rxbuf padding #3661

Master h2 rxbuf padding #3661

mbgrydeland commented Aug 4, 2021

bsdphk commented Aug 5, 2021

dridi commented Aug 5, 2021

mbgrydeland commented Aug 5, 2021

bsdphk commented Aug 6, 2021

bsdphk left a comment

mbgrydeland commented Aug 13, 2021

bsdphk commented Aug 18, 2021

mbgrydeland commented Aug 18, 2021

mbgrydeland commented Aug 18, 2021

nigoroll commented Aug 23, 2021

dridi commented Aug 25, 2021

dridi commented Aug 25, 2021

mbgrydeland commented Aug 26, 2021

dridi commented Aug 26, 2021 •

edited

Master h2 rxbuf padding #3661

Master h2 rxbuf padding #3661

Conversation

mbgrydeland commented Aug 4, 2021

bsdphk commented Aug 5, 2021

dridi commented Aug 5, 2021

mbgrydeland commented Aug 5, 2021

bsdphk commented Aug 6, 2021

bsdphk left a comment

Choose a reason for hiding this comment

mbgrydeland commented Aug 13, 2021

bsdphk commented Aug 18, 2021

mbgrydeland commented Aug 18, 2021

mbgrydeland commented Aug 18, 2021

nigoroll commented Aug 23, 2021

dridi commented Aug 25, 2021

dridi commented Aug 25, 2021

mbgrydeland commented Aug 26, 2021

dridi commented Aug 26, 2021 • edited

dridi commented Aug 26, 2021 •

edited