
groupcache: enable H2C + add benchmarks #5068

Merged: 2 commits merged into thanos-io:main on Jan 18, 2022

Conversation

GiedriusS (Member) commented on Jan 17, 2022

Add h2c benchmarks. Somehow h2c is slower than regular HTTP locally for me, probably because these aren't real-life benchmarks, i.e. establishing TCP connections locally has no cost. And if only one operation is happening at any time, then h2 has to do more work, hence more CPU usage.

With 500 parallel requests going on at the same time, h2 becomes faster than (or is at the same level as) h1 while using a minimal number of TCP connections. Thus, it will work much faster in real-life situations. So, let's enable it.

```
name                                                               time/op
GroupcacheRetrieval/h2c/seq-16                                      227µs ± 1%
GroupcacheRetrieval/h2c/parallel=500-16                            54.8µs ±20%
GroupcacheRetrieval/h1,_max_one_TCP_connection/seq-16               150µs ± 2%
GroupcacheRetrieval/h1,_max_one_TCP_connection/parallel=500-16      146µs ± 1%
GroupcacheRetrieval/h1,_unlimited_TCP_connections/seq-16            145µs ± 3%
GroupcacheRetrieval/h1,_unlimited_TCP_connections/parallel=500-16  52.8µs ± 9%

name                                                               alloc/op
GroupcacheRetrieval/h2c/seq-16                                      183kB ± 0%
GroupcacheRetrieval/h2c/parallel=500-16                             143kB ± 1%
GroupcacheRetrieval/h1,_max_one_TCP_connection/seq-16               161kB ± 0%
GroupcacheRetrieval/h1,_max_one_TCP_connection/parallel=500-16      162kB ± 0%
GroupcacheRetrieval/h1,_unlimited_TCP_connections/seq-16            161kB ± 0%
GroupcacheRetrieval/h1,_unlimited_TCP_connections/parallel=500-16   116kB ± 2%

name                                                               allocs/op
GroupcacheRetrieval/h2c/seq-16                                        283 ± 0%
GroupcacheRetrieval/h2c/parallel=500-16                               256 ± 1%
GroupcacheRetrieval/h1,_max_one_TCP_connection/seq-16                 260 ± 0%
GroupcacheRetrieval/h1,_max_one_TCP_connection/parallel=500-16        262 ± 0%
GroupcacheRetrieval/h1,_unlimited_TCP_connections/seq-16              260 ± 0%
GroupcacheRetrieval/h1,_unlimited_TCP_connections/parallel=500-16     279 ± 1%
```

Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>
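For orientation, here is a minimal sketch of how the "seq" and "parallel=500" cases above could be structured with Go's standard testing package. The doFetch helper and the wiring are hypothetical, not the actual pkg/cache/groupcache_test.go code; the real benchmark runs the same body against three clients (h2c, h1 limited to one TCP connection, h1 with unlimited connections).

```go
package cache_test

import (
	"net/http"
	"runtime"
	"testing"
)

// doFetch is a hypothetical stand-in for a single groupcache retrieval
// over the given client (e.g. a GET against the test groupcache server).
func doFetch(client *http.Client) {
	_ = client // the real test performs the request and checks the payload
}

// benchmarkRetrieval sketches the sequential and parallel sub-benchmarks.
func benchmarkRetrieval(b *testing.B, client *http.Client) {
	b.Run("seq", func(b *testing.B) {
		// One request at a time: h2 framing is pure overhead here,
		// which is why h2c loses the sequential case.
		for i := 0; i < b.N; i++ {
			doFetch(client)
		}
	})
	b.Run("parallel=500", func(b *testing.B) {
		// SetParallelism multiplies GOMAXPROCS, so divide to get roughly
		// 500 goroutines issuing requests concurrently; this is where h2
		// multiplexing over a few TCP connections pays off.
		b.SetParallelism(500 / runtime.GOMAXPROCS(0))
		b.RunParallel(func(pb *testing.PB) {
			for pb.Next() {
				doFetch(client)
			}
		})
	})
}
```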

GiedriusS (Member, Author) commented:

cc @akanshat @onprem

```
@@ -93,6 +96,12 @@ func NewGroupcacheWithConfig(logger log.Logger, reg prometheus.Registerer, conf
	cfg *CachingBucketConfig) (*Groupcache, error) {
	httpProto := galaxyhttp.NewHTTPFetchProtocol(&galaxyhttp.HTTPOptions{
		BasePath: basepath,
		Transport: &http2.Transport{
			AllowHTTP: true,
			DialTLS: func(network, addr string, cfg *tls.Config) (net.Conn, error) {
```

GiedriusS (Member, Author) commented on this diff:

For now, let's disable TLS, but in the future we can come back to this.
Member replied:

Shall we open an issue for it?

GiedriusS (Member, Author) replied:

I will add it to #5037
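The DialTLS override is where the hunk above is cut off. The usual h2c client pattern, and presumably what this PR does (this completion is an assumption, not the verbatim diff), is to dial plain TCP and ignore the TLS config entirely, which is also why the thread above talks about TLS being disabled for now:

```go
package main

import (
	"crypto/tls"
	"net"
	"net/http"

	"golang.org/x/net/http2"
)

// newH2CTransport returns a client transport speaking HTTP/2 over plaintext
// TCP (h2c). The function name is ours; the PR inlines this transport inside
// NewGroupcacheWithConfig.
func newH2CTransport() http.RoundTripper {
	return &http2.Transport{
		// Allow "http://" URLs instead of requiring "https://".
		AllowHTTP: true,
		// Skip the TLS handshake and open a plain TCP connection.
		DialTLS: func(network, addr string, _ *tls.Config) (net.Conn, error) {
			return net.Dial(network, addr)
		},
	}
}

func main() {
	client := &http.Client{Transport: newH2CTransport()}
	_ = client
}
```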

kakkoyun previously approved these changes on Jan 18, 2022

kakkoyun (Member) left a comment:

Overall LGTM.

Just a couple of testing nits.

Several review threads on pkg/cache/groupcache_test.go were marked resolved (one outdated).
GiedriusS merged commit ba9b36d into thanos-io:main on Jan 18, 2022
GiedriusS deleted the enable_h2c branch on January 18, 2022 15:46
onprem (Member) left a comment:

Thanks, this looks good. I have a minor suggestion which we can take care of in a follow-up.

Sorry for the late review.

```
go func() {
	if err = httpServer.ListenAndServe(); err != nil {
		fmt.Printf("failed to listen: %s\n", err.Error())
```

onprem (Member) commented:

We should do an os.Exit(1) here as well. If we fail to start the http server, we should fail the test here instead of continuing with it.

```
}()
go func() {
	if err = httpServerH2C.ListenAndServe(); err != nil {
		fmt.Printf("failed to listen: %s\n", err.Error())
```

onprem (Member) commented:

Ditto, os.Exit(1) after this.
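A hedged sketch of what applying both suggestions could look like (they were addressed in the follow-up commits referenced below). The server wiring, the addresses, and the use of golang.org/x/net/http2/h2c are assumptions for illustration, not the test's exact setup:

```go
package main

import (
	"fmt"
	"net/http"
	"os"

	"golang.org/x/net/http2"
	"golang.org/x/net/http2/h2c"
)

func main() {
	mux := http.NewServeMux()

	// Hypothetical addresses; the real test chooses its own ports.
	httpServer := &http.Server{Addr: "127.0.0.1:18080", Handler: mux}
	// h2c server: wrap the handler so HTTP/2 without TLS is accepted.
	httpServerH2C := &http.Server{
		Addr:    "127.0.0.1:18081",
		Handler: h2c.NewHandler(mux, &http2.Server{}),
	}

	for _, srv := range []*http.Server{httpServer, httpServerH2C} {
		srv := srv
		go func() {
			// Reviewer's suggestion: don't just log, exit so the benchmark
			// fails immediately instead of running without a server.
			// (A graceful Shutdown returns http.ErrServerClosed, which one
			// would likely want to exclude from this check.)
			if err := srv.ListenAndServe(); err != nil {
				fmt.Printf("failed to listen: %s\n", err.Error())
				os.Exit(1)
			}
		}()
	}

	select {} // block forever; in the test, benchmarks run here instead
}
```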

GiedriusS added a commit to GiedriusS/thanos that referenced this pull request on Jan 20, 2022

Add fixes according to the suggestions here
thanos-io#5068 (comment).

Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>
GiedriusS added a commit that referenced this pull request Jan 26, 2022
Add fixes according to the suggestions here
#5068 (comment).

Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>
Nicholaswang pushed a commit to Nicholaswang/thanos that referenced this pull request Mar 6, 2022
Add fixes according to the suggestions here
thanos-io#5068 (comment).

Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>
Signed-off-by: Nicholaswang <wzhever@gmail.com>