You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Backups intermittently show Failed column as non zero randomly on few set of nodes. Please let me know what could be the possible root cause here. Backups are critical part of any prod setup and intermittent failures have become an issue for us.
Using AWS Scylla AMIs
Scylla version : 5.2.7
Scylla manager version : 3.2.3
Below are agent logs --
Dec 07 23:23:27 ip-172-31-129-136 scylla-manager-agent[19262]: {"L":"ERROR","T":"2023-12-07T23:23:27.268+0530","N":"http","M":"GET /storage_service/scylla_release_version","from":"172.31.129.6:46824","status":502,"bytes":0,"duration":"737ms","S":"github.com/scylladb/go-log.Logger.log\n\tgithub.com/scylladb/go-log@v0.0.7/logger.go:101\ngithub.com/scylladb/go-log.Logger.Error\n\tgithub.com/scylladb/go-log@v0.0.7/logger.go:84\nmain.(*logEntry).Write\n\tgithub.com/scylladb/scylla-manager/v3/pkg/cmd/agent/log.go:53\ngithub.com/go-chi/chi/v5/middleware.RequestLogger.func1.1.1\n\tgithub.com/go-chi/chi/v5@v5.0.0/middleware/logger.go:54\ngithub.com/go-chi/chi/v5/middleware.RequestLogger.func1.1\n\tgithub.com/go-chi/chi/v5@v5.0.0/middleware/logger.go:58\nnet/http.HandlerFunc.ServeHTTP\n\tnet/http/server.go:2122\ngithub.com/go-chi/chi/v5.(*Mux).ServeHTTP\n\tgithub.com/go-chi/chi/v5@v5.0.0/mux.go:87\nnet/http.serverHandler.ServeHTTP\n\tnet/http/server.go:2936\nnet/http.(*conn).serve\n\tnet/http/server.go:1995"}
Dec 07 23:23:26 ip-172-31-129-136 scylla-manager-agent[19262]: {"L":"INFO","T":"2023-12-07T23:23:26.752+0530","M":"http: TLS handshake error from 172.31.129.6:47344: EOF"}
Dec 07 23:23:27 ip-172-31-129-136 scylla-manager-agent[19262]: {"L":"INFO","T":"2023-12-07T23:23:27.267+0530","M":"http: proxy error: context canceled"}
There is a possibility that this is connected to #3298 that has been fixed with SM 3.2.5 release (agent runs out of memory when performing a big backup).
Could you upgrade to SM 3.2.5 and verify that this problem is solved?
Backups intermittently show Failed column as non zero randomly on few set of nodes. Please let me know what could be the possible root cause here. Backups are critical part of any prod setup and intermittent failures have become an issue for us.
Using AWS Scylla AMIs
Scylla version : 5.2.7
Scylla manager version : 3.2.3
Below are agent logs --
Dec 07 23:23:27 ip-172-31-129-136 scylla-manager-agent[19262]: {"L":"ERROR","T":"2023-12-07T23:23:27.268+0530","N":"http","M":"GET /storage_service/scylla_release_version","from":"172.31.129.6:46824","status":502,"bytes":0,"duration":"737ms","S":"github.com/scylladb/go-log.Logger.log\n\tgithub.com/scylladb/go-log@v0.0.7/logger.go:101\ngithub.com/scylladb/go-log.Logger.Error\n\tgithub.com/scylladb/go-log@v0.0.7/logger.go:84\nmain.(*logEntry).Write\n\tgithub.com/scylladb/scylla-manager/v3/pkg/cmd/agent/log.go:53\ngithub.com/go-chi/chi/v5/middleware.RequestLogger.func1.1.1\n\tgithub.com/go-chi/chi/v5@v5.0.0/middleware/logger.go:54\ngithub.com/go-chi/chi/v5/middleware.RequestLogger.func1.1\n\tgithub.com/go-chi/chi/v5@v5.0.0/middleware/logger.go:58\nnet/http.HandlerFunc.ServeHTTP\n\tnet/http/server.go:2122\ngithub.com/go-chi/chi/v5.(*Mux).ServeHTTP\n\tgithub.com/go-chi/chi/v5@v5.0.0/mux.go:87\nnet/http.serverHandler.ServeHTTP\n\tnet/http/server.go:2936\nnet/http.(*conn).serve\n\tnet/http/server.go:1995"}
Manager logs :
Lot of times health check for nodes give timeout on the manager --
The text was updated successfully, but these errors were encountered: