Sentry Self Hosted keeps crashing randomly and maxes out the RAM #2700

Closed
edgariscoding opened this issue Jan 10, 2024 · 13 comments

@edgariscoding
Contributor

Self-Hosted Version

24.1.0.dev0

CPU Architecture

x86_64

Docker Version

24.0.7

Docker Compose Version

2.21.0

Steps to Reproduce

There are no specific steps. A few days ago, during a planned maintenance window when our servers were down, a lot of clients started reporting events. At exactly that time something stopped working, and no events were captured after that.

The Sentry dashboard was accessible, but the last reported event was from the time the maintenance window started. Events were never received again after that until I restarted the containers, after which it worked properly.

That is when the problems started, but now I'm seeing them throughout the day. There's not much activity going on right now, yet the dashboard intermittently shows a "Service Unavailable - The service is temporarily unavailable. Please try again later." page. After a few minutes I'm able to load the dashboard again. Looking at the Sentry VM, it's using almost 100% of the RAM. The output from the logs is included below.

Looking at the logs, I see lots of errors regarding Kafka. I'm not very familiar with Kafka, so I would appreciate any help.

Expected Result

Sentry should not crash.

Actual Result

sentry-self-hosted-snuba-issue-occurrence-consumer-1               | 2024-01-10 17:56:51,916 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-issue-occurrence-consumer-1               | %3|1704909412.844|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-issue-occurrence-consumer-1               | %3|1704909412.862|FAIL|rdkafka#producer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-issue-occurrence-consumer-1               | %3|1704909412.868|FAIL|rdkafka#consumer-3| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-issue-occurrence-consumer-1               | 2024-01-10 17:56:52,868 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-issue-occurrence-consumer-1               | 2024-01-10 17:57:42,740 New partitions assigned: {Partition(topic=Topic(name='generic-events'), index=0): 0}
sentry-self-hosted-snuba-outcomes-consumer-1                       | %4|1704909312.529|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-snuba-outcomes-consumer-1                       | %3|1704909312.529|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13905ms in state UP)
sentry-self-hosted-ingest-occurrences-1                            | %3|1704908159.811|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 241474ms in state UP)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704908160.822|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704908160.870|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 233421ms in state UP)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704908180.712|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 20896ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704908181.773|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 20158ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:36:43 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='generic-events'), index=0): 0}
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:36:43 [INFO] sentry.post_process_forwarder.post_process_forwarder: Starting multithreaded post process forwarder
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %5|1704909277.425|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out HeartbeatRequest in flight (after 30254ms, timeout #0)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704909277.617|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909277.979|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1095843ms in state UP)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %5|1704909278.626|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out HeartbeatRequest in flight (after 30950ms, timeout #0)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704909278.578|SESSTMOUT|rdkafka#consumer-2| [thrd:main]: Consumer group session timed out (in join-state steady) after 32299 ms without a successful response from the group coordinator (broker 1001, last error was Success): revoking assignment and rejoining group
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704909278.721|SESSTMOUT|rdkafka#consumer-1| [thrd:main]: Consumer group session timed out (in join-state steady) after 32792 ms without a successful response from the group coordinator (broker 1001, last error was Local: Timed out): revoking assignment and rejoining group
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704909278.726|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909278.767|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1095941ms in state UP)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909297.373|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 18541ms in state CONNECT)
sentry-self-hosted-snuba-generic-metrics-distributions-consumer-1  | 2024-01-10 17:56:52,885 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-generic-metrics-distributions-consumer-1  | 2024-01-10 17:57:42,758 New partitions assigned: {Partition(topic=Topic(name='snuba-generic-metrics'), index=0): 7990}
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1704909278.080|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1095246ms in state UP)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:37,617 Partitions to revoke: [Partition(topic=Topic(name='profiles-call-tree'), index=0)]
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:39,067 Closing <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f6525829b50>...
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:40,075 Waiting for <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f6525829b50> to exit...
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:55:12,529 Error callback from librdKafka -185, _TIMED_OUT, GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13905ms in state UP)
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:55:12,533 Stopped
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:05,093 Initializing Snuba...
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:21,542 Snuba initialization took 16.450508257000003s
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:24,152 Initializing Snuba...
sentry-self-hosted-events-consumer-1                               | %3|1704907874.271|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 20443ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907801.413|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 28865ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907811.177|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 23654ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.home=/root
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.dir=/
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.free=236MB
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.max=3552MB
sentry-self-hosted-ingest-profiles-1                               | %3|1704909312.586|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: 1 request(s) timed out: disconnect (after 13963ms in state UP)
sentry-self-hosted-ingest-profiles-1                               | 17:55:12 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-ingest-profiles-1                               | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-ingest-profiles-1                               | 0 added, 0 removed; done.
sentry-self-hosted-ingest-profiles-1                               | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-ingest-profiles-1                               | done.
sentry-self-hosted-ingest-profiles-1                               | %3|1704909405.661|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-ingest-profiles-1                               | %3|1704909406.659|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-ingest-profiles-1                               | 17:57:42 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='profiles'), index=0): 0}
sentry-self-hosted-web-1                                           | 19:55:34 [INFO] sentry.access.api: api.access (method='POST' view='sentry.api.endpoints.relay.project_configs.RelayProjectConfigsEndpoint' response=200 user_id='None' is_app='None' token_type='None' is_frontend_request='False' organization_id='None' auth_id='None' path='/api/0/relays/projectconfigs/' caller_ip='172.18.0.29' user_agent='None' rate_limited='False' rate_limit_category='None' request_duration_seconds=0.010183572769165039 rate_limit_type='DNE' concurrent_limit='None' concurrent_requests='None' reset_time='None' group='None' limit='None' remaining='None')
sentry-self-hosted-ingest-replay-recordings-1                      | %4|1704909278.721|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-web-1                                           | 19:55:45 [INFO] sentry.access.api: api.access (method='POST' view='sentry.api.endpoints.relay.project_configs.RelayProjectConfigsEndpoint' response=200 user_id='None' is_app='None' token_type='None' is_frontend_request='False' organization_id='None' auth_id='None' path='/api/0/relays/projectconfigs/' caller_ip='172.18.0.29' user_agent='None' rate_limited='False' rate_limit_category='None' request_duration_seconds=0.010748147964477539 rate_limit_type='DNE' concurrent_limit='None' concurrent_requests='None' reset_time='None' group='None' limit='None' remaining='None')
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:30:30,911 Partitions to revoke: [Partition(topic=Topic(name='snuba-commit-log'), index=0)]
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:30:31,716 Closing <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb59190>...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:30:32,110 Waiting for <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb59190> to exit...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:30:32,116 <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb59190> exited successfully, releasing assignment.
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:30:32,116 Partition revocation complete.
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907834.617|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 28136ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907848.015|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 21894ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907858.767|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 20251ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907878.314|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 20649ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907890.514|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 20138ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907901.766|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 20209ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907916.864|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Name or service not known (after 15767ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704907917.121|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/1001: Failed to resolve 'kafka:9092': Name or service not known (after 13808ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:31:59,926 New partitions assigned: {Partition(topic=Topic(name='snuba-commit-log'), index=0): 20737}
sentry-self-hosted-snuba-subscription-consumer-events-1            | %4|1704908172.510|SESSTMOUT|rdkafka#consumer-2| [thrd:main]: Consumer group session timed out (in join-state steady) after 45343 ms without a successful response from the group coordinator (broker 1001, last error was Success): revoking assignment and rejoining group
sentry-self-hosted-web-1                                           | 19:55:55 [INFO] sentry.access.api: api.access (method='POST' view='sentry.api.endpoints.relay.project_configs.RelayProjectConfigsEndpoint' response=200 user_id='None' is_app='None' token_type='None' is_frontend_request='False' organization_id='None' auth_id='None' path='/api/0/relays/projectconfigs/' caller_ip='172.18.0.29' user_agent='None' rate_limited='False' rate_limit_category='None' request_duration_seconds=0.011533737182617188 rate_limit_type='DNE' concurrent_limit='None' concurrent_requests='None' reset_time='None' group='None' limit='None' remaining='None')
sentry-self-hosted-web-1                                           | 19:56:05 [INFO] sentry.access.api: api.access (method='POST' view='sentry.api.endpoints.relay.project_configs.RelayProjectConfigsEndpoint' response=200 user_id='None' is_app='None' token_type='None' is_frontend_request='False' organization_id='None' auth_id='None' path='/api/0/relays/projectconfigs/' caller_ip='172.18.0.29' user_agent='None' rate_limited='False' rate_limit_category='None' request_duration_seconds=0.00989985466003418 rate_limit_type='DNE' concurrent_limit='None' concurrent_requests='None' reset_time='None' group='None' limit='None' remaining='None')
sentry-self-hosted-web-1                                           | 19:56:15 [INFO] sentry.access.api: api.access (method='POST' view='sentry.api.endpoints.relay.project_configs.RelayProjectConfigsEndpoint' response=200 user_id='None' is_app='None' token_type='None' is_frontend_request='False' organization_id='None' auth_id='None' path='/api/0/relays/projectconfigs/' caller_ip='172.18.0.29' user_agent='None' rate_limited='False' rate_limit_category='None' request_duration_seconds=0.011057615280151367 rate_limit_type='DNE' concurrent_limit='None' concurrent_requests='None' reset_time='None' group='None' limit='None' remaining='None')
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,027 Snuba initialization took 24.92029119s
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,067 Consumer Starting
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,067 Checking Clickhouse connections...
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,123 Successfully connected to Clickhouse: cluster_name=None
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,124 librdkafka log level: 6
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:36:13,123 Partitions to revoke: [Partition(topic=Topic(name='snuba-commit-log'), index=0)]
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.total=240MB
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=zookeeper:2181 sessionTimeout=40000 watcher=io.confluent.admin.utils.ZookeeperConnectionWatcher@65b3120a
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.common.X509Util - Setting -D jdk.tls.rejectClientInitiatedRenegotiation=true to disable client-initiated TLS renegotiation
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ClientCnxnSocket - jute.maxbuffer value is 4194304 Bytes
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ClientCnxn - zookeeper.request.timeout value is 0. feature enabled=
sentry-self-hosted-kafka-1                                         | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server zookeeper/172.18.0.39:2181. Will not attempt to authenticate using SASL (unknown error)
sentry-self-hosted-kafka-1                                         | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established, initiating session, client: /172.18.0.33:40912, server: zookeeper/172.18.0.39:2181
sentry-self-hosted-kafka-1                                         | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server zookeeper/172.18.0.39:2181, sessionid = 0x100000148c30000, negotiated timeout = 40000
sentry-self-hosted-kafka-1                                         | [main] INFO org.apache.zookeeper.ZooKeeper - Session: 0x100000148c30000 closed
sentry-self-hosted-kafka-1                                         | [main-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x100000148c30000
sentry-self-hosted-kafka-1                                         | ===> Launching ... 
sentry-self-hosted-kafka-1                                         | ===> Launching kafka ... 
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:56:59,600] INFO Registered kafka:type=kafka.Log4jController MBean (kafka.utils.Log4jControllerRegistration$)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:02,640] WARN The package io.confluent.support.metrics.collectors.FullCollector for collecting the full set of support metrics could not be loaded, so we are reverting to anonymous, basic metric collection. If you are a Confluent customer, please refer to the Confluent Platform documentation, section Proactive Support, on how to activate full metrics collection. (io.confluent.support.metrics.KafkaSupportConfig)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:02,645] WARN The support metrics collection feature ("Metrics") of Proactive Support is disabled. (io.confluent.support.metrics.SupportedServerStartable)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:10,538] INFO Starting the log cleaner (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:10,589] INFO [kafka-log-cleaner-thread-0]: Starting (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,003] INFO Awaiting socket connections on 0.0.0.0:9092. (kafka.network.Acceptor)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,045] INFO [SocketServer brokerId=1001] Created data-plane acceptor and processors for endpoint : EndPoint(0.0.0.0,9092,ListenerName(PLAINTEXT),PLAINTEXT) (kafka.network.SocketServer)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,046] INFO [SocketServer brokerId=1001] Started 1 acceptor threads for data-plane (kafka.network.SocketServer)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,175] INFO Creating /brokers/ids/1001 (is it secure? false) (kafka.zk.KafkaZkClient)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,205] INFO Stat of the created znode at /brokers/ids/1001 is: 686,686,1704909431188,1704909431188,1,0,0,72057599553634305,180,0,686
sentry-self-hosted-kafka-1                                         |  (kafka.zk.KafkaZkClient)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:41,175 <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f6525829b50> exited successfully, releasing assignment.
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:41,970 Partition revocation complete.
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:48,117 Error callback from librdKafka -185, _TIMED_OUT, GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1095246ms in state UP)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:36:13,314 Closing <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb593a0>...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:36:13,317 Waiting for <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb593a0> to exit...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:36:13,317 <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb593a0> exited successfully, releasing assignment.
sentry-self-hosted-events-consumer-1                               | %3|1704907901.214|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 20195ms in state CONNECT, 2 identical error(s) suppressed)
sentry-self-hosted-events-consumer-1                               | %3|1704907917.121|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 14898ms in state CONNECT)
sentry-self-hosted-events-consumer-1                               | 17:31:59 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-events'), index=0): 25619}
sentry-self-hosted-events-consumer-1                               | 17:35:58 [INFO] arroyo.processing.processor: Partitions to revoke: [Partition(topic=Topic(name='ingest-events'), index=0)]
sentry-self-hosted-events-consumer-1                               | 17:35:59 [INFO] arroyo.processing.processor: Closing <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f696c405130>...
sentry-self-hosted-events-consumer-1                               | 17:36:01 [INFO] arroyo.processing.processor: Waiting for <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f696c405130> to exit...
sentry-self-hosted-events-consumer-1                               | 17:36:03 [INFO] arroyo.processing.processor: <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f696c405130> exited successfully, releasing assignment.
sentry-self-hosted-events-consumer-1                               | 17:36:05 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1704909297.371|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 19290ms in state CONNECT)
sentry-self-hosted-snuba-outcomes-consumer-1                       | %3|1704909409.157|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:36:13,318 Partition revocation complete.
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,158 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:36:58,569 New partitions assigned: {Partition(topic=Topic(name='snuba-commit-log'), index=0): 20739}
sentry-self-hosted-snuba-subscription-consumer-events-1            | %4|1704909287.471|SESSTMOUT|rdkafka#consumer-2| [thrd:main]: Consumer group session timed out (in join-state steady) after 45342 ms without a successful response from the group coordinator (broker 1001, last error was Success): revoking assignment and rejoining group
sentry-self-hosted-snuba-subscription-consumer-events-1            | %5|1704909291.075|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out HeartbeatRequest in flight (after 45695ms, timeout #0)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %4|1704909291.168|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704909292.179|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1374382ms in state UP)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:54:48,371 Partitions to revoke: [Partition(topic=Topic(name='snuba-commit-log'), index=0)]
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:54:54,672 Closing <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb591f0>...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:54:55,477 Waiting for <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb591f0> to exit...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:54:55,620 <snuba.subscriptions.scheduler_processing_strategy.TickBuffer object at 0x7f4cfeb591f0> exited successfully, releasing assignment.
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:54:55,920 Partition revocation complete.
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:55:07,522 Shutdown signalled
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:55:07,522 Stopping consumer
sentry-self-hosted-snuba-subscription-consumer-events-1            | %5|1704909312.531|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5004ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 287955ms
sentry-self-hosted-snuba-subscription-consumer-events-1            | %4|1704909312.531|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704909312.531|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 15161ms in state UP, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:55:12,535 Stopped
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:56:06,023 Initializing Snuba...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:56:38,174 Snuba initialization took 32.152243641999995s
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:56:41,794 Initializing Snuba...
sentry-self-hosted-events-consumer-1                               | %4|1704908157.462|SESSTMOUT|rdkafka#consumer-1| [thrd:main]: Consumer group session timed out (in join-state steady) after 30048 ms without a successful response from the group coordinator (broker 1001, last error was Success): revoking assignment and rejoining group
sentry-self-hosted-events-consumer-1                               | %5|1704908159.668|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out HeartbeatRequest in flight (after 30200ms, timeout #0)
sentry-self-hosted-events-consumer-1                               | %4|1704908159.668|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-events-consumer-1                               | %3|1704908159.811|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 241475ms in state UP)
sentry-self-hosted-events-consumer-1                               | %3|1704908181.172|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Temporary failure in name resolution (after 21359ms in state CONNECT)
sentry-self-hosted-ingest-occurrences-1                            | %3|1704908180.715|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 20902ms in state CONNECT)
sentry-self-hosted-ingest-occurrences-1                            | 17:36:43 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-occurrences'), index=0): 0}
sentry-self-hosted-ingest-occurrences-1                            | 17:54:39 [INFO] arroyo.processing.processor: Partitions to revoke: [Partition(topic=Topic(name='ingest-occurrences'), index=0)]
sentry-self-hosted-ingest-occurrences-1                            | 17:54:41 [INFO] arroyo.processing.processor: Closing <arroyo.processing.strategies.run_task_with_multiprocessing.RunTaskWithMultiprocessing object at 0x7f6b72600790>...
sentry-self-hosted-ingest-occurrences-1                            | 17:54:42 [INFO] arroyo.processing.processor: Waiting for <arroyo.processing.strategies.run_task_with_multiprocessing.RunTaskWithMultiprocessing object at 0x7f6b72600790> to exit...
sentry-self-hosted-ingest-occurrences-1                            | 17:54:54 [INFO] arroyo.processing.processor: Shutdown signalled
sentry-self-hosted-ingest-occurrences-1                            | 17:54:55 [INFO] arroyo.processing.processor: <arroyo.processing.strategies.run_task_with_multiprocessing.RunTaskWithMultiprocessing object at 0x7f6b72600790> exited successfully, releasing assignment.
sentry-self-hosted-ingest-occurrences-1                            | 17:54:55 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-ingest-occurrences-1                            | %5|1704909278.625|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out HeartbeatRequest in flight (after 30952ms, timeout #0)
sentry-self-hosted-ingest-occurrences-1                            | %4|1704909278.579|SESSTMOUT|rdkafka#consumer-1| [thrd:main]: Consumer group session timed out (in join-state steady) after 32299 ms without a successful response from the group coordinator (broker 1001, last error was Success): revoking assignment and rejoining group
sentry-self-hosted-events-consumer-1                               | 17:36:43 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-events'), index=0): 25626}
sentry-self-hosted-events-consumer-1                               | %5|1704909278.626|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out HeartbeatRequest in flight (after 30951ms, timeout #0)
sentry-self-hosted-events-consumer-1                               | %4|1704909278.578|SESSTMOUT|rdkafka#consumer-1| [thrd:main]: Consumer group session timed out (in join-state steady) after 32299 ms without a successful response from the group coordinator (broker 1001, last error was Success): revoking assignment and rejoining group
sentry-self-hosted-events-consumer-1                               | %4|1704909278.721|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-events-consumer-1                               | %3|1704909278.767|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1096093ms in state UP)
sentry-self-hosted-events-consumer-1                               | 17:54:39 [INFO] arroyo.processing.processor: Partitions to revoke: [Partition(topic=Topic(name='ingest-events'), index=0)]
sentry-self-hosted-events-consumer-1                               | 17:54:41 [INFO] arroyo.processing.processor: Closing <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f696dd50760>...
sentry-self-hosted-events-consumer-1                               | 17:54:42 [INFO] arroyo.processing.processor: Waiting for <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f696dd50760> to exit...
sentry-self-hosted-events-consumer-1                               | 17:54:44 [INFO] arroyo.processing.processor: <arroyo.processing.strategies.guard.StrategyGuard object at 0x7f696dd50760> exited successfully, releasing assignment.
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:54:57,421 Error callback from librdKafka -193, _RESOLVE, GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 19290ms in state CONNECT)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:55:07,525 Shutdown signalled
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:55:07,526 Flushing and closing cogs recorder if it exists...
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:55:07,526 Stopping consumer
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %5|1704909312.531|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5004ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 19198ms
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %4|1704909312.531|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1704909312.531|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13907ms in state UP)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:55:12,532 Error callback from librdKafka -185, _TIMED_OUT, GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13907ms in state UP)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:55:12,535 Stopped
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:05,134 Initializing Snuba...
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:57:03,782 Snuba initialization took 21.989187238s
sentry-self-hosted-ingest-occurrences-1                            | %4|1704909278.724|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-snuba-subscription-consumer-events-1            | {"module": "builtins", "event": "Checking Clickhouse connections", "severity": "info", "timestamp": "2024-01-10T17:57:03.822176Z"}
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704909423.829|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 1ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704909423.869|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704909424.821|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | %3|1704909424.869|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-events-1            | 2024-01-10 17:57:57,731 New partitions assigned: {Partition(topic=Topic(name='snuba-commit-log'), index=0): 20740}
sentry-self-hosted-ingest-occurrences-1                            | %3|1704909278.767|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1096741ms in state UP)
sentry-self-hosted-ingest-occurrences-1                            | %3|1704909297.375|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 18544ms in state CONNECT)
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:49,163 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-outcomes-consumer-1                       | %3|1704909410.155|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:56:50,155 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-outcomes-consumer-1                       | 2024-01-10 17:57:42,749 New partitions assigned: {Partition(topic=Topic(name='outcomes'), index=0): 29471}
sentry-self-hosted-events-consumer-1                               | 17:54:46 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909297.376|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 19145ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:54:40 [INFO] arroyo.processing.processor: Partitions to revoke: [Partition(topic=Topic(name='generic-events'), index=0)]
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:54:44 [INFO] arroyo.processing.processor: Closing <arroyo.processing.strategies.run_task_in_threads.RunTaskInThreads object at 0x7fce5e451be0>...
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:54:46 [INFO] arroyo.processing.processor: Waiting for <arroyo.processing.strategies.run_task_in_threads.RunTaskInThreads object at 0x7fce5e451be0> to exit...
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:54:53 [INFO] arroyo.processing.processor: <arroyo.processing.strategies.run_task_in_threads.RunTaskInThreads object at 0x7fce5e451be0> exited successfully, releasing assignment.
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:54:55 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:55:07 [INFO] arroyo.processing.processor: Shutdown signalled
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:55:07 [INFO] arroyo.processing.processor: Stopping consumer
sentry-self-hosted-events-consumer-1                               | %3|1704909297.376|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 18545ms in state CONNECT)
sentry-self-hosted-events-consumer-1                               | 17:55:07 [INFO] arroyo.processing.processor: Shutdown signalled
sentry-self-hosted-events-consumer-1                               | 17:55:07 [INFO] arroyo.processing.processor: Stopping consumer
sentry-self-hosted-events-consumer-1                               | %5|1704909312.587|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5003ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 19080ms
sentry-self-hosted-events-consumer-1                               | %4|1704909312.587|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-events-consumer-1                               | %3|1704909312.587|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13967ms in state UP)
sentry-self-hosted-events-consumer-1                               | 17:55:12 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-events-consumer-1                               | Sentry is attempting to send 1 pending events
sentry-self-hosted-events-consumer-1                               | Waiting up to 2 seconds
sentry-self-hosted-events-consumer-1                               | Press Ctrl-C to quit
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1704909278.767|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 1095942ms in state UP)
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1704909297.372|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: Failed to resolve 'kafka:9092': Name or service not known (after 18538ms in state CONNECT)
sentry-self-hosted-ingest-replay-recordings-1                      | 17:54:39 [INFO] arroyo.processing.processor: Partitions to revoke: [Partition(topic=Topic(name='ingest-replay-recordings'), index=0)]
sentry-self-hosted-ingest-replay-recordings-1                      | 17:54:41 [INFO] arroyo.processing.processor: Closing <arroyo.processing.strategies.guard.StrategyGuard object at 0x7fc55343a790>...
sentry-self-hosted-ingest-replay-recordings-1                      | 17:54:42 [INFO] arroyo.processing.processor: Waiting for <arroyo.processing.strategies.guard.StrategyGuard object at 0x7fc55343a790> to exit...
sentry-self-hosted-ingest-replay-recordings-1                      | 17:54:44 [INFO] arroyo.processing.processor: <arroyo.processing.strategies.guard.StrategyGuard object at 0x7fc55343a790> exited successfully, releasing assignment.
sentry-self-hosted-ingest-replay-recordings-1                      | 17:54:46 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-ingest-replay-recordings-1                      | 17:55:07 [INFO] arroyo.processing.processor: Shutdown signalled
sentry-self-hosted-ingest-replay-recordings-1                      | 17:55:07 [INFO] arroyo.processing.processor: Stopping consumer
sentry-self-hosted-ingest-replay-recordings-1                      | %5|1704909312.929|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5004ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 18742ms
sentry-self-hosted-ingest-replay-recordings-1                      | %4|1704909312.929|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1704909312.929|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: 1 request(s) timed out: disconnect (after 14305ms in state UP)
sentry-self-hosted-ingest-replay-recordings-1                      | 17:55:12 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-ingest-replay-recordings-1                      | Sentry is attempting to send 1 pending events
sentry-self-hosted-ingest-replay-recordings-1                      | Waiting up to 2 seconds
sentry-self-hosted-ingest-replay-recordings-1                      | Press Ctrl-C to quit
sentry-self-hosted-ingest-replay-recordings-1                      | 17:55:13 [ERROR] sentry_sdk.errors: Unexpected status code: 500 (body: b'<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">\n<html>\n<head>\n<meta http-equiv="Content-Type" content="text/html;charset=utf-8">\n<title>Service Unavailable</title>\n<style type="text/css">\nbody, p, h1 {\n  font-family: Verdana, Arial, Helvetica, sans-serif;\n}\nh2 {\n  font-family: Arial, Helvetica, sans-serif;\n  color: #b10b29;\n}\n</style>\n</head>\n<body>\n<h2>Service Unavailable</h2>\n<p>The service is temporarily unavailable. Please try again later.</p>\n</body>\n</html>\n')
sentry-self-hosted-ingest-replay-recordings-1                      | %4|1704909314.503|TERMINATE|rdkafka#producer-2| [thrd:app]: Producer terminating with 1 message (178 bytes) still in queue or transit: use flush() to wait for outstanding message delivery
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:55:15,822 Checking Clickhouse connections...
sentry-self-hosted-ingest-replay-recordings-1                      | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-ingest-replay-recordings-1                      | 0 added, 0 removed; done.
sentry-self-hosted-ingest-replay-recordings-1                      | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-ingest-replay-recordings-1                      | done.
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1704909414.016|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 1ms in state CONNECT)
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1704909415.001|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-ingest-replay-recordings-1                      | 17:57:42 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-replay-recordings'), index=0): 2495}
sentry-self-hosted-events-consumer-1                               | 17:55:12 [ERROR] sentry_sdk.errors: Unexpected status code: 500 (body: b'<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">\n<html>\n<head>\n<meta http-equiv="Content-Type" content="text/html;charset=utf-8">\n<title>Service Unavailable</title>\n<style type="text/css">\nbody, p, h1 {\n  font-family: Verdana, Arial, Helvetica, sans-serif;\n}\nh2 {\n  font-family: Arial, Helvetica, sans-serif;\n  color: #b10b29;\n}\n</style>\n</head>\n<body>\n<h2>Service Unavailable</h2>\n<p>The service is temporarily unavailable. Please try again later.</p>\n</body>\n</html>\n')
sentry-self-hosted-events-consumer-1                               | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-events-consumer-1                               | 0 added, 0 removed; done.
sentry-self-hosted-events-consumer-1                               | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,206] INFO Registered broker 1001 at path /brokers/ids/1001 with addresses: ArrayBuffer(EndPoint(kafka,9092,ListenerName(PLAINTEXT),PLAINTEXT)), czxid (broker epoch): 686 (kafka.zk.KafkaZkClient)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,762] INFO [/config/changes-event-process-thread]: Starting (kafka.common.ZkNodeChangeNotificationListener$ChangeEventProcessThread)
sentry-self-hosted-kafka-1                                         | [2024-01-10 17:57:11,916] INFO [SocketServer brokerId=1001] Started data-plane processors for 1 acceptors (kafka.network.SocketServer)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %5|1704909312.533|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5004ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 19134ms
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704909312.533|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-events-consumer-1                               | done.
sentry-self-hosted-events-consumer-1                               | %3|1704909413.492|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 4ms in state CONNECT)
sentry-self-hosted-events-consumer-1                               | %3|1704909414.467|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-events-consumer-1                               | 17:57:42 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-events'), index=0): 25628}
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:20,846 Snuba initialization took 15.713754767000005s
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:25,592 Initializing Snuba...
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,254 Snuba initialization took 25.66412641000001s
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,329 Consumer Starting
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:07,563 Initializing Snuba...
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:27,206 Snuba initialization took 19.659862033000003s
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:30,779 Initializing Snuba...
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:54,938 Snuba initialization took 24.161244126s
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:54,971 Consumer Starting
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:54,978 Checking Clickhouse connections...
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:55,003 Successfully connected to Clickhouse: cluster_name=None
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:55,004 librdkafka log level: 6
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | %3|1704909415.018|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 1ms in state CONNECT)
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | %3|1704909415.031|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 6ms in state CONNECT)
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:55,032 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 6ms in state CONNECT)
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:55,032 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | %3|1704909416.020|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | %3|1704909416.025|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:56:56,025 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-generic-metrics-sets-consumer-1           | 2024-01-10 17:57:42,756 New partitions assigned: {Partition(topic=Topic(name='snuba-generic-metrics'), index=0): 7985}
sentry-self-hosted-ingest-occurrences-1                            | 17:55:07 [INFO] arroyo.processing.processor: Shutdown signalled
sentry-self-hosted-ingest-occurrences-1                            | 17:55:07 [INFO] arroyo.processing.processor: Stopping consumer
sentry-self-hosted-ingest-occurrences-1                            | %5|1704909312.536|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5004ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 19086ms
sentry-self-hosted-ingest-occurrences-1                            | %4|1704909312.536|REQTMOUT|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-ingest-occurrences-1                            | %3|1704909312.536|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13915ms in state UP)
sentry-self-hosted-ingest-occurrences-1                            | 17:55:12 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-ingest-occurrences-1                            | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-ingest-occurrences-1                            | 0 added, 0 removed; done.
sentry-self-hosted-ingest-occurrences-1                            | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-ingest-occurrences-1                            | done.
sentry-self-hosted-ingest-occurrences-1                            | %3|1704909409.112|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-ingest-occurrences-1                            | %3|1704909410.047|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-ingest-occurrences-1                            | 17:57:42 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-occurrences'), index=0): 0}
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,347 Checking Clickhouse connections...
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,380 Successfully connected to Clickhouse: cluster_name=None
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,381 librdkafka log level: 6
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1704909411.396|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 8ms in state CONNECT)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,402 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 8ms in state CONNECT)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:51,411 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909312.533|FAIL|rdkafka#consumer-1| [thrd:GroupCoordinator]: GroupCoordinator: kafka:9092: 1 request(s) timed out: disconnect (after 13912ms in state UP)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %5|1704909312.549|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out LeaveGroupRequest in flight (after 5005ms, timeout #0): possibly held back by preceeding blocking JoinGroupRequest with timeout in 19123ms
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %4|1704909312.549|REQTMOUT|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator/1001: Timed out 1 in-flight, 0 retry-queued, 0 out-queue, 0 partially-sent requests
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909312.549|FAIL|rdkafka#consumer-2| [thrd:GroupCoordinator]: GroupCoordinator: 1 request(s) timed out: disconnect (after 13925ms in state UP)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:55:12 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-post-process-forwarder-issue-platform-1         | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 0 added, 0 removed; done.
sentry-self-hosted-post-process-forwarder-issue-platform-1         | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-post-process-forwarder-issue-platform-1         | done.
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909413.967|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 64ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909413.989|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1704909412.385|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:56:52,385 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-10 17:57:42,751 New partitions assigned: {Partition(topic=Topic(name='profiles-call-tree'), index=0): 0}
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909414.903|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1704909414.965|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.33:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:57:42 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='generic-events'), index=0): 0}
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 17:57:42 [INFO] sentry.post_process_forwarder.post_process_forwarder: Starting multithreaded post process forwarder

Event ID

No response

@aldy505
Collaborator

aldy505 commented Jan 11, 2024

Your Kafka just blew up (this is a regular problem with Kafka under high traffic; it totally sucks, I know). The proper solution is to scale Kafka out horizontally: spawn multiple VMs (usually 3 or 5), deploy Kafka on them, and then maintain the resulting Kafka cluster regularly.

But, please don't do this for self-hosted Sentry. It's a total nightmare and maintenance burden.

A possible root cause of your problem is simply that the dependencies (mostly Kafka) can't handle high traffic. The solutions available to you are:

  • Put a rate-limiting middleware on the reverse proxy you're running in front of Sentry's nginx container. See this draft doc on productionalizing and some configuration for modern proxies. The Sentry clients (the Sentry SDKs) handle non-200 responses well, so your application won't crash when it gets rate limited.
  • Lower the sample rates (error, transaction, and profiling sample rates) in your application. This means changing the configuration passed at SDK initialization in your app.
  • Bump your server specs. This is the most rational solution, but if you can't afford higher specs, adding a rate limiter will do.
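To make the rate limiter idea a bit more concrete, here is a minimal sketch of the token-bucket behaviour that proxy rate limiters commonly implement. It is plain Python for illustration only, not the configuration of any particular proxy; `TokenBucket`, `status_for`, `rate`, and `burst` are hypothetical names, and the 429 status is an assumption about what the proxy would return to the SDK.

```python
import time


class TokenBucket:
    """Illustrative token-bucket limiter (hypothetical helper, not a Sentry or nginx API)."""

    def __init__(self, rate: float, burst: int, clock=time.monotonic):
        self.rate = rate            # tokens added back per second
        self.burst = burst          # maximum tokens the bucket can hold
        self.tokens = float(burst)  # start with a full bucket
        self.clock = clock
        self.last = clock()

    def allow(self) -> bool:
        now = self.clock()
        # Refill in proportion to elapsed time, capped at the burst size.
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False


def status_for(bucket: TokenBucket) -> int:
    """Status a proxy might answer with: 200 when admitted, 429 when throttled (assumed)."""
    return 200 if bucket.allow() else 429
```

With a small `rate` and `burst`, a spike like the one during the maintenance window would mostly be answered with throttled responses at the proxy instead of reaching Kafka.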

To mitigate your current situation, here's what you can do (basically what I'd do if I were you):

  1. sudo docker compose down
  2. Drop the Kafka volume: sudo docker volume rm sentry-kafka. This only loses roughly the last 5 minutes of Sentry events, which should not be an issue.
  3. sudo ./install.sh, which recreates the Kafka volume and every Kafka topic.
  4. sudo docker compose up -d to restart your entire Sentry.
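If you'd rather review the steps above before running them, they can be sketched as a small dry-run script. This is only an illustration: `RECOVERY_STEPS` and `run_steps` are hypothetical names, and the script assumes it is run from the self-hosted checkout directory (where install.sh lives).

```python
import shlex
import subprocess

# The four steps from the list above, in order.
RECOVERY_STEPS = [
    "sudo docker compose down",
    "sudo docker volume rm sentry-kafka",
    "sudo ./install.sh",
    "sudo docker compose up -d",
]


def run_steps(steps, dry_run=True, runner=subprocess.run):
    """Return the parsed commands; execute them in order only when dry_run is False.

    With dry_run=True (the default) nothing is executed, so the plan
    can be reviewed first. Execution stops at the first failing step.
    """
    planned = [shlex.split(step) for step in steps]
    if not dry_run:
        for argv in planned:
            runner(argv, check=True)  # raises CalledProcessError on failure
    return planned


if __name__ == "__main__":
    # Default is a dry run: just print what would be executed.
    for argv in run_steps(RECOVERY_STEPS):
        print(" ".join(argv))
```

Passing `dry_run=False` executes the commands for real, including the volume removal, so it is worth reading the printed plan first.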

Let me know if that helps

@edgariscoding
Contributor Author

@aldy505

So I followed your instructions to drop the Kafka volume and this happened:

image

I also followed the "nuclear option" on this page:

https://develop.sentry.dev/self-hosted/troubleshooting/

/var/log/error.log shows:

2024/01/24 17:29:00 
[error] 754#754: 
*21297 connect() failed (111: Unknown error) while connecting to upstream, 
client: 100.68.101.10, 
server: sentry.ciranet.com, 
request: "POST /api/1/envelope/ HTTP/1.1", 
upstream: "http://127.0.0.1:9000/api/1/envelope/", 
host: "sentry.ciranet.com"

/var/log/access.log shows:

100.68.101.10 - - 
[24/Jan/2024:22:03:39 +0000] 
"GET / HTTP/1.1" 502 150 "-" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.3 Safari/605.1.15"

And docker-compose logs show:

sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/sentry/utils/imports.py", line 8, in __missing__
sentry-self-hosted-metrics-consumer-1                              |     module = __import__(module_name, {}, {}, [class_name])
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/sentry/sentry_metrics/consumers/indexer/parallel.py", line 23, in <module>
sentry-self-hosted-metrics-consumer-1                              |     from sentry.sentry_metrics.consumers.indexer.processing import MessageProcessor
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/sentry/sentry_metrics/consumers/indexer/processing.py", line 26, in <module>
sentry-self-hosted-kafka-1                                         |    at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:31)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ControllerEventManager$ControllerEventThread.doWork(ControllerEventManager.scala:139)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-24 17:37:31,199 librdkafka log level: 6
sentry-self-hosted-metrics-consumer-1                              |     from sentry.sentry_metrics.indexer.postgres.postgres_v2 import PostgresIndexer
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/sentry/sentry_metrics/indexer/postgres/postgres_v2.py", line 38, in <module>
sentry-self-hosted-metrics-consumer-1                              |     indexer_cache = StringIndexerCache(
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/sentry/sentry_metrics/indexer/cache.py", line 51, in __init__
sentry-self-hosted-metrics-consumer-1                              |     self.cache = caches[cache_name]
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/django/utils/connection.py", line 62, in __getitem__
sentry-self-hosted-metrics-consumer-1                              |     conn = self.create_connection(alias)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | %3|1706117851.229|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 2ms in state CONNECT)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-24 17:37:31,230 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 2ms in state CONNECT)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-24 17:37:31,230 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | %3|1706117852.223|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-24 17:37:32,223 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-cron-1                                          |     new_errors = check(app_configs=app_configs, databases=databases)
sentry-self-hosted-cron-1                                          |   File "/usr/local/lib/python3.10/site-packages/django/core/checks/caches.py", line 65, in check_file_based_cache_is_absolute
sentry-self-hosted-cron-1                                          |     cache = caches[alias]
sentry-self-hosted-cron-1                                          |   File "/usr/local/lib/python3.10/site-packages/django/utils/connection.py", line 62, in __getitem__
sentry-self-hosted-nginx-1                                         | 172.19.0.1 - - [24/Jan/2024:22:28:16 +0000] "POST /api/5/envelope/?sentry_key=8f8907c23299fc4baff49f4cbf349b4c&sentry_version=7&sentry_client=sentry.javascript.angular-ivy%2F7.93.0 HTTP/1.0" 503 84 "https://qa.ciravote.com/" "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36" "-"
sentry-self-hosted-cron-1                                          |     conn = self.create_connection(alias)
sentry-self-hosted-cron-1                                          |   File "/usr/local/lib/python3.10/site-packages/sentry_sdk/integrations/django/caching.py", line 108, in sentry_create_connection
sentry-self-hosted-cron-1                                          |     cache = original_create_connection(self, alias)
sentry-self-hosted-cron-1                                          |   File "/usr/local/lib/python3.10/site-packages/django/core/cache/__init__.py", line 49, in create_connection
sentry-self-hosted-cron-1                                          |     raise InvalidCacheBackendError(
sentry-self-hosted-cron-1                                          | django.core.cache.backends.base.InvalidCacheBackendError: Could not find backend 'django.core.cache.backends.memcached.MemcachedCache': Module "django.core.cache.backends.memcached" does not define a "MemcachedCache" attribute/class
sentry-self-hosted-cron-1                                          | Sentry is attempting to send 1 pending events
sentry-self-hosted-cron-1                                          | Waiting up to 2 seconds
sentry-self-hosted-cron-1                                          | Press Ctrl-C to quit
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:00,995 Snuba initialization took 15.030627143000004s
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:05,132 Initializing Snuba...
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,245 Snuba initialization took 22.114520626999997s
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,304 Consumer Starting
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,304 Checking Clickhouse connections...
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,353 Successfully connected to Clickhouse: cluster_name=None
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,354 librdkafka log level: 6
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1706117847.388|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 7ms in state CONNECT)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,388 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 7ms in state CONNECT)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:27,389 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-profiling-functions-consumer-1            | %3|1706117848.374|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-functions-consumer-1            | 2024-01-24 17:37:28,375 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-ingest-replay-recordings-1                      |     raise InvalidCacheBackendError(
sentry-self-hosted-ingest-replay-recordings-1                      | django.core.cache.backends.base.InvalidCacheBackendError: Could not find backend 'django.core.cache.backends.memcached.MemcachedCache': Module "django.core.cache.backends.memcached" does not define a "MemcachedCache" attribute/class
sentry-self-hosted-ingest-replay-recordings-1                      | 17:37:29 [WARNING] sentry: Unable to update option cache for redis.clusters (key='redis.clusters')
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1706117849.424|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1706117850.423|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.19.0.2:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-cron-1                                          | 22:28:07 [ERROR] sentry_sdk.errors: Unexpected status code: 503 (body: b'{"detail":"failed to queue envelope","causes":["envelope buffer capacity exceeded"]}')
sentry-self-hosted-attachments-consumer-1                          | 
sentry-self-hosted-attachments-consumer-1                          | Traceback (most recent call last):
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/django/core/cache/__init__.py", line 47, in create_connection
sentry-self-hosted-attachments-consumer-1                          |     backend_cls = import_string(backend)
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/django/utils/module_loading.py", line 32, in import_string
sentry-self-hosted-attachments-consumer-1                          |     raise ImportError(
sentry-self-hosted-attachments-consumer-1                          | ImportError: Module "django.core.cache.backends.memcached" does not define a "MemcachedCache" attribute/class
sentry-self-hosted-attachments-consumer-1                          | 
sentry-self-hosted-attachments-consumer-1                          | The above exception was the direct cause of the following exception:
sentry-self-hosted-attachments-consumer-1                          | 
sentry-self-hosted-attachments-consumer-1                          | Traceback (most recent call last):
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/sentry/options/store.py", line 262, in set_cache
sentry-self-hosted-attachments-consumer-1                          |     self.cache.set(cache_key, value, self.ttl)
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/django/utils/connection.py", line 15, in __getattr__
sentry-self-hosted-attachments-consumer-1                          |     return getattr(self._connections[self._alias], item)
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/django/utils/connection.py", line 62, in __getitem__
sentry-self-hosted-attachments-consumer-1                          |     conn = self.create_connection(alias)
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/sentry_sdk/integrations/django/caching.py", line 108, in sentry_create_connection
sentry-self-hosted-attachments-consumer-1                          |     cache = original_create_connection(self, alias)
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/sentry_sdk/integrations/django/caching.py", line 108, in sentry_create_connection
sentry-self-hosted-metrics-consumer-1                              |     cache = original_create_connection(self, alias)
sentry-self-hosted-metrics-consumer-1                              |   File "/usr/local/lib/python3.10/site-packages/django/core/cache/__init__.py", line 49, in create_connection
sentry-self-hosted-attachments-consumer-1                          |   File "/usr/local/lib/python3.10/site-packages/django/core/cache/__init__.py", line 49, in create_connection
sentry-self-hosted-attachments-consumer-1                          |     raise InvalidCacheBackendError(
sentry-self-hosted-attachments-consumer-1                          | django.core.cache.backends.base.InvalidCacheBackendError: Could not find backend 'django.core.cache.backends.memcached.MemcachedCache': Module "django.core.cache.backends.memcached" does not define a "MemcachedCache" attribute/class
sentry-self-hosted-attachments-consumer-1                          | 17:37:49 [WARNING] sentry: Unable to update option cache for redis.clusters (key='redis.clusters')
sentry-self-hosted-kafka-1                                         |    at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:96)
sentry-self-hosted-kafka-1                                         | [2024-01-24 17:37:47,280] ERROR [Controller id=1002 epoch=50] Controller 1002 epoch 50 failed to change state for partition scheduled-subscriptions-metrics-0 from OfflinePartition to OnlinePartition (state.change.logger)
sentry-self-hosted-kafka-1                                         | kafka.common.StateChangeFailedException: Failed to elect leader for partition scheduled-subscriptions-metrics-0 under strategy OfflinePartitionLeaderElectionStrategy(false)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ZkPartitionStateMachine.$anonfun$doElectLeaderForPartitions$7(PartitionStateMachine.scala:427)
sentry-self-hosted-kafka-1                                         |    at scala.collection.mutable.ResizableArray.foreach(ResizableArray.scala:62)
sentry-self-hosted-kafka-1                                         |    at scala.collection.mutable.ResizableArray.foreach$(ResizableArray.scala:55)
sentry-self-hosted-metrics-consumer-1                              |     raise InvalidCacheBackendError(
sentry-self-hosted-metrics-consumer-1                              | django.core.cache.backends.base.InvalidCacheBackendError: Could not find backend 'django.core.cache.backends.memcached.MemcachedCache': Module "django.core.cache.backends.memcached" does not define a "MemcachedCache" attribute/class
sentry-self-hosted-kafka-1                                         |    at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:49)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ZkPartitionStateMachine.doElectLeaderForPartitions(PartitionStateMachine.scala:424)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ZkPartitionStateMachine.electLeaderForPartitions(PartitionStateMachine.scala:335)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ZkPartitionStateMachine.doHandleStateChanges(PartitionStateMachine.scala:236)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ZkPartitionStateMachine.handleStateChanges(PartitionStateMachine.scala:157)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.PartitionStateMachine.triggerOnlineStateChangeForPartitions(PartitionStateMachine.scala:73)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.PartitionStateMachine.triggerOnlinePartitionStateChange(PartitionStateMachine.scala:58)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.PartitionStateMachine.startup(PartitionStateMachine.scala:41)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.KafkaController.onControllerFailover(KafkaController.scala:306)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.KafkaController.elect(KafkaController.scala:1404)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.KafkaController.processStartup(KafkaController.scala:1291)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.KafkaController.process(KafkaController.scala:1924)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.QueuedEvent.process(ControllerEventManager.scala:53)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ControllerEventManager$ControllerEventThread.process$1(ControllerEventManager.scala:136)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ControllerEventManager$ControllerEventThread.$anonfun$doWork$1(ControllerEventManager.scala:139)
sentry-self-hosted-kafka-1                                         |    at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
sentry-self-hosted-kafka-1                                         |    at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:31)
sentry-self-hosted-kafka-1                                         |    at kafka.controller.ControllerEventManager$ControllerEventThread.doWork(ControllerEventManager.scala:139)
sentry-self-hosted-kafka-1                                         |    at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:96)

Currently trying to get it running again.

@aldy505
Collaborator

aldy505 commented Jan 25, 2024

For this error:

sentry-self-hosted-metrics-consumer-1                              |     raise InvalidCacheBackendError(
sentry-self-hosted-metrics-consumer-1                              | django.core.cache.backends.base.InvalidCacheBackendError: Could not find backend 'django.core.cache.backends.memcached.MemcachedCache': Module "django.core.cache.backends.memcached" does not define a "MemcachedCache" attribute/class

Try applying the fix from this PR: https://github.com/getsentry/self-hosted/pull/2722/files

For this error:

sentry-self-hosted-kafka-1                                         | [2024-01-24 17:37:47,280] ERROR [Controller id=1002 epoch=50] Controller 1002 epoch 50 failed to change state for partition scheduled-subscriptions-metrics-0 from OfflinePartition to OnlinePartition (state.change.logger)
sentry-self-hosted-kafka-1                                         | kafka.common.StateChangeFailedException: Failed to elect leader for partition scheduled-subscriptions-metrics-0 under strategy OfflinePartitionLeaderElectionStrategy(false)

It's usually a problem between Kafka and Zookeeper. My guess is that Zookeeper didn't report the correct state (or maybe it hadn't been running long enough to be stable enough for Kafka to use). Given that, here's what I'd do:

  1. sudo docker compose down
  2. sudo docker volume rm sentry-kafka && sudo docker volume rm sentry-zookeeper
  3. sudo docker volume create sentry-zookeeper
  4. sudo docker compose up -d zookeeper, then wait 5-10 minutes
  5. sudo ./install.sh (you don't need to do docker compose down as it's taken care of)
  6. sudo docker compose up -d (or sudo docker compose --env-file .env.custom up -d)

Would you please try to use the stable release instead of the nightly images? Pin your version to 24.1.0.

@edgariscoding
Contributor Author

I will try those steps shortly. I just figured out what's causing the 502 error: /var/lib/docker/overlay2 is taking up 50GB of disk space, which caused the server to run out of storage. I'm currently reading up on how to clean this up.

@aldy505
Collaborator

aldy505 commented Jan 25, 2024

Here are some of the things I'd do (I've got that issue as well).

  1. Stop your Sentry instance: sudo docker compose down
  2. Modify your /etc/docker/daemon.json configuration to reduce the size of the persisted log files: https://develop.sentry.dev/self-hosted/troubleshooting/#docker-logs-disk-usage
  3. Truncate your current logs: sudo bash -c 'truncate -s 0 /var/lib/docker/containers/**/*-json.log'
  4. Prune everything in Docker (containers, builder cache, etc.): https://develop.sentry.dev/self-hosted/troubleshooting/#docker-image-and-builder-cleanup
  5. By now you'll probably have saved 2-3 GB of storage. One more thing you can do is lower SENTRY_RETENTION_DAYS for a while. If you don't want to do this, that's fine.
  6. Reboot your server, then start Sentry again.
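Before truncating anything in step 3, it can help to see which container logs are actually large. Here's a minimal sketch that scans the same directory the truncate command targets; `largest_container_logs` is a hypothetical helper, and it assumes the default json-file log layout under /var/lib/docker/containers.

```python
from pathlib import Path


def largest_container_logs(root="/var/lib/docker/containers", top=5):
    """Return the `top` largest *-json.log files under `root` as (size_bytes, path)."""
    logs = []
    for path in Path(root).glob("*/*-json.log"):
        try:
            logs.append((path.stat().st_size, str(path)))
        except OSError:
            continue  # file was rotated or removed while scanning
    return sorted(logs, reverse=True)[:top]


if __name__ == "__main__":
    # Reading /var/lib/docker usually requires root.
    for size, path in largest_container_logs():
        print(f"{size / 1_048_576:8.1f} MiB  {path}")
```

If this prints nothing or only small files, the logs aren't the culprit and the space is more likely in images, build cache, or volumes, which is what the pruning step addresses.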

@edgariscoding
Contributor Author

Holy crap thank you so much for all your help. Do you have a tip jar? lol

Good news.

I was able to free enough space by pruning. I followed your suggestions for daemon.json. My retention is lowered to 60. There was nothing to truncate.

After doing this I pulled 24.1.0 and reinstalled. I applied the fix for MemcachedCache, then started it up. It's now working fine.

Here's some output from the docker logs; it still shows some errors. These are errors that I've seen in the logs since I initially set up this server months ago. Do you think these can be fixed?

sentry-self-hosted-worker-1                                        | 01:25:00 [INFO] sentry.tasks.auto_ongoing_issues: auto_transition_issues_new_to_ongoing started (first_seen_lte=1705541100 first_seen_lte_datetime=datetime.datetime(2024, 1, 18, 1, 25, tzinfo=datetime.timezone.utc))
sentry-self-hosted-ingest-replay-recordings-1                      | 01:15:54 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-ingest-replay-recordings-1                      | 01:15:54 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-ingest-replay-recordings-1                      | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-ingest-replay-recordings-1                      | 0 added, 0 removed; done.
sentry-self-hosted-ingest-replay-recordings-1                      | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-ingest-replay-recordings-1                      | done.
sentry-self-hosted-attachments-consumer-1                          | 01:15:54 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-attachments-consumer-1                          | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-attachments-consumer-1                          | 0 added, 0 removed; done.
sentry-self-hosted-attachments-consumer-1                          | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-attachments-consumer-1                          | done.
sentry-self-hosted-attachments-consumer-1                          | %3|1706145455.783|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 70ms in state CONNECT)
sentry-self-hosted-attachments-consumer-1                          | %3|1706145456.708|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-attachments-consumer-1                          | %6|1706145477.667|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Disconnected while requesting ApiVersion: might be caused by incorrect security.protocol configuration (connecting to a SSL listener?) or broker version is < 0.10 (see api.version.request) (after 1895ms in state APIVERSION_QUERY)
sentry-self-hosted-attachments-consumer-1                          | %3|1706145478.808|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 1036ms in state CONNECT)
sentry-self-hosted-snuba-replays-consumer-1                        | %3|1706145445.340|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 16ms in state CONNECT)
sentry-self-hosted-snuba-replays-consumer-1                        | 2024-01-25 01:17:25,340 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 16ms in state CONNECT)
sentry-self-hosted-snuba-replays-consumer-1                        | 2024-01-25 01:17:25,341 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-replays-consumer-1                        | %3|1706145446.288|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-replays-consumer-1                        | %3|1706145446.317|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-replays-consumer-1                        | 2024-01-25 01:17:26,317 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1706145443.572|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 10ms in state CONNECT)
sentry-self-hosted-ingest-occurrences-1                            | 0 added, 0 removed; done.
sentry-self-hosted-ingest-occurrences-1                            | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-ingest-occurrences-1                            | done.
sentry-self-hosted-ingest-occurrences-1                            | %3|1706145451.844|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 6ms in state CONNECT)
sentry-self-hosted-ingest-occurrences-1                            | %3|1706145452.838|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-ingest-occurrences-1                            | %3|1706145482.891|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 29 identical error(s) suppressed)
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1706145444.556|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-postgres-1                                      | Replication config already present in pg_hba. Not changing anything.
sentry-self-hosted-postgres-1                                      | 
sentry-self-hosted-postgres-1                                      | PostgreSQL Database directory appears to contain a database; Skipping initialization
sentry-self-hosted-postgres-1                                      | 
sentry-self-hosted-postgres-1                                      | 2024-01-25 01:16:57.953 UTC [1] LOG:  starting PostgreSQL 14.5 (Debian 14.5-2.pgdg110+2) on x86_64-pc-linux-gnu, compiled by gcc (Debian 10.2.1-6) 10.2.1 20210110, 64-bit
sentry-self-hosted-postgres-1                                      | 2024-01-25 01:16:58.044 UTC [1] LOG:  listening on IPv4 address "0.0.0.0", port 5432
sentry-self-hosted-postgres-1                                      | 2024-01-25 01:16:58.044 UTC [1] LOG:  listening on IPv6 address "::", port 5432
sentry-self-hosted-subscription-consumer-events-1                  | 01:15:55 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-subscription-consumer-events-1                  | 01:15:55 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-subscription-consumer-events-1                  | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-subscription-consumer-events-1                  | 0 added, 0 removed; done.
sentry-self-hosted-subscription-consumer-events-1                  | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-subscription-consumer-events-1                  | done.
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | {"module": "builtins", "event": "Checking Clickhouse connections", "severity": "info", "timestamp": "2024-01-25T01:17:37.805437Z"}
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145457.818|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 29ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145457.865|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145458.782|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145458.845|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %6|1706145477.667|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Disconnected while requesting ApiVersion: might be caused by incorrect security.protocol configuration (connecting to a SSL listener?) or broker version is < 0.10 (see api.version.request) (after 1882ms in state APIVERSION_QUERY)
sentry-self-hosted-ingest-replay-recordings-1                      | %3|1706145474.560|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 30 identical error(s) suppressed)
sentry-self-hosted-ingest-occurrences-1                            | 01:18:08 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-occurrences'), index=0): 0}
sentry-self-hosted-ingest-replay-recordings-1                      | 01:18:08 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-replay-recordings'), index=0): 0}
sentry-self-hosted-snuba-consumer-1                                | %3|1706145477.845|FAIL|rdkafka#producer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 57ms in state CONNECT)
sentry-self-hosted-snuba-transactions-consumer-1                   | 2024-01-25 01:17:47,496 Successfully connected to Clickhouse: cluster_name=None
sentry-self-hosted-subscription-consumer-events-1                  | %3|1706145446.909|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 37ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145477.851|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 2ms in state CONNECT)
sentry-self-hosted-subscription-consumer-events-1                  | %3|1706145447.871|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-subscription-consumer-events-1                  | %3|1706145478.904|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 1028ms in state CONNECT, 30 identical error(s) suppressed)
sentry-self-hosted-subscription-consumer-events-1                  | 01:18:08 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='events-subscription-results'), index=0): 0}
sentry-self-hosted-postgres-1                                      | 2024-01-25 01:16:58.055 UTC [1] LOG:  listening on Unix socket "/var/run/postgresql/.s.PGSQL.5432"
sentry-self-hosted-postgres-1                                      | 2024-01-25 01:16:58.079 UTC [32] LOG:  database system was shut down at 2024-01-25 01:15:54 UTC
sentry-self-hosted-postgres-1                                      | 2024-01-25 01:16:58.122 UTC [1] LOG:  database system is ready to accept connections
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145478.808|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 1023ms in state CONNECT)
sentry-self-hosted-snuba-transactions-consumer-1                   | 2024-01-25 01:17:47,497 librdkafka log level: 6
sentry-self-hosted-snuba-consumer-1                                | %3|1706145477.845|FAIL|rdkafka#consumer-3| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 46ms in state CONNECT)
sentry-self-hosted-snuba-consumer-1                                | 2024-01-25 01:17:57,845 Error callback from librdKafka -193, _RESOLVE, kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 46ms in state CONNECT)
sentry-self-hosted-snuba-consumer-1                                | %3|1706145478.808|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 1052ms in state CONNECT)
sentry-self-hosted-snuba-consumer-1                                | %3|1706145479.756|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-consumer-1                                | %3|1706145479.788|FAIL|rdkafka#producer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-consumer-1                                | %3|1706145479.799|FAIL|rdkafka#consumer-3| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-consumer-1                                | 2024-01-25 01:17:59,799 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-consumer-1                                | %3|1706145480.788|FAIL|rdkafka#producer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-consumer-1                                | 2024-01-25 01:18:08,368 New partitions assigned: {Partition(topic=Topic(name='events'), index=0): 2}
sentry-self-hosted-metrics-consumer-1                              | 01:15:55 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-metrics-consumer-1                              | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-metrics-consumer-1                              | 0 added, 0 removed; done.
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | %3|1706145461.200|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 9ms in state CONNECT)
sentry-self-hosted-smtp-1                                          |    10 LOG: MAIN
sentry-self-hosted-metrics-consumer-1                              | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-metrics-consumer-1                              | done.
sentry-self-hosted-metrics-consumer-1                              | %3|1706145462.233|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-metrics-consumer-1                              | %3|1706145463.227|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-events-consumer-1                               | Waiting up to 2 seconds
sentry-self-hosted-events-consumer-1                               | Press Ctrl-C to quit
sentry-self-hosted-events-consumer-1                               | 01:15:55 [ERROR] sentry_sdk.errors: Unexpected status code: 500 (body: b'<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">\n<html>\n<head>\n<meta http-equiv="Content-Type" content="text/html;charset=utf-8">\n<title>Service Unavailable</title>\n<style type="text/css">\nbody, p, h1 {\n  font-family: Verdana, Arial, Helvetica, sans-serif;\n}\nh2 {\n  font-family: Arial, Helvetica, sans-serif;\n  color: #b10b29;\n}\n</style>\n</head>\n<body>\n<h2>Service Unavailable</h2>\n<p>The service is temporarily unavailable. Please try again later.</p>\n</body>\n</html>\n')
sentry-self-hosted-events-consumer-1                               | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-events-consumer-1                               | 0 added, 0 removed; done.
sentry-self-hosted-events-consumer-1                               | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-events-consumer-1                               | done.
sentry-self-hosted-events-consumer-1                               | %3|1706145455.476|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 39ms in state CONNECT)
sentry-self-hosted-events-consumer-1                               | %3|1706145456.436|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-events-consumer-1                               | 01:18:08 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-events'), index=0): 2}
sentry-self-hosted-smtp-1                                          |    10   exim 4.92 daemon started: pid=10, no queue runs, listening for SMTP on port 25 (IPv6 and IPv4)
sentry-self-hosted-smtp-1                                          |    10 LOG: MAIN
sentry-self-hosted-smtp-1                                          |    10   exim 4.92 daemon started: pid=10, no queue runs, listening for SMTP on port 25 (IPv6 and IPv4)
sentry-self-hosted-metrics-consumer-1                              | %3|1706145478.233|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 3ms in state CONNECT)
sentry-self-hosted-metrics-consumer-1                              | %3|1706145480.231|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-attachments-consumer-1                          | 01:18:08 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-attachments'), index=0): 0}
sentry-self-hosted-snuba-transactions-consumer-1                   | %3|1706145467.514|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-transactions-consumer-1                   | %3|1706145467.538|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-transactions-consumer-1                   | 2024-01-25 01:17:47,544 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-transactions-consumer-1                   | 2024-01-25 01:17:47,544 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-transactions-consumer-1                   | %3|1706145468.500|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-transactions-consumer-1                   | %3|1706145468.537|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-transactions-consumer-1                   | 2024-01-25 01:17:48,537 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-subscription-consumer-generic-metrics-1         | 01:15:54 [INFO] arroyo.processing.processor: <sentry.consumers.validate_schema.ValidateSchema object at 0x7fc0b02632e0> exited successfully, releasing assignment.
sentry-self-hosted-subscription-consumer-generic-metrics-1         | 01:15:54 [INFO] arroyo.processing.processor: Partition revocation complete.
sentry-self-hosted-subscription-consumer-generic-metrics-1         | 01:15:54 [INFO] arroyo.processing.processor: Stopped
sentry-self-hosted-subscription-consumer-generic-metrics-1         | Updating certificates in /etc/ssl/certs...
sentry-self-hosted-subscription-consumer-generic-metrics-1         | 0 added, 0 removed; done.
sentry-self-hosted-subscription-consumer-generic-metrics-1         | Running hooks in /etc/ca-certificates/update.d...
sentry-self-hosted-subscription-consumer-generic-metrics-1         | done.
sentry-self-hosted-subscription-consumer-generic-metrics-1         | %3|1706145459.057|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-subscription-consumer-generic-metrics-1         | %3|1706145460.060|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-subscription-consumer-generic-metrics-1         | 01:18:08 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='generic-metrics-subscription-results'), index=0): 0}
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-25 01:17:41,201 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 9ms in state CONNECT)
sentry-self-hosted-snuba-replays-consumer-1                        | %3|1706145476.291|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 30 identical error(s) suppressed)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | %3|1706145479.849|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-subscription-consumer-transactions-1      | 2024-01-25 01:18:08,858 New partitions assigned: {Partition(topic=Topic(name='snuba-transactions-commit-log'), index=0): 0}
sentry-self-hosted-metrics-consumer-1                              | 01:18:11 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='ingest-metrics'), index=0): 1}
sentry-self-hosted-snuba-replays-consumer-1                        | %3|1706145476.318|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 30 identical error(s) suppressed)
sentry-self-hosted-snuba-replays-consumer-1                        | 2024-01-25 01:17:56,318 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 30 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-25 01:17:41,201 Error callback from librdKafka -187, _ALL_BROKERS_DOWN, 1/1 brokers are down
sentry-self-hosted-snuba-replays-consumer-1                        | 2024-01-25 01:18:08,384 New partitions assigned: {Partition(topic=Topic(name='ingest-replay-events'), index=0): 0}
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145454.212|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145454.232|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145478.252|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 8ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145478.255|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 2ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145480.245|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145480.253|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145484.245|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 4 identical error(s) suppressed)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | %3|1706145484.254|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 4 identical error(s) suppressed)
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 01:18:11 [INFO] arroyo.processing.processor: New partitions assigned: {Partition(topic=Topic(name='generic-events'), index=0): 0}
sentry-self-hosted-post-process-forwarder-issue-platform-1         | 01:18:11 [INFO] sentry.post_process_forwarder.post_process_forwarder: Starting multithreaded post process forwarder
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | %3|1706145462.190|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-25 01:17:42,190 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | %3|1706145478.193|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 3ms in state CONNECT)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-25 01:17:58,193 Error callback from librdKafka -193, _RESOLVE, kafka:9092/bootstrap: Failed to resolve 'kafka:9092': Name or service not known (after 3ms in state CONNECT)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | %3|1706145480.190|FAIL|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-25 01:18:00,190 Error callback from librdKafka -195, _TRANSPORT, kafka:9092/bootstrap: Connect to ipv4#172.21.0.44:9092 failed: Connection refused (after 0ms in state CONNECT)
sentry-self-hosted-snuba-profiling-profiles-consumer-1             | 2024-01-25 01:18:08,389 New partitions assigned: {Partition(topic=Topic(name='processed-profiles'), index=0): 0}
sentry-self-hosted-snuba-transactions-consumer-1                   | 2024-01-25 01:18:08,550 New partitions assigned: {Partition(topic=Topic(name='transactions'), index=0): 0}

@aldy505
Collaborator

aldy505 commented Jan 25, 2024

I see a lot of "Connection refused" errors when connecting to Kafka. Is the kafka container healthy, though? Try sudo docker compose logs -t kafka.
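
To get a quick picture of which containers are failing to reach the broker, and whether the failure is a refused connection or a DNS failure on the kafka hostname, something like the following might help. This is only a rough sketch: the function name tally_kafka_failures and the three labels (refused, unresolved, other) are made up for illustration, and it assumes the log lines keep the "container | message" layout shown in the output above.

```python
import re
import sys
from collections import Counter

# Matches the "<container> | <message>" prefix that docker compose puts on each log line.
LINE_RE = re.compile(r"^(?P<container>\S+)\s*\|\s?(?P<message>.*)$")


def tally_kafka_failures(lines):
    """Count librdkafka FAIL lines per container, split by failure kind.

    The kinds ("refused", "unresolved", "other") are illustrative labels,
    not anything defined by librdkafka itself.
    """
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue
        message = match.group("message")
        # librdkafka failure lines look like "%3|<epoch>|FAIL|rdkafka#consumer-1| ..."
        if "|FAIL|rdkafka#" not in message:
            continue
        if "Connection refused" in message:
            kind = "refused"
        elif "Failed to resolve" in message:
            kind = "unresolved"
        else:
            kind = "other"
        counts[(match.group("container"), kind)] += 1
    return counts


if __name__ == "__main__":
    # Reads log lines from standard input and prints one row per container and kind.
    for (container, kind), n in sorted(tally_kafka_failures(sys.stdin).items()):
        print(f"{container}\t{kind}\t{n}")
```

Assuming the script is saved as tally.py (a hypothetical name), it could be fed with sudo docker compose logs --no-color | python tally.py. Many "refused" rows would suggest the broker process is not listening yet, while "unresolved" rows would point at the kafka container being absent from the Docker network at that moment.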

@edgariscoding
Contributor Author

Sure, here's a chunk of the output from sudo docker-compose logs -t kafka:

sentry-self-hosted-kafka-1  | 2024-01-25T01:15:56.846257175Z [2024-01-25 01:15:56,842] WARN Session 0x100019c56740001 for server zookeeper:2181, unexpected error, closing socket connection and attempting reconnect (org.apache.zookeeper.ClientCnxn)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:56.846304076Z java.lang.IllegalArgumentException: Unable to canonicalize address zookeeper:2181 because it's not resolvable
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:56.846309176Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:71)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:56.846314176Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:39)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:56.846318276Z    at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1087)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:56.846322076Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1139)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957269027Z [2024-01-25 01:15:57,946] ERROR Unable to resolve address: zookeeper:2181 (org.apache.zookeeper.client.StaticHostProvider)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957300527Z java.net.UnknownHostException: zookeeper
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957305527Z    at java.net.InetAddress.getAllByName0(InetAddress.java:1281)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957310127Z    at java.net.InetAddress.getAllByName(InetAddress.java:1193)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957314828Z    at java.net.InetAddress.getAllByName(InetAddress.java:1127)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957319228Z    at org.apache.zookeeper.client.StaticHostProvider$1.getAllByName(StaticHostProvider.java:92)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957324028Z    at org.apache.zookeeper.client.StaticHostProvider.resolve(StaticHostProvider.java:147)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957328128Z    at org.apache.zookeeper.client.StaticHostProvider.next(StaticHostProvider.java:375)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:57.957332528Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1137)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:58.246125183Z [2024-01-25 01:15:58,242] WARN Session 0x100019c56740001 for server zookeeper:2181, unexpected error, closing socket connection and attempting reconnect (org.apache.zookeeper.ClientCnxn)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:58.246168083Z java.lang.IllegalArgumentException: Unable to canonicalize address zookeeper:2181 because it's not resolvable
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:58.246174883Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:71)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:58.246181083Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:39)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:58.246187183Z    at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1087)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:58.246192683Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1139)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358481743Z [2024-01-25 01:15:59,343] ERROR Unable to resolve address: zookeeper:2181 (org.apache.zookeeper.client.StaticHostProvider)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358508143Z java.net.UnknownHostException: zookeeper
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358512943Z    at java.net.InetAddress.getAllByName0(InetAddress.java:1281)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358517443Z    at java.net.InetAddress.getAllByName(InetAddress.java:1193)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358521743Z    at java.net.InetAddress.getAllByName(InetAddress.java:1127)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358538343Z    at org.apache.zookeeper.client.StaticHostProvider$1.getAllByName(StaticHostProvider.java:92)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358542643Z    at org.apache.zookeeper.client.StaticHostProvider.resolve(StaticHostProvider.java:147)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358546943Z    at org.apache.zookeeper.client.StaticHostProvider.next(StaticHostProvider.java:375)
sentry-self-hosted-kafka-1  | 2024-01-25T01:15:59.358551043Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1137)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:00.323606208Z [2024-01-25 01:16:00,323] WARN Session 0x100019c56740001 for server zookeeper:2181, unexpected error, closing socket connection and attempting reconnect (org.apache.zookeeper.ClientCnxn)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:00.323628808Z java.lang.IllegalArgumentException: Unable to canonicalize address zookeeper:2181 because it's not resolvable
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:00.323633608Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:71)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:00.323638308Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:39)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:00.323642208Z    at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1087)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:00.323646108Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1139)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424220597Z [2024-01-25 01:16:01,423] ERROR Unable to resolve address: zookeeper:2181 (org.apache.zookeeper.client.StaticHostProvider)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424249797Z java.net.UnknownHostException: zookeeper
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424253497Z    at java.net.InetAddress.getAllByName0(InetAddress.java:1281)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424256697Z    at java.net.InetAddress.getAllByName(InetAddress.java:1193)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424259897Z    at java.net.InetAddress.getAllByName(InetAddress.java:1127)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424262997Z    at org.apache.zookeeper.client.StaticHostProvider$1.getAllByName(StaticHostProvider.java:92)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424265897Z    at org.apache.zookeeper.client.StaticHostProvider.resolve(StaticHostProvider.java:147)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424268697Z    at org.apache.zookeeper.client.StaticHostProvider.next(StaticHostProvider.java:375)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:01.424271497Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1137)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:02.100296605Z [2024-01-25 01:16:02,099] WARN Session 0x100019c56740001 for server zookeeper:2181, unexpected error, closing socket connection and attempting reconnect (org.apache.zookeeper.ClientCnxn)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:02.100362506Z java.lang.IllegalArgumentException: Unable to canonicalize address zookeeper:2181 because it's not resolvable
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:02.100375806Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:71)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:02.100386606Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:39)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:02.100396506Z    at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1087)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:02.100412906Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1139)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200935794Z [2024-01-25 01:16:03,200] ERROR Unable to resolve address: zookeeper:2181 (org.apache.zookeeper.client.StaticHostProvider)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200975894Z java.net.UnknownHostException: zookeeper
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200981394Z    at java.net.InetAddress.getAllByName0(InetAddress.java:1281)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200985994Z    at java.net.InetAddress.getAllByName(InetAddress.java:1193)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200989894Z    at java.net.InetAddress.getAllByName(InetAddress.java:1127)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200993594Z    at org.apache.zookeeper.client.StaticHostProvider$1.getAllByName(StaticHostProvider.java:92)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200997494Z    at org.apache.zookeeper.client.StaticHostProvider.resolve(StaticHostProvider.java:147)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.201001194Z    at org.apache.zookeeper.client.StaticHostProvider.next(StaticHostProvider.java:375)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.201004894Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1137)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.436442125Z [2024-01-25 01:16:03,435] WARN Session 0x100019c56740001 for server zookeeper:2181, unexpected error, closing socket connection and attempting reconnect (org.apache.zookeeper.ClientCnxn)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.436482126Z java.lang.IllegalArgumentException: Unable to canonicalize address zookeeper:2181 because it's not resolvable
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.436488026Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:71)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.436492226Z    at org.apache.zookeeper.SaslServerPrincipal.getServerPrincipal(SaslServerPrincipal.java:39)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.436496226Z    at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1087)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.436499926Z    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1139)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:07.741317921Z ===> User
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:07.749379135Z uid=0(root) gid=0(root) groups=0(root)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:07.749399635Z ===> Configuring ...
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:37.694477005Z ===> Running preflight checks ... 
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:37.724311494Z ===> Check if /var/lib/kafka/data is writable ...
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:42.101685829Z ===> Check if Zookeeper is healthy ...
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.560479373Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:zookeeper.version=3.5.8-f439ca583e70862c3068a1f2a7d4d068eec33315, built on 05/04/2020 15:53 GMT
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.560563173Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:host.name=7747c93f7ef6
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.560670174Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.version=1.8.0_222
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.560763675Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.vendor=Azul Systems, Inc.
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.560851175Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.home=/usr/lib/jvm/zulu-8-amd64/jre
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.572365849Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.class.path=/etc/confluent/docker/docker-utils.jar
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.572544550Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.library.path=/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.572564250Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.io.tmpdir=/tmp
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.572675451Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.compiler=<NA>
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.572767452Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.name=Linux
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.572876352Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.arch=amd64
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.573007753Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.version=6.2.0-1018-azure
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.573288955Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.name=root
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.580360300Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.home=/root
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.580448101Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.dir=/
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.580533601Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.free=236MB
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.580625102Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.max=3552MB
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.580752903Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.total=240MB
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.598891919Z [main] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=zookeeper:2181 sessionTimeout=40000 watcher=io.confluent.admin.utils.ZookeeperConnectionWatcher@65b3120a
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.654467875Z [main] INFO org.apache.zookeeper.common.X509Util - Setting -D jdk.tls.rejectClientInitiatedRenegotiation=true to disable client-initiated TLS renegotiation
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.686485381Z [main] INFO org.apache.zookeeper.ClientCnxnSocket - jute.maxbuffer value is 4194304 Bytes
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.760848457Z [main] INFO org.apache.zookeeper.ClientCnxn - zookeeper.request.timeout value is 0. feature enabled=
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:47.843690688Z [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server zookeeper/172.21.0.51:2181. Will not attempt to authenticate using SASL (unknown error)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:48.487267414Z [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established, initiating session, client: /172.21.0.44:53730, server: zookeeper/172.21.0.51:2181
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:48.557772166Z [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server zookeeper/172.21.0.51:2181, sessionid = 0x100000150ed0000, negotiated timeout = 40000
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:48.729115565Z [main] INFO org.apache.zookeeper.ZooKeeper - Session: 0x100000150ed0000 closed
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:48.729476667Z [main-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x100000150ed0000
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:49.084078841Z ===> Launching ... 
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:49.160239229Z ===> Launching kafka ... 
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:51.723934565Z [2024-01-25 01:17:51,723] INFO Registered kafka:type=kafka.Log4jController MBean (kafka.utils.Log4jControllerRegistration$)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:53.159636298Z [2024-01-25 01:17:53,159] WARN The package io.confluent.support.metrics.collectors.FullCollector for collecting the full set of support metrics could not be loaded, so we are reverting to anonymous, basic metric collection. If you are a Confluent customer, please refer to the Confluent Platform documentation, section Proactive Support, on how to activate full metrics collection. (io.confluent.support.metrics.KafkaSupportConfig)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:53.159848097Z [2024-01-25 01:17:53,159] WARN The support metrics collection feature ("Metrics") of Proactive Support is disabled. (io.confluent.support.metrics.SupportedServerStartable)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.260233450Z [2024-01-25 01:17:55,259] INFO Starting the log cleaner (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.317237996Z [2024-01-25 01:17:55,317] INFO [kafka-log-cleaner-thread-0]: Starting (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.617017514Z [2024-01-25 01:17:55,616] INFO Awaiting socket connections on 0.0.0.0:9092. (kafka.network.Acceptor)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.654726343Z [2024-01-25 01:17:55,654] INFO [SocketServer brokerId=1001] Created data-plane acceptor and processors for endpoint : EndPoint(0.0.0.0,9092,ListenerName(PLAINTEXT),PLAINTEXT) (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.656025251Z [2024-01-25 01:17:55,655] INFO [SocketServer brokerId=1001] Started 1 acceptor threads for data-plane (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.755421454Z [2024-01-25 01:17:55,755] INFO Creating /brokers/ids/1001 (is it secure? false) (kafka.zk.KafkaZkClient)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.785275035Z [2024-01-25 01:17:55,785] ERROR Error while creating ephemeral at /brokers/ids/1001, node already exists and owner '72059365014896641' does not match current session '72057599690604545' (kafka.zk.KafkaZkClient$CheckedEphemeral)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797253708Z [2024-01-25 01:17:55,793] ERROR [KafkaServer id=1001] Fatal error during KafkaServer startup. Prepare to shutdown (kafka.server.KafkaServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797285008Z org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797292408Z    at org.apache.zookeeper.KeeperException.create(KeeperException.java:126)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797298008Z    at kafka.zk.KafkaZkClient$CheckedEphemeral.getAfterNodeExists(KafkaZkClient.scala:1819)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797302808Z    at kafka.zk.KafkaZkClient$CheckedEphemeral.create(KafkaZkClient.scala:1757)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797307708Z    at kafka.zk.KafkaZkClient.checkedEphemeralCreate(KafkaZkClient.scala:1724)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797312608Z    at kafka.zk.KafkaZkClient.registerBroker(KafkaZkClient.scala:95)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797317808Z    at kafka.server.KafkaServer.startup(KafkaServer.scala:294)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797322508Z    at io.confluent.support.metrics.SupportedServerStartable.startup(SupportedServerStartable.java:114)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797327108Z    at io.confluent.support.metrics.SupportedKafka.main(SupportedKafka.java:66)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.799276020Z [2024-01-25 01:17:55,798] INFO [SocketServer brokerId=1001] Stopping socket server request processors (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.804108749Z [2024-01-25 01:17:55,803] INFO [SocketServer brokerId=1001] Stopped socket server request processors (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:56.090810889Z [2024-01-25 01:17:56,090] INFO Shutting down the log cleaner. (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:56.091169591Z [2024-01-25 01:17:56,091] INFO [kafka-log-cleaner-thread-0]: Shutting down (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:56.091562993Z [2024-01-25 01:17:56,091] INFO [kafka-log-cleaner-thread-0]: Stopped (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:56.091576093Z [2024-01-25 01:17:56,091] INFO [kafka-log-cleaner-thread-0]: Shutdown completed (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:57.310023486Z [2024-01-25 01:17:57,309] INFO [SocketServer brokerId=1001] Shutting down socket server (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:57.337723354Z [2024-01-25 01:17:57,337] INFO [SocketServer brokerId=1001] Shutdown completed (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:58.769516441Z ===> User
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:58.771615554Z uid=0(root) gid=0(root) groups=0(root)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:58.771796355Z ===> Configuring ...
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:01.349125406Z ===> Running preflight checks ... 
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:01.353749815Z ===> Check if /var/lib/kafka/data is writable ...
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:01.702816449Z ===> Check if Zookeeper is healthy ...
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115499898Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:zookeeper.version=3.5.8-f439ca583e70862c3068a1f2a7d4d068eec33315, built on 05/04/2020 15:53 GMT
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115541198Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:host.name=7747c93f7ef6
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115548398Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.version=1.8.0_222
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115552998Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.vendor=Azul Systems, Inc.
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115557899Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.home=/usr/lib/jvm/zulu-8-amd64/jre
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115562399Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.class.path=/etc/confluent/docker/docker-utils.jar
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115568499Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.library.path=/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115573699Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.io.tmpdir=/tmp
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115578899Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:java.compiler=<NA>
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115584399Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.name=Linux
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115592099Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.arch=amd64
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115597299Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.version=6.2.0-1018-azure
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115602199Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.name=root
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115623799Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.home=/root
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115629799Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:user.dir=/
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115671899Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.free=236MB
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115693799Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.max=3552MB
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.115761399Z [main] INFO org.apache.zookeeper.ZooKeeper - Client environment:os.memory.total=240MB
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.118022003Z [main] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=zookeeper:2181 sessionTimeout=40000 watcher=io.confluent.admin.utils.ZookeeperConnectionWatcher@65b3120a
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.121859110Z [main] INFO org.apache.zookeeper.common.X509Util - Setting -D jdk.tls.rejectClientInitiatedRenegotiation=true to disable client-initiated TLS renegotiation
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.127148320Z [main] INFO org.apache.zookeeper.ClientCnxnSocket - jute.maxbuffer value is 4194304 Bytes
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.133642131Z [main] INFO org.apache.zookeeper.ClientCnxn - zookeeper.request.timeout value is 0. feature enabled=
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.141528346Z [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server zookeeper/172.21.0.51:2181. Will not attempt to authenticate using SASL (unknown error)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.187533029Z [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket connection established, initiating session, client: /172.21.0.44:42724, server: zookeeper/172.21.0.51:2181
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.194712842Z [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Session establishment complete on server zookeeper/172.21.0.51:2181, sessionid = 0x100000150ed0002, negotiated timeout = 40000
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.303345340Z [main] INFO org.apache.zookeeper.ZooKeeper - Session: 0x100000150ed0002 closed
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.303414140Z [main-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x100000150ed0002
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.348018421Z ===> Launching ... 
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.353851731Z ===> Launching kafka ... 
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:02.768738885Z [2024-01-25 01:18:02,767] INFO Registered kafka:type=kafka.Log4jController MBean (kafka.utils.Log4jControllerRegistration$)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:03.200690270Z [2024-01-25 01:18:03,200] WARN The package io.confluent.support.metrics.collectors.FullCollector for collecting the full set of support metrics could not be loaded, so we are reverting to anonymous, basic metric collection. If you are a Confluent customer, please refer to the Confluent Platform documentation, section Proactive Support, on how to activate full metrics collection. (io.confluent.support.metrics.KafkaSupportConfig)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:03.200821470Z [2024-01-25 01:18:03,200] WARN The support metrics collection feature ("Metrics") of Proactive Support is disabled. (io.confluent.support.metrics.SupportedServerStartable)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:03.997508617Z [2024-01-25 01:18:03,997] INFO Starting the log cleaner (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.049036311Z [2024-01-25 01:18:04,048] INFO [kafka-log-cleaner-thread-0]: Starting (kafka.log.LogCleaner)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.287910145Z [2024-01-25 01:18:04,287] INFO Awaiting socket connections on 0.0.0.0:9092. (kafka.network.Acceptor)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.313341191Z [2024-01-25 01:18:04,313] INFO [SocketServer brokerId=1001] Created data-plane acceptor and processors for endpoint : EndPoint(0.0.0.0,9092,ListenerName(PLAINTEXT),PLAINTEXT) (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.314363793Z [2024-01-25 01:18:04,314] INFO [SocketServer brokerId=1001] Started 1 acceptor threads for data-plane (kafka.network.SocketServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.380304713Z [2024-01-25 01:18:04,380] INFO Creating /brokers/ids/1001 (is it secure? false) (kafka.zk.KafkaZkClient)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.395751441Z [2024-01-25 01:18:04,395] INFO Stat of the created znode at /brokers/ids/1001 is: 535,535,1706145484387,1706145484387,1,0,0,72057599690604547,180,0,535
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.395783841Z  (kafka.zk.KafkaZkClient)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.396154541Z [2024-01-25 01:18:04,396] INFO Registered broker 1001 at path /brokers/ids/1001 with addresses: ArrayBuffer(EndPoint(kafka,9092,ListenerName(PLAINTEXT),PLAINTEXT)), czxid (broker epoch): 535 (kafka.zk.KafkaZkClient)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.592745499Z [2024-01-25 01:18:04,592] INFO [/config/changes-event-process-thread]: Starting (kafka.common.ZkNodeChangeNotificationListener$ChangeEventProcessThread)
sentry-self-hosted-kafka-1  | 2024-01-25T01:18:04.720332130Z [2024-01-25 01:18:04,720] INFO [SocketServer brokerId=1001] Started data-plane processors for 1 acceptors (kafka.network.SocketServer)

@aldy505
Collaborator

aldy505 commented Jan 27, 2024

It seems that Kafka failed to connect to Zookeeper. You should check whether your Zookeeper container is healthy and whether its data is corrupted.
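One detail in the Kafka log above may help narrow this down. The NodeExists error prints ZooKeeper session IDs in decimal, while the earlier WARN line prints its session ID in hex. The sketch below is plain Python, not part of any Sentry or Kafka tooling, and the function names are made up for illustration. It converts the decimal IDs to hex using two lines copied from the log so they can be compared.

```python
import re

# Two lines copied from the Kafka log earlier in this thread.
WARN_LINE = (
    "[2024-01-25 01:16:03,435] WARN Session 0x100019c56740001 for server "
    "zookeeper:2181, unexpected error, closing socket connection and "
    "attempting reconnect (org.apache.zookeeper.ClientCnxn)"
)
ERROR_LINE = (
    "[2024-01-25 01:17:55,785] ERROR Error while creating ephemeral at "
    "/brokers/ids/1001, node already exists and owner '72059365014896641' "
    "does not match current session '72057599690604545' "
    "(kafka.zk.KafkaZkClient$CheckedEphemeral)"
)


def znode_owner_and_session(line):
    """Return (owner, current_session) as hex strings, or None if no match."""
    m = re.search(r"owner '(\d+)' does not match current session '(\d+)'", line)
    if m is None:
        return None
    return tuple(hex(int(group)) for group in m.groups())


def warned_session(line):
    """Return the hex session ID from a ClientCnxn WARN line, or None."""
    m = re.search(r"WARN Session (0x[0-9a-f]+) for server", line)
    return m.group(1) if m else None


owner, current = znode_owner_and_session(ERROR_LINE)
print("znode owner:    ", owner)
print("current session:", current)
print("owner is the session that was failing to reconnect:",
      owner == warned_session(WARN_LINE))
```

If the owner matches the session that was failing to reconnect, that would be consistent with the broker's old ephemeral znode still being held by the previous session when Kafka restarted. ZooKeeper removes ephemeral znodes once the owning session closes or expires, and the next restart in the log does register broker 1001 successfully.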

@edgariscoding
Contributor Author

edgariscoding commented Jan 27, 2024 via email

@aldy505
Collaborator

aldy505 commented Jan 30, 2024

To check the Zookeeper logs, run sudo docker compose logs zookeeper --since 10m (this shows the last 10 minutes).
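To skim a longer dump, a small script can tally ERROR and WARN lines by logger class. This is a rough sketch: the regular expression assumes the bracketed-timestamp layout seen in the Kafka output earlier in this thread, and the Zookeeper container's own log layout may differ, so treat the pattern as an assumption to adjust. The sample lines are copied from that Kafka log, and the function name is hypothetical.

```python
import re
from collections import Counter

# Assumed layout: "... [timestamp] LEVEL message (logger.class)"
LINE_RE = re.compile(r"\] (ERROR|WARN) .*\(([\w.$]+)\)\s*$")

SAMPLE = """\
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200935794Z [2024-01-25 01:16:03,200] ERROR Unable to resolve address: zookeeper:2181 (org.apache.zookeeper.client.StaticHostProvider)
sentry-self-hosted-kafka-1  | 2024-01-25T01:16:03.200975894Z java.net.UnknownHostException: zookeeper
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.797253708Z [2024-01-25 01:17:55,793] ERROR [KafkaServer id=1001] Fatal error during KafkaServer startup. Prepare to shutdown (kafka.server.KafkaServer)
sentry-self-hosted-kafka-1  | 2024-01-25T01:17:55.260233450Z [2024-01-25 01:17:55,259] INFO Starting the log cleaner (kafka.log.LogCleaner)
"""


def tally_problems(log_text):
    """Count ERROR/WARN lines, keyed by (level, logger class)."""
    counts = Counter()
    for line in log_text.splitlines():
        m = LINE_RE.search(line)
        if m:
            counts[(m.group(1), m.group(2))] += 1
    return counts


for (level, logger), n in tally_problems(SAMPLE).most_common():
    print(f"{n:4d}  {level:5s}  {logger}")
```

To use it on real output, redirect the command above into a file (for example with > zk.log), read the file in Python, and pass its contents to tally_problems.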

If you can't find the root cause of the issue, you can use Redpanda instead of Kafka + Zookeeper; it uses much less RAM. I made a simple guide on Sentry's Discord here: https://discord.com/channels/621778831602221064/796028405833007104/1201076383426809948 -- but please be aware that this is not officially supported by Sentry's employees. For any future update, you'll have to run git reset --hard HEAD and re-apply all the changes.

For those complaining about the usage of Kafka and Zookeeper, I finally got time to play around and migrate everything to Redpanda. The configuration changes can be seen here: master...aldy505:sentry-self-hosted:feat/redpanda

There is no data migration from Kafka to Redpanda: Kafka's retention is only 1 day, so it is not worth the bother.

With a 2 GB memory limit, Redpanda uses only 500 MB of RAM. See screenshot below.

Steps to do if you want to use Redpanda:

  1. Apply the changes shown in the GitHub diff (they modify docker-compose.yml, install/create-docker-volumes.sh, and install/create-kafka-topics.sh).
  2. Run sudo ./install.sh, followed by sudo docker compose up -d (or sudo docker compose --env-file .env.custom up -d).
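To compare memory use against a container's limit after a change like this, one option is to parse the MemUsage column that docker stats prints (for example via docker stats --no-stream --format "{{.Name}}\t{{.MemUsage}}"). The sketch below is a hypothetical helper, not part of the guide. The sample string is made up to mirror the 500 MB and 2 GB figures quoted above, not a real measurement, and it assumes the binary units (MiB, GiB) that docker stats normally prints.

```python
import re

# Binary size units as printed by docker stats.
UNITS = {"B": 1, "KiB": 1024, "MiB": 1024**2, "GiB": 1024**3, "TiB": 1024**4}


def parse_size(text):
    """Convert a string such as '500MiB' into a number of bytes."""
    m = re.fullmatch(r"([\d.]+)\s*([KMGT]iB|B)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised size: {text!r}")
    return float(m.group(1)) * UNITS[m.group(2)]


def mem_fraction(mem_usage):
    """Return used/limit for a MemUsage value like '500MiB / 2GiB'."""
    used, limit = (parse_size(part) for part in mem_usage.split("/"))
    return used / limit


# Hypothetical value mirroring the figures quoted in the comment above.
sample = "500MiB / 2GiB"
print(f"{mem_fraction(sample):.1%} of the limit in use")
```

Running the same check on the whole stack before and after the switch would show whether the host's RAM pressure actually drops.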

@getsantry

getsantry bot commented Feb 21, 2024

This issue has gone three weeks without activity. In another week, I will close it.

But! If you comment or otherwise update it, I will reset the clock, and if you remove the label Waiting for: Community, I will leave it alone ... forever!


"A weed is but an unloved flower." ― Ella Wheeler Wilcox 🥀

@getsantry getsantry bot added the Stale label Feb 21, 2024
@getsantry getsantry bot closed this as not planned Won't fix, can't repro, duplicate, stale Feb 29, 2024
@edgariscoding
Contributor Author

@aldy505

Question for you. You mentioned that it may be beneficial to set a rate limiter middleware on the reverse proxy.

I noticed that the Sentry > Admin > Settings page has a "Rate Limit" option, described as: "The maximum number of events the system should accept per minute. A value of 0 will disable the default rate limit."

Could this be used in place of a rate limiter middleware?
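For readers unfamiliar with the idea, here is a minimal sketch of what a per-minute event limit means, including the "0 disables the limit" behaviour quoted above. It is a generic fixed-window counter written for illustration. The class name is hypothetical, and it is not Sentry's implementation, which this thread does not describe.

```python
class PerMinuteLimiter:
    """Fixed-window limiter: accept at most N events per clock minute."""

    def __init__(self, max_events_per_minute):
        self.limit = max_events_per_minute
        self.window = None  # index of the minute currently being counted
        self.count = 0      # events accepted in that minute

    def allow(self, now_seconds):
        # A limit of 0 disables rate limiting, as in the setting's description.
        if self.limit == 0:
            return True
        window = int(now_seconds // 60)
        if window != self.window:
            # A new minute has started, so reset the counter.
            self.window = window
            self.count = 0
        if self.count >= self.limit:
            return False
        self.count += 1
        return True


limiter = PerMinuteLimiter(2)
for t in (0, 1, 2, 60):
    print(t, limiter.allow(t))
```

Note that a limit like this is evaluated inside the application, so each event still has to reach the application before it can be rejected, whereas a limiter on the reverse proxy rejects requests before they reach it. Whether that difference matters for the RAM problem here is not something this sketch can answer.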

@github-actions github-actions bot locked and limited conversation to collaborators Mar 24, 2024