Skip to content
This repository was archived by the owner on Dec 14, 2022. It is now read-only.
This repository was archived by the owner on Dec 14, 2022. It is now read-only.

Usage of Elastic causes Cassandra mismatches (read repairs) #180

@nvidgren

Description

@nvidgren

Axway APIM-ELK version
4.1.0

API-Manager Version
7.7 august update

Question
Starting the Elastic cluster and Filebeats is causing the API GW Cassandra database to experience considerable amount of mismatches and read repairs (readrepairrepairedblocking). Is this phenomenon known and what could cause it?

What I've tried so far
The first idea was to reduce the calls API Builder does towards API GW by enabling the lookup cache with TTL 30 minutes. Then APM was enabled to see if something comes up in its monitoring.
This did not work and so tracing was enabled in Cassandra but even trace logs are not revealing any information that could be of use to analyze the root cause

Additional information
We are running a standard ELK cluster on 4 nodes. API Builder/Logstash is currently running only on one. Filebeats are installed on 6 API GW nodes but the issue occurs already when only one is started. If no Filebeats are running the mismatches are not occurring.
The ELK configuration is standard apart from the settings described above. Filebeats are stand-alone installations but the Axway configuration file is being used.

The issue only occurs in production where the amount of traffic is considerably more than in our integration environment. In our INT we cannot reproduce the issue.

Metadata

Metadata

Assignees

Labels

questionFurther information is requested

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions