Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KAFKA-15330: Add missing documentation of metrics introduced as part of KAFKA-15028 #14480

Merged
merged 2 commits into from Oct 4, 2023

Conversation

jolshan
Copy link
Contributor

@jolshan jolshan commented Oct 3, 2023

I've added details for VerificationFailureRate and VerificationTimeMs.

I considered adding the documentation for the AddPartitionsToTxnVerification metrics, but I noticed that all the request metrics simply listed Produce|FetchConsumer|FetchFollower. If we don't already report the AddPartitionsToTxn request metrics in this file, it doesn't make sense to add the verification variant. (As well as all the other APIs we report)

We can file a followup jira if we want to redo that whole section.

Once we have alignment here, I will cherrypick to 3.6 branch as well as the kafka-site repo.

Committer Checklist (excluded from commit message)

  • Verify design and implementation
  • Verify test coverage and CI build status
  • Verify documentation (including upgrade notes)

@divijvaidya
Copy link
Contributor

If we don't already report the AddPartitionsToTxn request metrics in this file, it doesn't make sense to add the verification variant. (As well as all the other APIs we report)
We can file a followup jira if we want to redo that whole section.

I do think that we need to have an exhaustive list of all metrics available in documentation. Otherwise user may not know existence of certain metrics unless they look into the code. We can choose to do it in a separate JIRA.

docs/ops.html Outdated
<tr>
<td>Rate of transactional verification errors</td>
<td>kafka.server:type=AddPartitionsToTxnManager,name=VerificationFailureRate</td>
<td>in steady state 0, but transient errors are expected during rolls and reassignments of the transactional state partition</td>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We didn't specify what the metric actually measures. Your PR description when you added it has a nice description asL
Rate of verifications that returned in failure either from the AddPartitionsToTxn API response or through errors in the AddPartitionsToTxnManager. In steady state this metric will report 0, but transient errors are expected during rolls and reassignments of the transactional state partition

@jolshan
Copy link
Contributor Author

jolshan commented Oct 4, 2023

Copy link
Contributor

@divijvaidya divijvaidya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! Thank you making the change Justine.

Feel free to merge after CI passes.

@jolshan
Copy link
Contributor Author

jolshan commented Oct 4, 2023

I will open the kafka-site PR shortly :)

@jolshan
Copy link
Contributor Author

jolshan commented Oct 4, 2023

Tests are unrelated 😓 It is a bit worrying we see this large number of failures in Java 11 from time to time.
I may follow up later.

@jolshan jolshan merged commit b5d880a into apache:trunk Oct 4, 2023
1 check failed
jolshan added a commit that referenced this pull request Oct 4, 2023
KAFKA-15028 (#14480)

I've added details for VerificationFailureRate and VerificationTimeMs.

I considered adding the documentation for the AddPartitionsToTxnVerification metrics, but I noticed that all the request metrics simply listed Produce|FetchConsumer|FetchFollower. If we don't already report the AddPartitionsToTxn request metrics in this file, it doesn't make sense to add the verification variant. (As well as all the other APIs we report)

Filed a followup jira if we want to redo that whole section.

Reviewers: Reviewers: Divij Vaidya <diviv@amazon.com>
k-wall pushed a commit to k-wall/kafka that referenced this pull request Nov 21, 2023
…of KAFKA-15028 (apache#14480)

I've added details for VerificationFailureRate and VerificationTimeMs.

I considered adding the documentation for the AddPartitionsToTxnVerification metrics, but I noticed that all the request metrics simply listed Produce|FetchConsumer|FetchFollower. If we don't already report the AddPartitionsToTxn request metrics in this file, it doesn't make sense to add the verification variant. (As well as all the other APIs we report)

Filed a followup jira if we want to redo that whole section.

Reviewers: Reviewers: Divij Vaidya <diviv@amazon.com>
mjsax pushed a commit to confluentinc/kafka that referenced this pull request Nov 22, 2023
…of KAFKA-15028 (apache#14480)

I've added details for VerificationFailureRate and VerificationTimeMs.

I considered adding the documentation for the AddPartitionsToTxnVerification metrics, but I noticed that all the request metrics simply listed Produce|FetchConsumer|FetchFollower. If we don't already report the AddPartitionsToTxn request metrics in this file, it doesn't make sense to add the verification variant. (As well as all the other APIs we report)

Filed a followup jira if we want to redo that whole section.

Reviewers: Reviewers: Divij Vaidya <diviv@amazon.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants