Skip to content

Conversation

@micheleRP
Copy link
Contributor

@micheleRP micheleRP commented Jun 10, 2025

Description

This pull request introduces a new section to the documentation about cluster states, providing descriptions of various states for different cluster types (Serverless, BYOC, and Dedicated).

Documentation updates:

  • Added a new "Cluster State" page: This page explains the various states a cluster can be in, categorized by cluster types (Serverless, BYOC, Dedicated). It includes detailed descriptions of each state, such as "Creating," "Ready," "Suspended," and more.

  • Updated navigation to include the "Cluster State" page: Added a new entry under the "Cluster Maintenance" section in the navigation (modules/ROOT/nav.adoc).

Resolves https://redpandadata.atlassian.net/browse/DOC-241
Review deadline:

Page previews

Cluster State

Checks

  • New feature
  • Content gap
  • Support Follow-up
  • Small fix (typos, links, copyedits, etc)

@micheleRP micheleRP requested a review from a team as a code owner June 10, 2025 21:58
@netlify
Copy link

netlify bot commented Jun 10, 2025

Deploy Preview for rp-cloud ready!

Name Link
🔨 Latest commit 9f8b9d6
🔍 Latest deploy log https://app.netlify.com/projects/rp-cloud/deploys/687fff4567601300080e016c
😎 Deploy Preview https://deploy-preview-323--rp-cloud.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Jun 10, 2025

Important

Review skipped

Auto incremental reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

📝 Walkthrough

Walkthrough

A new documentation page, "Cluster State," has been added under the "Cluster Maintenance" section in the navigation structure. This page details the various possible states of a cluster managed by Redpanda Cloud, categorizing them by deployment type: Serverless, BYOC (Bring Your Own Cloud), and Dedicated. Each deployment type has a tabbed table listing all possible states and their descriptions. No changes were made to code, APIs, or existing documentation outside of these additions.

Sequence Diagram(s)

Assessment against linked issues

Objective Addressed Explanation
Document all possible cluster states for BYOC/Dedicated and Serverless, including definitions and descriptions (DOC-241)
Add a new documentation page before "Monitor Redpanda Cloud" for cluster states (DOC-241)
Include state names such as Unknown, Creating, Creating agent, Upgrading, Deleting, Deleted, Failed, Resizing, Unspecified, Placing, Ready, Unrecognized, Suspended, etc. (DOC-241)

Assessment against linked issues: Out-of-scope changes

No out-of-scope changes were found.


🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Explain this complex logic.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai explain this code block.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and explain its main purpose.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 3

🧹 Nitpick comments (1)
modules/manage/pages/cluster-maintenance/cluster-state.adoc (1)

21-21: Inconsistent state naming across tables
Serverless uses Unrecognized and Unspecified, while BYOC and Dedicated use Unknown. Consider unifying state terms or explaining why different terminology is used.

Also applies to: 41-41, 60-60

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4059278 and 8218a69.

📒 Files selected for processing (2)
  • modules/ROOT/nav.adoc (1 hunks)
  • modules/manage/pages/cluster-maintenance/cluster-state.adoc (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms (3)
  • GitHub Check: Redirect rules - rp-cloud
  • GitHub Check: Header rules - rp-cloud
  • GitHub Check: Pages changed - rp-cloud
🔇 Additional comments (1)
modules/ROOT/nav.adoc (1)

385-385: Navigation entry added under Cluster Maintenance looks correct
The new cluster-state.adoc link follows the existing pattern and indentation.

@micheleRP micheleRP requested a review from birdayz June 12, 2025 19:23
@micheleRP
Copy link
Contributor Author

@birdayz Would you please review and confirm that we should document these cluster states? Do users need to know how transitions between cluster states happen?

@micheleRP micheleRP requested a review from deniscoady June 18, 2025 18:43
|Failed |Cluster was unable to enter the ready state from either the creating or placing states.
|Placing |Cluster is in the process of being placed on a cell with sufficient resources in the data plane.
|Ready |Cluster is running and accepting external requests.
|Suspended |Cluster is running but blocks all external requests.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we mention suspension is mostly related to being out of credits keep me true on the suspension reason @nicolaferraro.

Copy link
Member

@nicolaferraro nicolaferraro Jul 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True, automated suspension (close to 100% of the cases) is related to credit exhaustion for trial accounts. Adding a credit card unlocks clusters that transition to the Ready state again.

[cols="1,4",options="header"]
|===
|State |Description
|Creating |Cluster is in the process of having its control plane state created.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not a native speaker but this reads a little off imo, could we simply say The Redpanda cluster has started the creation process

@micheleRP

This comment was marked as resolved.

Co-authored-by: Santiago Jimenez Giraldo <santiago@redpanda.com>
@sago2k8
Copy link
Contributor

sago2k8 commented Jul 16, 2025

Is Node operation in progress state internal only? Does it equal Upgrading?

Correct it is Upgrading, the public states are:

    STATE_CREATING_AGENT = 1;
    STATE_CREATING = 2;
    STATE_READY = 3;
    STATE_DELETING = 4;
    STATE_DELETING_AGENT = 5;
    STATE_UPGRADING = 6;
    STATE_FAILED = 7;
    STATE_SUSPENDED = 8;

Copy link
Contributor

@sago2k8 sago2k8 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some comments on the states, sorry for taking long to review.

|Deleting agent |Cluster is in the process of having its control plane state removed, and the Redpanda Cloud agent is being removed.
|Upgrading |Cluster is undergoing a rolling upgrade or a scale-up/scale-down operation.
|Failed |Cluster was unable to enter the Ready state from either the Creating or the Creating agent states.
|Suspended |Cluster is running but blocks all external requests. +
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not aware of this state in BYOC and Dedicated. @alenkacz, @birdayz, do you know?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes it's possible to be in suspended state.

Copy link
Member

@c4milo c4milo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good to me as initial documentation about cluster states but I'm honestly not super happy of how we designed this state machine, and customers will also notice.

@micheleRP
Copy link
Contributor Author

@sago2k8, or any reviewer: What action can a user take to get out of a failed state?

@sago2k8
Copy link
Contributor

sago2k8 commented Jul 21, 2025

@sago2k8, or any reviewer: What action can a user take to get out of a failed state?

Today, re-creating the cluster, there will be some details about what went wrong.

|Placing |Cluster is in the process of being placed on a cell with sufficient resources in the data plane.
|Ready |Cluster is running and accepting external requests.
|Deleting |Cluster is in the process of having its control plane state removed. Resources dedicated to the cluster in the data plane will be released.
|Failed |Cluster was unable to enter the ready state from either the Creating or Placing states. +
Copy link
Contributor

@paulohtb6 paulohtb6 Jul 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

where can I check what went wrong? If we know, we could provide this info here

|State |Description
|Creating |Cluster is in the process of having its control plane state created.
|Placing |Cluster is in the process of being placed on a cell with sufficient resources in the data plane.
|Ready |Cluster is running and accepting external requests.
Copy link
Contributor

@paulohtb6 paulohtb6 Jul 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do we mean exactly by external requests? Requests outside of the network?

Maybe a better question is, what is an internal request?

|Deleting |Cluster is in the process of having its control plane state removed. Resources dedicated to the cluster in the data plane are released.
|Deleting agent |Cluster is in the process of having its control plane state removed, and the Redpanda Cloud agent is being removed.
|Upgrading |Cluster is undergoing a rolling upgrade or a scale-up/scale-down operation.
|Failed |Cluster was unable to enter the Ready state from either the Creating or the Creating agent states. +
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
|Failed |Cluster was unable to enter the Ready state from either the Creating or the Creating agent states. +
|Failed |Cluster was unable to enter the *Ready* state from either the *Creating* or the *Creating agent states*. +

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

and again, if we know where the user can look, we should provide directions

Co-authored-by: Paulo Borges <paulohtb6@gmail.com>
@micheleRP micheleRP merged commit 988868e into main Jul 23, 2025
5 checks passed
@micheleRP micheleRP deleted the DOC-241-doc-all-possible-cluster-states branch July 23, 2025 02:11
@coderabbitai coderabbitai bot mentioned this pull request Jul 24, 2025
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants