Skip to content

HDDS-15323. Add storage capacity planning administrator guide.#436

Open
jojochuang wants to merge 2 commits into
apache:masterfrom
jojochuang:HDDS-15323
Open

HDDS-15323. Add storage capacity planning administrator guide.#436
jojochuang wants to merge 2 commits into
apache:masterfrom
jojochuang:HDDS-15323

Conversation

@jojochuang
Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

HDDS-15323. Add storage capacity planning administrator guide.

Please describe your PR in detail:

  • Document datanode reserved space, full-disk thresholds, and inspection workflows. Link from the Datanodes architecture page.

What is the link to the Apache Jira?

https://issues.apache.org/jira/browse/HDDS-15323

How was this patch tested?

Screenshot 2026-05-19 at 12 27 44 PM

Document datanode reserved space, full-disk thresholds, and inspection
workflows. Link from the Datanodes architecture page.

Co-authored-by: Cursor <cursoragent@cursor.com>

- **Purpose**: Protects the Datanode from OS-level disk-full errors.
- **Configuration**:
- `hdds.datanode.volume.min.free.space`: A fixed value (e.g., `20GB`).
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jojochuang Should we also mention about volume min free space hard limit?

- Soft limit (min.free.space.percent, default 2%): used for what is reported to SCM in heartbeats as freeSpaceToSpare. This governs SCM placement decisions.
- Hard limit (min.free.space.hard.limit.percent, default 1.5%): used for local write rejection on the Datanode.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah yes. I missed these recent additions.

To see the status of individual disks on a specific Datanode:

```bash
ozone admin datanode usageinfo --uuid <datanode-uuid>
Copy link
Copy Markdown
Contributor

@Gargi-jais11 Gargi-jais11 May 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

uuid is deprecated.

Suggested change
ozone admin datanode usageinfo --uuid <datanode-uuid>
ozone admin datanode usageinfo --node-id <datanode-uuid>


### 3. Detailed Volume Info

To see the status of individual disks on a specific Datanode:
Copy link
Copy Markdown
Contributor

@Gargi-jais11 Gargi-jais11 May 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a bit misleading as datanode usageInfo it does not give individual disks info on specific Datanode.
Rather it should be,

Suggested change
To see the status of individual disks on a specific Datanode:
To see the aggregated capacity, usage, and remaining space for a specific Datanode:

| `hdds.datanode.volume.min.free.space.percent` | `0.02` (2%) | Percentage-based minimum free space. |
| `hdds.datanode.dir.du.reserved` | (unset) | Fixed bytes reserved for non-Ozone use. |
| `ozone.scm.container.size` | `5GB` | The target size for containers. |
| `hdds.datanode.storage.utilization.critical.threshold` | `0.95` (95%) | Threshold at which SCM logs a "Critical" space warning. |
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We also mention about warning utilisation threshold :

  <property>
    <name>hdds.datanode.storage.utilization.warning.threshold</name>
    <value>0.75</value>
    <tag>OZONE, SCM, MANAGEMENT</tag>
    <description>
      If a datanode overall storage utilization exceeds more than this
      value, a warning will be logged while processing the nodeReport in SCM.
    </description>
  </property>

Document soft/hard min free space limits, warning utilization threshold,
--node-id CLI flag, and clarify per-datanode usageinfo scope.

Co-authored-by: Cursor <cursoragent@cursor.com>
@jojochuang jojochuang requested a review from Gargi-jais11 May 21, 2026 20:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants