HDDS-14501. [Website v2] [Docs] [Administrator Guide] Replacing Datanode Disks #285

Gargi-jais11 · 2026-01-25T14:40:43Z

What changes were proposed in this pull request?

https://ozone-site-v2.staged.apache.org/docs/administrator-guide/operations/disk-replacement/datanodes

A datanode may have multiple data volumes, specified in hdds.datanode.dir. For example,

/data1,/data2,/data3
hdds.datanode.failed.data.volumes.tolerated: The number of data volumes that are allowed to fail before a datanode stops offering service. By default, this value is -1, meaning unlimited.

Similarly, hdds.datanode.failed.metadata.volumes.tolerated allows a number of metadata volumes to fail.

During datanode startup, it performs check to determine if a volume fails. If the datanode is allowed to continue without abort, the volume is taken off. After datanode starts, a periodic disk check is run every 60 minutes (determined by configuration property hdds.datanode.periodic.disk.check.interval.minutes.

When a volume is determined failed, it is chosen by volume choosing policy to allocate new containers.

To replace the failed disks, shut down the datanode, update hdds.datanode.dir to remove it from the directory list, and then restart the datanode.

note: Ozone datanode does not support hotswap yet, meaning to update the disk list, it must restart the datanode process.

The state of volumes can be seen in Datanode metrics and web UI.

Also did some more add ons.

What is the link to the Apache Jira?

https://issues.apache.org/jira/browse/HDDS-14501

How was this patch tested?

Check off which of the following tests were done on this change. If additional testing was done, please elaborate here as well.

The CI checks on my fork are passing
I verified the rendered content using a local preview
I manually verified the steps provided in this change work as described

jojochuang

+1 looks correct to me. I think this is good enough in any case. We can revisit later if there are minor issues.

replacing datanode disk page writeup

9582eef

jojochuang approved these changes Jan 26, 2026

View reviewed changes

jojochuang merged commit 7bc7394 into apache:HDDS-9225-website-v2 Jan 26, 2026
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HDDS-14501. [Website v2] [Docs] [Administrator Guide] Replacing Datanode Disks #285

HDDS-14501. [Website v2] [Docs] [Administrator Guide] Replacing Datanode Disks #285

Uh oh!

Gargi-jais11 commented Jan 25, 2026 •

edited

Loading

Uh oh!

jojochuang left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

HDDS-14501. [Website v2] [Docs] [Administrator Guide] Replacing Datanode Disks #285

HDDS-14501. [Website v2] [Docs] [Administrator Guide] Replacing Datanode Disks #285

Uh oh!

Conversation

Gargi-jais11 commented Jan 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

What is the link to the Apache Jira?

How was this patch tested?

Uh oh!

jojochuang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Gargi-jais11 commented Jan 25, 2026 •

edited

Loading