As a user I'd like to replace failed drives in a RAID with new ones #737

Closed
therealprof opened this issue Jul 14, 2015 · 4 comments · Fixed by #1971

@therealprof

For some reason unknown to me, there does not seem to be an obvious way to replace a failed drive that is part of a RAID pool with a new one and resync the pool.

Steps to reproduce:

  1. Set up a RAID-1 pool
  2. Shut down the system
  3. Pull a drive
  4. Reboot the system
  5. Have a look around and discover that there's no indication of a degraded RAID and no way to remove the failed drive or add a new one in its place (a rough sketch of the manual btrfs fallback follows below)
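Until the UI grows support for this, here is a minimal sketch of how one might do the replacement by hand with stock btrfs tools. The pool label (mypool), mount point, device names, and devid are examples, not anything Rockstor-specific:

```sh
# Mount the pool degraded so btrfs tolerates the missing member
mount -o degraded LABEL=mypool /mnt2/mypool

# Find the devid of the missing device
btrfs filesystem show /mnt2/mypool

# Option A: replace the missing devid (2 here) with the new drive
btrfs replace start 2 /dev/sdc /mnt2/mypool
btrfs replace status /mnt2/mypool

# Option B: add the new drive, then drop the missing one (kicks off a rebuild)
btrfs device add /dev/sdc /mnt2/mypool
btrfs device delete missing /mnt2/mypool
```

Option A is usually preferable since it rebuilds directly onto the new drive; Option B forces a rebalance across the pool. Either way, none of this is reflected in Rockstor's own state, which is presumably part of why the UI offers no path for it.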

iFloris commented Jul 28, 2015

Just ran into a situation comparable to the one @therealprof describes here.
My steps were the following:

  • RAID-6 pool, running for a few weeks.
  • One drive failed.
  • Shares were no longer mountable.
  • A bunch of errors in the web UI, such as: Unknown internal error doing a GET to /api/pools?page=1&format=json&page_size=15&count= and Another pool(rockstor_rockstor) has a Share with this same name(home) as this pool(everyraid). This configuration is not supported. You can delete one of them manually with this command: btrfs subvol delete /mnt2/[pool name]/home
  • Deleted the dropped drive from the disk view.
  • Searched for, but did not find, a way to repair or resize the RAID pool.
  • Tried to delete the pool and start from scratch :(
  • The system complained that I need to delete the shares first.
  • Tried to delete the shares; the system complained the filesystem is in read-only mode.
  • Tried the suggested terminal commands to remove the shares; they errored out as well (see the sketch below).

End result, as in step 5 described by @therealprof: the filesystem gets stuck in read-only mode, the shares are inaccessible, and there is no (visible?) way to repair it.
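For reference, a rough sketch of the shell-level checks this situation seems to call for, assuming the pool mounts at /mnt2/everyraid as the error above suggests (the read-only state will likely block the delete):

```sh
# Show all btrfs filesystems and flag any missing devices
btrfs filesystem show

# Per-device error counters for the pool
btrfs device stats /mnt2/everyraid

# List the subvolumes (Rockstor shares) on the pool
btrfs subvolume list /mnt2/everyraid

# Delete the conflicting subvolume named in the error, if it is expendable
# (this fails while the filesystem is mounted read-only)
btrfs subvolume delete /mnt2/everyraid/home
```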

Next step: reinstall Rockstor from USB and start over.

@schakrava
Member

Thanks @iFloris and @therealprof for your detailed comments. I am holding off until the 4.2 kernel to test DR scenarios, including this one. The prediction is sometime mid-August. Once the behavior is consistent in the kernel, we can add support in Rockstor. @gkadillak is working on a useful alert framework, so it's all coming together. Thanks for your patience.


iFloris commented Jul 29, 2015

@schakrava Thanks, sounds great! Other than a slight inconvenience, it is not a problem for me, as I currently only use Rockstor to store backups from other machines and to do some testing. (Off topic: your last two sentences sounded like 80's A-Team Hannibal in my head.)

@therealprof
Author

@iFloris For me, Rockstor still feels quite immature for production use, so, like you, I'm making sure I have plenty of fresh backups around (remote, and local on external drives) so I can fully restore any valuable data quickly. That also let me work around the broken RAID issue: I simply removed everything manually from the database and started fresh from a backup.
