Skip to content

Latest commit

 

History

History
53 lines (40 loc) · 4.25 KB

how-site-recovery-is-performed-by-technical-support.adoc

File metadata and controls

53 lines (40 loc) · 4.25 KB
permalink sidebar keywords summary
maintain/how-site-recovery-is-performed-by-technical-support.html
sidebar
storagegrid, site recovery, technical support, tech support
If an entire StorageGRID site fails or if multiple Storage Nodes fail, you must contact technical support. Technical support will assess your situation, develop a recovery plan, and then recover the failed nodes or site in a way that meets your business objectives, optimizes recovery time, and prevents unnecessary data loss.

How technical support recovers a site

If an entire StorageGRID site fails or if multiple Storage Nodes fail, you must contact technical support. Technical support will assess your situation, develop a recovery plan, and then recover the failed nodes or site in a way that meets your business objectives, optimizes recovery time, and prevents unnecessary data loss.

Caution
Site recovery can only be performed by technical support.

StorageGRID systems are resilient to a wide variety of failures, and you can successfully perform many recovery and maintenance procedures yourself. However, it is difficult to create a simple, generalized site recovery procedure because the detailed steps depend on factors that are specific to your situation. For example:

  • Your business objectives: After the complete loss of a StorageGRID site, you should evaluate how best to meet your business objectives. For example, do you want to rebuild the lost site in-place? Do you want to replace the lost StorageGRID site in a new location? Every customer’s situation is different, and your recovery plan must be designed to address your priorities.

  • Exact nature of the failure: Before beginning a site recovery, establish if any nodes at the failed site are intact or if any Storage Nodes contain recoverable objects. If you rebuild nodes or storage volumes that contain valid data, unnecessary data loss could occur.

  • Active ILM policies: The number, type, and location of object copies in your grid is controlled by your active ILM policies. The specifics of your ILM policies can affect the amount of recoverable data, as well as the specific techniques required for recovery.

    Caution
    If a site contains the only copy of an object and the site is lost, the object is lost.
  • Bucket (or container) consistency: The consistency applied to a bucket (or container) affects whether StorageGRID fully replicates object metadata to all nodes and sites before telling a client that object ingest was successful. If the consistency value allows for eventual consistency, some object metadata might have been lost in the site failure. This can affect the amount of recoverable data and potentially the details of the recovery procedure.

  • History of recent changes: The details of your recovery procedure can be affected by whether any maintenance procedures were in progress at the time of the failure or whether any recent changes were made to your ILM policies. Technical support must assess the recent history of your grid as well as its current situation before beginning a site recovery.

Caution
Site recovery can only be performed by technical support.

This is a general overview of the process that technical support uses to recover a failed site:

  1. Technical support:

    1. Makes a detailed assessment of the failure.

    2. Works with you to review your business objectives.

    3. Develops a recovery plan tailored for your situation.

  2. If the primary Admin Node if it has failed, technical support recovers it.

  3. Technical support recovers all Storage Nodes, following this outline:

    1. Replace Storage Node hardware or virtual machines as required.

    2. Restore object metadata to the failed site.

    3. Restore object data to the recovered Storage Nodes.

      Caution
      Data loss will occur if the recovery procedures for a single failed Storage Node are used.
      Note
      When an entire site has failed, technical support uses specialized commands to successfully restore objects and object metadata.
  4. Technical support recovers other failed nodes.

    After object metadata and data have been recovered, technical support uses standard procedures to recover failed Gateway Nodes, non-primary Admin Nodes, or Archive Nodes.

Related information

Site decommission