RAID failures rectification

Symptom

When applying the hardware configurations in the master and worker nodes, the container might move to an error state.

Cause

Unsuccessful Redfish calls or usage of servers not supporting Redfish API calls can cause RAID failure.

Action
  1. Check the logs of the failed container in the following directory /var/nps/logs/<topology_name>_<server_IP>_redfish_raid_<date>.log and look for one of the following errors:





  2. Log in to the BIOS of the failed servers and delete the logical drives.
  3. To perform RAID configurations from the NPS toolkit for the failed servers, follow any one of the following substeps.
    • To apply RAID configurations on all the servers, run the following command:
      nps deploy -s hw_prep -l debug -sl all -a raid
    • To apply RAID configurations on a single server or multiple servers, run the following command:
      nps deploy -s hw_prep -l debug -sl <Server iLO IP (or) Server iLO IP1, server iLO IP2,…> -a raid