Resolving a problem with the SAN Volume Controller boot drives
Complete the following steps to resolve most problems with SAN Volume Controller boot drives.
Before you begin
The node serial number (also known as the product or machine serial number) is on the MT-M S/N label (Machine Type - Model and Serial Number label) on the front (left side) of the node. The node serial number is written to the system board and to each of the two boot drives during the manufacturing process.
When the SAN Volume Controller software starts, it reads the node serial number from the system board (by using the node serial number for the panel name) and compares it with the node serial numbers that are stored on the two boot drives.
- Unrecoverable node error 543: This error indicates that none of the node serial numbers that are stored in the three locations match. The node serial number from the system board must match with at least one of the two boot drives for the SAN Volume Controller software to assume that node serial number is good.
- Unrecoverable node error 545: This error indicates that the node serial numbers on each boot drive match each other but are not the same as the node serial number from the system board. In this case, the node serial number on the system board might be wrong or the node serial number on the boot drives might be wrong. For example, the system board that is changed or the boot drives come from another node.
- Node error 743: This error indicates that the node serial number cannot be read from one of the two boot drives because that drive failed, is missing, or is out of sync with the other boot drive.
- Node error 744: This error indicates that the node serial number from one of the boot drives identifies as belonging to a different node. If boot drives were swapped between drive slots 1 and 2, node error 744 is produced.
- Node error 745: This error indicates that a boot drive is found in an unsupported slot. This error occurs when at least one of the first two drives is online and at least one invalid slot (3-8) is occupied.
About this task
An event is displayed in the Monitoring > Events panel of the management GUI if the problem produces node error 743, 744, or 745. Run the fix procedure for that event. Otherwise, connect to the technician port to use the MT-M S/N label on the node to see the boot drive slot information and determine the problem.
- Do not swap boot drives between slots.
- Each boot drive has a copy of the VPD on the system board.
- Software upgrading is to one boot drive at a time to prevent failures during CCU.
Procedure
To resolve a problem with a boot drive, complete the following steps in order:
Replacing the system board:
When neither of the boot drives have usable SAN Volume Controller software:
For example, if you replace both of the boot drives from FRU stock at the same time, neither boot drive has usable SAN Volume Controller software. If the SAN Volume Controller software is not running, the node status, node fault, battery status, and battery fault LEDs remain off.
When every copy of the node serial number is lost:
For example, if you replace the system board and both of the boot drives with FRU stock at the same time, every copy of the node serial number is lost.
Results
The status of a drive slot is uninitialized only if the SAN Volume Controller software might not automatically initialize the FRU drive. This status can happen if the node serial number on the other boot drive does not match the node serial number on the system board. If the node serial number on the other boot drive matches the MT-M S/N label on the front that is left of the node, you can rescue the uninitialized boot drive from the other boot drive safely. Use the service assistant GUI or the satask rescuenode command to rescue the drive.