Replacing the main board: 2145-SV1

You can replace the main board on a 2145-SV1 node. All the components that were removed when you removed the main board are reused during the installation of the new main board.

Before you begin

DANGER
Multiple power cords. The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords. (L003)
Safety icons for multiple power cords

The machine serial number or node serial number is on the MT-M S/N label on the front of the 2145-SV1 node. It was also written to the main board and to each of the boot drives when the node was manufactured. When the system software starts, it reads the node serial number from the main board and uses the serial number as the panel ID for this node. The panel ID can be found in the service assistant GUI, management GUI, and the output of many command-line interface (CLI) commands.

If the main board is replaced with a FRU part, it has a machine serial number of 0000000. The 2145-SV1 node has a panel_id of 0000000. This value will not match the node serial number that is stored on each of the boot drives, causing node error 545. If copies of the node serial number on each boot drive do not match, the node error is 543. Use the following procedure to fix these node errors.

Ensure that the following items are available:
  • A VGA monitor and a USB keyboard.
  • Power cables for the node so that it can be turned on while out of the rack.
  • A computer with an Ethernet port and browser that can be directly connected to the technician port and provide access to the service assistant GUI. SSH-capable software is required to access the CLI (PuTTY).
  • Alcohol wipes and thermal grease are required to correctly replace the microprocessors. You must remove the microprocessors when you replace the main board.
Notes:
  • When you reassemble the components in the node, be sure to route all cables carefully so that they are not exposed to excessive pressure.
  • The SATA drive backplane and system board were cost-reduced and unused connectors were removed. You might find that only one SATA cable is needed to connect the backplane to the system board. Previously, a second SATA cable was provided to allow for a 5th and 6th boot drive, which the system did not need.

    After you replace the SATA drive backplane, the yellow LEDs on the backplane might blink if a cost-reduced version of the system board is installed in the 2145-SV1 system. However, the system board will continue to work properly. To prevent the LEDs from blinking, you can replace the backplane at a convenient time with the latest version of the FRU.

About this task

This service action assumes that the following conditions were met.

Perform the following steps to install the main board.

Procedure

  1. Align the main board at an angle, as shown in Figure 1.
    Figure 1. Replacing the 2145-SV1 main board
    Replacing the 2145-SV1 main board
  2. Carefully slide the main board forward until it can lay flat. Be careful to avoid the pegs ( 2  shown in Figure 2).
    Slide the main board back toward the rear of the server. Make sure that the rear connectors extend through the rear of the chassis.
  3. Reconnect the main board cables that you disconnected.
  4. Use the two screws ( 1  and  3  in Figure 2) to reattach the main board.
    Figure 2. Reattaching the main board on a 2145-SV1 node
    Reattaching the main board on a 2145-SV1 node
    •  1  Attachment screw 1
    •  2  Support pegs for back cover
    •  3  Attachment screw 2
  5. Reinstall the microprocessor and heat sink, as described in Replacing the microprocessor: 2145-SV1.
  6. Reinstall the DIMMs, as described in Replacing the memory modules: 2145-SV1.
  7. Reinstall the fan cage, as described in Replacing the fan bracket: 2145-SV1.
  8. Reinstall the air baffle, as described in Replacing the air baffle: 2145-SV1.
  9. Reinstall the power supply units, as described in Replacing a power supply: 2145-SV1.
  10. Reinstall the TPM, as described in Removing and replacing a Trusted Platform Module: 2145-SV1.
  11. Replace the PCI express riser-card assemblies, as described in Replacing a PCI express riser-card assembly: 2145-SV1.
  12. Make sure that all cables, adapters, and other components are installed and seated correctly and that you have not left loose tools or parts inside the node. Make sure that all internal cables are correctly routed. If you disconnected the Fibre Channel and Ethernet cables, make sure that each cable is reconnected to the same port from which it was removed.
  13. Replace the top covers, as described in Replacing the top covers: 2145-SV1.
  14. If you removed the node from the rack, replace it in the rack, as described in Replacing a node in a rack: 2145-SV1.
  15. If you removed any Fibre Channel, SAS, or Ethernet cables, use the labels that you placed on each cable to connect the cables to the same ports from which they were removed.
  16. Replace the power cords. The node powers on when the cords are reconnected.
  17. Wait for the node status LEDs to remain stable for at least 5 minutes before you take any further action.
    This procedure might take up to 2 hours to complete.
    Notes:
    • If the node status, node fault, and battery status LEDs remain off for more than 5 minutes, attach a monitor and a USB keyboard to change the default boot order.
    • If the repair was successful, the node fault LED is on and node error 545 is seen in the service assistant GUI.
      • Node error 545 means that the node serial number on the main board, which is used for the panel_id, does not match the node serial number of each of the two boot drives.
      • Use the service assistant GUI or the sainfo lsbootdrive CLI command to confirm that.
        • The node serial number on the main board is 0000000 (that is, seven zeros) shown as the panel_id.
        • The node serial number for each boot drive slot is the same as the number that is on the MT-M S/N label on the front of this node.
      • If the previous two conditions were met, use the service assistant GUI or the following CLI command to change the node serial number on the main board:
        satask chvpd -type 2145-SV1 -serial <the S/N value on the MT-M S/N label>
      • The node reboots.
      • If there are no node errors, the node starts and rejoins the system if it was previously in the system. If the node rejoined the system, the node status LED is on.
    • If node error 543 is displayed, check the following items:
      • When the machine serial number on the main board is 0000000, node error 543 means that the copies of the node serial number on each boot drive do not match. This error might occur when the node serial number cannot be read from the boot drives because it is missing.
      • Use the service assistant GUI or the sainfo lsbootdrive CLI command to see the state of each boot drive slot.
        For example, if the output from the sainfo lsbootdrive shows:
        • The node number on the main board is 0000000 (that is, seven zeros) shown as the panel_ID.
        • The node serial number for one boot drive slot is the same as that found on the MT-M S/N label on the front of this node.
        • The status of the other boot drive slot is uninitialized.
        Use the service assistant GUI or the following CLI command to initialize the uninitialized boot drive only if the three previous conditions were met.
        satask rescuenode
        • The node reboots.
        • Node error 545 is displayed for this node in the service assistant GUI.
        • Write the node serial number.
    • If the repair was successful but the node was not able to save its state data before it shuts down, the node displays node error 578. Follow the procedures in Deleting a node from a clustered system by using the management GUI to delete the node from the cluster. Then, add it back into the cluster. If more than one node has failed, ensure that each node is added back into its original I/O group.