MAP 5800: Light path

MAP 5800: Light path helps you to solve hardware problems that prevent the SAN Volume Controller 2145-DH8 from booting.

Before you begin

If you are not familiar with these maintenance analysis procedures (MAPs), first read Using the maintenance analysis procedures.

You might be sent here because of the following situations:

Light path for SAN Volume Controller 2145-DH8

Light path diagnostics is a system of LEDs on top of the operator-information panel of the SAN Volume Controller 2145-DH8 node, which leads you to the failed component.

About this task

When an error occurs, LEDs are lit along the front of the operator-information panel, the light path diagnostics panel, then on the failed component. By viewing the LEDs in a particular order, you can often identify the source of the error.

LEDs that are lit to indicate an error, remain lit when the server is turned off, if the node is connected to an operating power supply.

Ensure that the node is turned on, and then resolve any hardware errors that are indicated by the Error LED and light path LEDs:

Procedure

  1. Is the System error LED  7 , shown in Figure 1, on the SAN Volume Controller 2145-DH8 operator-information panel on or flashing?
    Figure 1. SAN Volume Controller 2145-DH8 operator-information panel
    SAN Volume Controller 2145-DH8 operator-information panel
    •  1  Power control button and LED.
    •  2  Ethernet LED.
    •  3  Locator button and LED.
    •  4  Release latch.
    •  5  Ethernet activity LEDs.
    •  6  Check log LED.
    •  7  System error LED.
    NO
    Reassess your symptoms and return to MAP 5000: Start.
    YES
    Go to step 2.
  2. (from step 1)
    Press the release latch, as shown in Figure 2, and open the light path diagnostics panel, which is shown in Figure 3.
    Figure 2. Press the release latch
    Press the release latch
    Are one or more LEDs on the light path diagnostics panel on or flashing?
    Figure 3. SAN Volume Controller 2145-DH8 light path diagnostics panel
    SAN Volume Controller 2145-DH8 light path diagnostics panel
    NO
    Verify that the operator-information panel cable is correctly seated at both ends. If the error LED is still illuminated but no LEDs are illuminated on the light path diagnostics panel, replace parts in the following sequence:
    1. Operator-information panel
    2. System board
    Verify the repair by continuing with MAP 5700: Repair verification.
    YES
    See Table 1 and complete the action that is specified for the specific light-path-diagnostics LEDs. Then, go to step 3. Some actions require that you observe the state of LEDs on the system board. Figure 4 shows the location of the system board LEDs. The fan LEDs are located next to each FAN. To view the LEDs, complete the following actions:
    1. Before you turn off the node, ensure that its data is mirrored and synchronized.
    2. Identify and label all the cables that are attached to the node so that they can be replaced in the same port. Remove the node from the rack and place it on a flat, static-protective surface.
    3. Remove the top cover.
    4. See Table 1 and complete the action that is specified for the specific light-path-diagnostics LEDs. Then, go to step 3.
    Figure 4. SAN Volume Controller 2145-DH8 system board LEDs.
    SAN Volume Controller 2145-DH8 system board LEDs
    Table 1. Diagnostics panel LEDs
    LED Description Action
    The Error log or Check log LED

    operator-information panel

    An error occurs and cannot be isolated without completing certain procedures.
    1. Plug in the VGA screen and the USB keyboard.
    2. Check the IMM2 system event log and the system-error log for information about the error.
    3. Save the log if necessary and clear the log afterward.
    System-error LED

    operator-information panel

    An error occurred.
    1. Check the light-path-diagnostics LEDs and follow the instructions.
    2. Check the IMM2 system event log and the system-error log for information about the error.
    3. Save the log if necessary and clear the log afterward.
    PS When only the PS LED is lit, a power supply failed. The system might detect a power supply error. Complete the following steps to correct the problem:
    1. Check the power-supply with a lit yellow LED.
    2. Make sure that the power supplies are seated correctly and plugged in a good AC outlet.
    3. Remove a power supply to isolate the failed power supply.
    4. Make sure that both power supplies installed in the server are of the same AC input voltage.
    5. Replace the failed power supply.
    PS + CONFIG

    When both the PS and CONFIG LEDs are lit, the power supply configuration is not valid.

    If the PS LED and the CONFIG LED are lit, the system logs an invalid power configuration error. Make sure that both power supplies installed in the node are of the same rating or wattage.
    OVER SPEC The system consumption reaches the power supply over-current protection point or the power supplies are damaged.
    1. If the power rail (A, B, C, D, E, F, G, and H) error was not detected, complete the following steps:
      1. Use the IBM Systems Energy Estimator to determine the current system power consumption. For more information, go to the following website:

        https://www-947.ibm.com/systems/support/tools/estimator/energy/index.html

      2. Replace the failed power supply.
    2. If the power rail (A, B, C, D, E, F, G, and H) error was also detected, follow actions that are listed in MAP 5040: Power SAN Volume Controller 2145-DH8.
    PCI An error occurred on a PCI bus or on the system board. Another LED is lit next to a failing PCI slot.
    1. Check the riser-card LEDs, the ServeRAID error LED, and the dual-port network adapter error LED to identify the component that caused the error.
    2. Check the system-error log for information about the error.
    3. If you cannot isolate the failing component by using the LEDs and the information in the system-error log, remove one component at a time. Then, restart the server after each component is removed.
    4. Replace the following components, in the order that is shown, restarting the server each time:
      • PCI riser cards
      • ServeRAID adapter
      • Network adapter
      • (Trained technician only) System board.
    5. If the failure remains, contact your IBM® service representative.
    NMI A nonmaskable interrupt occurred, or the NMI button was pressed.
    1. Check the system-error log for information about the error.
    2. Restart the server.
    CONFIG CONFIG + PS An invalid power configuration error occurred. If the CONFIG LED and the PS LED are lit, the system logs an invalid power configuration error. Make sure that both power supplies installed in the server are of the same rating or wattage.
    CONFIG + CPU A hardware configuration error occurred. If the CONFIG LED and the CPU LED are lit, complete the following steps to correct the problem:
    1. Check the microprocessors that were installed to make sure that they are compatible with each other.
    2. (Trained technician only) Replace the incompatible microprocessor.
    3. Check the system-error logs for information about the error. Replace any component that is identified in the error log.
    CONFIG + MEM A hardware configuration error occurred. If the CONFIG LED and the MEM LED are lit, check the system-event log in the Setup utility or IMM2 error messages.
    CONFIG + PCI A hardware configuration error occurred. If the CONFIG LED and the PCI LED are lit, check the system-error logs for information about the error. Replace any component that is identified in the error log.
    CONFIG + HDD A disk drive error occurred. If the CONFIG LED and the HDD LED are lit, check the system-error logs for information about the error. Replace any component that is identified in the error log.
    LINK Reserved.
    CPU When only the CPU LED is lit, a microprocessor failed. When both the CPU and CONFIG LEDs are lit, the microprocessor configuration is invalid.
    1. If the CONFIG LED is not lit, a microprocessor failure occurs, complete the following steps:
      1. (Trained technician only) Make sure that the failing microprocessor and its heat sink, which are indicated by a lit LED on the system board, are installed correctly.
      2. (Trained technician only) Replace the failing microprocessor.
      3. For more information, contact your IBM service representative.
    2. If the CONFIG LED and the CPU LED are lit, the system logs an invalid microprocessor configuration error. Complete the following steps to correct the problem:
      1. Check recently installed microprocessors to ensure that they are compatible with each other.
      2. (Trained technician only) Replace any incompatible microprocessor.
      3. Check the system-error logs for information about the error. Replace any component that is identified in the error log.
    MEM When only the MEM LED is lit, a memory error occurs.
    Note: Note: Each time that you install or remove a DIMM, you must disconnect the node from the power source; then, wait 10 seconds before you restart the server.
    If the CONFIG LED is not lit, the system might detect a memory error. Complete the following steps to correct the problem:
    1. Update the node firmware.
    2. Reseat or swap the DIMMs with lit LED.
    3. Check the system-event log in the Setup utility or IMM error messages.
    4. Replace the failing DIMM.
    MEM + CONFIG

    When both the MEM and CONFIG LEDs are lit, the memory configuration is not valid.

    If the MEM LED and the CONFIG LED are lit, check the system-event log in the Setup utility or IMM2 error messages.
    TEMP The system or the system component temperature exceeded a threshold level. A failing fan can cause the TEMP LED to be lit.
    1. Make sure that the heat sink is seated correctly.
    2. Determine whether a fan failed and replace the fan if necessary.
    3. Make sure that the room temperature is not too high. See the environment requirements for the server temperature information.
    4. Make sure that the air vents are not blocked.
    5. Make sure that the heat sink or the fan on the adapter, or any other network adapter is seated correctly. If the fan failed, replace it.
    6. For more information, contact your IBM service representative.
    FAN A fan is either failed, operating too slowly, or is removed. The TEMP LED might also be lit.
    1. Check whether your node is installed with the dual-port network adapter. If yes, make sure that your node compiles with the configuration with four fans installed.
    2. Reseat the failing fan, which is indicated by a lit LED near the fan connector on the system board.
    3. Replace the failing fan.
    BOARD An error occurred on the system board or the system battery.
    1. Check the LEDs on the system board to identify the component that caused the error. The BOARD LED can be lit due to any of the following reasons:
      • Battery
      • (Trained technician only) System board
    2. Check the system-error log for information about the error.
    3. Replace the failing component.
    HDD A hard disk drive that is failed or is missing.
    1. Check the LEDs on the hard disk drives for the drive with a lit status LED and reseat the hard disk drive.
    2. Reseat the hard disk drive backplane.
    3. If the error remains, replace the following components one at a time, in the order that is listed, restarting the server after each:
      1. Replace the hard disk drive.
      2. Replace the hard disk drive backplane.
    4. If the problem remains, contact your IBM service representative.
  3. Continue with MAP 5700: Repair verification to verify the correct operation.