Diagnosing and resolving problems with fix procedures

You can use fix procedures to diagnose and resolve problems with the system.

About this task

For example, to repair a system, you might complete the following tasks:

  • Analyze the event log (if it is available), or view node errors.
  • Replace failed components.
  • Verify the status of a repaired device.
  • Restore a device to an operational state in the system.
  • Mark the error as fixed in the event log.

Fix procedures help simplify this process by automating as many of the tasks as possible.

The following example uses the management GUI to repair a system.

Procedure

Complete the following steps to start the fix procedure.

  1. Click Monitoring > Events and ensure that you are filtering the event log to display Recommended actions.

    The list might indicate that many errors must be repaired. If the list contains several errors, the error at the top of the list has the highest priority and must always be fixed first. If you do not fix the higher priority errors first, you might not be able to fix the lower priority errors.

  2. Select the error at the top of the list or select the Next recommended action.
  3. Click Run Fix Procedure.

    The pane displays the error code and provides a description of the condition.

  4. Click Next to go forward or Cancel to return to the previous pane.

    One or more panes might be displayed with instructions for you to replace parts or complete other repair activity.
  5. If you are not able to complete the actions now, click Cancel on each pane until you are returned to the Next recommended action pane. When you return to the fix procedures, you can restart the repair from step 1. After you complete all the instructions, click OK. When the last repair action is completed, the procedures might attempt to restore failed devices to the system.
  6. After you complete the fix, you see the statement "Click OK to mark the error as fixed." Click OK. This action marks the error as fixed in the event log and prevents this instance of the error from being listed again.
  7. When you see the statement "The repair has been completed.", click Exit. If other errors must be fixed, those errors are displayed and the fix procedures continue.

    If no errors remain, the following statement is displayed: "There are no unfixed errors in the event log." The statement indicates that no further repair procedures are necessary.
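
The ordering rule in this procedure (always fix the error at the top of the list, then move to the next one until no unfixed errors remain) can be illustrated with a short sketch. The following Python example is a simplified model only and is not part of the product. The Event class, its fields, the numeric priority values (a lower number is assumed to mean higher priority), and the error codes are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Event:
    """Hypothetical model of one event log entry."""
    sequence: int        # illustrative identifier for the log entry
    error_code: int      # illustrative value, not a real product error code
    priority: int        # assumption: lower number = higher priority
    fixed: bool = False


def next_recommended_action(events):
    """Return the highest-priority unfixed event, or None if none remain."""
    unfixed = [e for e in events if not e.fixed]
    return min(unfixed, key=lambda e: e.priority, default=None)


def run_fix_loop(events, repair):
    """Fix events in priority order.

    `repair` is a callable that returns True when the repair was completed
    and False when the user cancelled. Returns the sequence numbers of the
    events that were marked as fixed, in the order they were fixed.
    """
    fixed_order = []
    while (event := next_recommended_action(events)) is not None:
        if not repair(event):
            break  # cancelled: the repair restarts later from the top of the list
        event.fixed = True  # corresponds to marking the error as fixed in the log
        fixed_order.append(event.sequence)
    return fixed_order
```

For example, calling run_fix_loop with three events whose priorities are 3, 1, and 2 marks the priority 1 event as fixed first, then priority 2, then priority 3. If the repair callable returns False, the loop stops and the remaining events stay unfixed, which mirrors cancelling out of the fix procedure and restarting later from step 1.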

Results

While you are fixing hardware faults, the fix procedures might direct you to complete hardware actions that look like an error to the system (for example, replacing a drive). In these situations, the fix procedures enter maintenance mode automatically. New events are still entered into the event log when they occur. However, notifications for a specific set of events are not sent unless those events remain unfixed when you exit maintenance mode. Events that were recorded in maintenance mode are fixed automatically when the issue is resolved. In this way, maintenance mode prevents unnecessary messages from being sent.
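
The maintenance mode behavior that is described here can be modeled with a small sketch: events are always logged, but notifications for a particular set of events are held back and sent only if the events are still unfixed when maintenance mode ends. The following Python example is an illustration under stated assumptions, not the product implementation. The EventLog class, its method names, and the event names are hypothetical, and the set of events whose notifications are deferred is passed in because the source does not list it.

```python
class EventLog:
    """Hypothetical model of event logging with a maintenance mode."""

    def __init__(self, deferrable_codes):
        # Assumption: the caller supplies the set of events whose
        # notifications are held back during maintenance mode.
        self.deferrable = set(deferrable_codes)
        self.maintenance = False
        self.entries = []    # every event is logged here, in any mode
        self.deferred = []   # entries whose notification is held back
        self.sent = []       # codes for which a notification was sent

    def enter_maintenance(self):
        self.maintenance = True

    def record(self, code):
        """Log an event; notify now unless it is deferred by maintenance mode."""
        entry = {"code": code, "fixed": False}
        self.entries.append(entry)
        if self.maintenance and code in self.deferrable:
            self.deferred.append(entry)
        else:
            self.sent.append(code)
        return entry

    def exit_maintenance(self):
        """Leave maintenance mode and notify for deferred events still unfixed."""
        self.maintenance = False
        for entry in self.deferred:
            if not entry["fixed"]:
                self.sent.append(entry["code"])
        self.deferred.clear()
```

In this model, an event that is recorded during maintenance mode and then fixed before the mode ends never produces a notification, while one that is still unfixed at exit is notified at that point. Events outside the deferrable set are notified immediately in either mode.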