Troubleshooting
Troubleshooting procedures help you diagnose problems.
- How to get information, help, and technical assistance
If you need help, service, technical assistance, or just want more information about IBM® products, you will find a wide variety of sources available from IBM to assist you. - SAN Volume Controller user interfaces for servicing your system
The SAN Volume Controller provides a number of user interfaces to troubleshoot, recover, or maintain your system. The interfaces provide various sets of facilities to help resolve situations that you might encounter. - Starting statistics collection
The system collects statistics over an interval and creates files that can be viewed. - Event reporting
Events that are detected are saved in an event log. As soon as an entry is made in this event log, the condition is analyzed. If any service activity is required, a notification is sent, if you have set up notifications. - Understanding battery operations of a SAN Volume Controller 2145-DH8 node
The SAN Volume Controller 2145-DH8 nodes contain two replaceable battery units.Certain node modelss contain two replaceable battery units. - Understanding event codes
- Understanding the error codes
Error codes are generated by the event-log analysis and system configuration code. - Viewing the messages and codes
SAN Volume Controller messages and codes are returned for error conditions and operational status information. - Viewing logs and traces
The SAN Volume Controller clustered system maintains log files and trace files that can be used to manage your system and diagnose problems. - Resolving a problem with accessing the management GUI
If you are unable to connect to the management GUI from your web browser and received a Page not found or similar error, this information might help you resolve the issue. - Resolving a problem with SSL certificates
If you are unable to connect to the management GUI from your web browser and received a Certificate expired or similar error, this information might help you resolve the issue. - Resolving a problem with SSL/TLS clients
Changing the security level of the system might cause the web interface, CIM clients, and other SSL/TLS clients to stop working. If any clients stop working, complete the following procedure. - Resolving a problem with offloaded data transfer
When Microsoft offloaded data transfer (ODX) is enabled on a system, it is possible to encounter problems. These procedures help you address some common issues that might arise. - Debugging and performance-monitoring statistics for offloaded data transfer
Offloaded data transfer (ODX) infrastructure captures debug and monitoring information for specific ODX modules and makes it available in global and local views. - Resolving a problem with the SAN Volume Controller 2145-DH8 boot drives
Follow these resolution steps to resolve most problems with boot drives. - Resolving a problem with Microsoft Windows disk manager
Microsoft Windows disk manager might hang if the host attempts to write to a LUN that is a remote copy secondary volume on a SAN Volume Controller system. This procedure addresses this issue. - Procedure: Shutting down a HyperSwap site
You can shut down one site in a HyperSwap® topology system without hosts losing access to the HyperSwap volumes, if there is an up-to-date copy of each HyperSwap volume on the site that remains online. - Procedure: Making drives support protection information
You can use this procedure to migrate drives and arrays to pick up support for protection information. - Determining a hardware boot failure
During the hardware boot, you see progress messages, if the model has a front panel display. If the model does not have a front panel display, light path LEDs indicate a hardware boot failure. - Determining the failing enclosure or disk controller by using the CLI
You can use the command-line interface (CLI) to determine the failing enclosure or disk controller. - Resolving a problem with new expansion enclosures
Determine why a newly installed expansion enclosure was not detected by the system. - Diagnosing and resolving problems with fix procedures
You can use fix procedures to diagnose and resolve problems with the system. - Understanding the medium errors and bad blocks
A storage system returns a medium error response to a host when it is unable to successfully read a block. The SAN Volume Controller response to a host read follows this behavior. - Resolving insufficient memory with a distributed array
If the creation of a distributed array fails due to a lack of memory, complete the following procedure to resolve the insufficient memory. - RAID write response time
This feature means that the RAID software layer, where redundancy exists to do so, can prevent drive bad behavior from having an unlimited impact on I/O performance. In addition, the system tries to avoid immediately committing to an array rebuild due to a brief offline event from a single drive, while there is full redundancy. - Using the maintenance analysis procedures
The maintenance analysis procedures (MAPs) inform you how to analyze a failure that occurs with a SAN Volume Controller node. - iSCSI performance analysis and tuning
This procedure provides a solution for Internet Small Computer Systems Interface (iSCSI) host performance problems while connected to a SAN Volume Controller system and its connectivity to the network switch. - Debugging iSCSI backend sessions
Complete the steps in the following procedures to debug iSCSI backend sessions. - Procedure: SAN problem determination
You can solve problems on the SAN Volume Controller system and its connection to the storage area network (SAN). - Fibre Channel and 10G Ethernet link failures
You might need to replace the small form-factor pluggable (SFP) transceiver when a failure occurs on a single Fibre Channel or 10G Ethernet link (applicable to Fibre Channel over Ethernet personality enabled 10G Ethernet link). - Ethernet iSCSI host-link problems
If you are having problems attaching to the Ethernet hosts, your problem might be related to the network, the SAN Volume Controller system, or the host. - Fibre Channel over Ethernet host-link problems
Problems attaching to the Fibre Channel over Ethernet hosts might be related to the network, the SAN Volume Controller system, or the host. - Disaster recovery
Use these disaster recovery solutions for HyperSwap, Metro Mirror, Global Mirror, and Stretched System, where access to storage is still possible after the failure of a site. - Recover system procedure
The recover system procedure recovers the entire system if the system state is lost from all nodes. The procedure re-creates the system by using saved configuration data. The saved configuration data is in the active quorum disk and the latest XML configuration backup file. The recovery might not be able to restore all volume data. This procedure is also known as Tier 3 (T3) recovery. - Backing up and restoring the system configuration
You can back up and restore the configuration data for the system after preliminary tasks are completed.