HyperSwap system configuration details
You can create an IBM® HyperSwap® topology system configuration where each I/O group in the system is physically on a different site. When used with active-active relationships to create HyperSwap volumes, these configurations can be used to maintain access to data on the system when power failures or site-wide outages occur.
In a HyperSwap configuration, each site is defined as an independent failure domain. If one site experiences a failure, then the other site can continue to operate without disruption. You must also configure a third site to host a quorum device or IP quorum application that provides an automatic tie-break in case of a link failure between the two main sites. The main site can be in the same room or across rooms in the data center, buildings on the same campus, or buildings in different cities. Different kinds of sites protect against different types of failures.
- If each site is a different power phase within a single location or data center, the system can survive the failure of any single power domain. For example, one node can be placed in one rack installation and the other node can be in another rack. Each rack is considered a separate site with its own power phase. In this case, if power was lost to one of the racks, the partner node in the other rack might be configured to process requests. The partner node can effectively provide availability to data even when the other node is offline due to a power disruption.
- If each site is a different physical location, the system can survive the failure of any single location. These sites can span shorter distances, for example two sites in the same city, or they can be spread farther geographically, such as two sites in separate cities. If one site experiences a site-wide disaster, the remaining site can remain available to process requests.
If configured properly, the system continues to operate after the loss of one site. The key prerequisite is that each site contains one or more I/O groups that are used to access a copy of the HyperSwap volume. In the management GUI, the Modify System Topology wizard simplifies setting up HyperSwap system topology. After you configure HyperSwap topology, you can use the Create Volumes wizard to create HyperSwap volumes and copies for each site. In addition, the HyperSwap volume wizard automatically creates active-active relationships and change volumes to manage replication between sites. If you are configuring HyperSwap by using the command-line interface, you must also configure the system topology, volumes, and active-active relationships separately.
A HyperSwap system locates the active quorum disk at a third site. If communication is lost between the primary and secondary sites, the site with access to the active quorum disk continues to process transactions. If communication is lost to the active quorum disk, an alternative quorum disk at another site can become the active quorum disk.
A system of nodes can be configured to use up to three quorum disks. However, only one quorum disk can be elected to resolve a situation where the system is partitioned into two sets of nodes of equal size. The purpose of the other quorum disks is to provide redundancy if a quorum disk fails before the system is partitioned.
An alternative configuration can use an extra Fibre Channel switch at the third site with connections from that switch to the primary site and to the secondary site.
A HyperSwap system configuration is supported only when the storage system that hosts the quorum disks supports extended quorum. Although the system can use other types of storage systems for providing quorum disks, access to these quorum disks is always through a single path.
For quorum disk configuration requirements, see the technote Guidance for Identifying and Changing Managed Disks Assigned as Quorum Disk Candidates.