SGI® InfiniteStorage FailSafe High Availability Cluster Administration
Length: 4.5 Days  |  Price: Price on Application

This course is designed to provide the system support engineer (SSE) with the knowledge and skills necessary to configure and maintain SGI's FailSafe 2.x High Availability (HA) cluster system, and to diagnose and repair failures associated with it.

IRIS FailSafe is the HA infrastructure solution that enables up to 8 nodes (any combination of SGI Origin® servers) to form a cluster that provides highly available services to clients connected to the cluster via standard networks such as Ethernet, FDDI, or ATM. The cluster is set up to detect failures quickly and take the necessary steps to minimize their impact. This is achieved in part by limiting the impact of a failure to the resources in a resource group. Recovery steps range from attempting to access the storage through an alternate path (in case of a storage access path failure) to failing over an application to another node in the cluster. IRIS FailSafe leverages IRIX kernel capabilities to detect and recover from failures at various levels, including disk failures, storage path failures, system failures, network failures, and application failures.

Topics Covered

  • FailSafe architecture and implementation
  • Configuration planning and basic FailSafe setup
  • Node and cluster configuration
  • FailSafe administration
  • Configuration testing
  • FailSafe plug-ins
  • Script writing basics
  • System upgrades
  • FailSafe cluster maintenance and troubleshooting

Objectives

Upon completion of this course, the student should be able to:

  • Install and configure FailSafe nodes and clusters
  • Verify and test resource and resource group failover
  • Perform routine FailSafe cluster administration
  • Write and modify scripts for a FailSafe environment
  • Perform upgrade and maintenance procedures
  • Troubleshoot FailSafe problem scenarios

Prerequisites

  • Non-GCS service personnel must possess a maintenance agreement
  • Familiarity with XLV or XVM Volume Managers
  • SGI Origin or Altix server administration experience
  • SGI TP9x00 or RM6x0 series storage system experience
  • SGI CXFS experience recommended but not required

Format

Instructor-led (lecture/lab)

Class Hours

Class starts at 9:00 AM. The course dates and times will be included in the confirmation notice you receive from SGI after you register.