This chapter contains the following topics:
“About Performing a New Installation and Configuring the Software on an SGI ICE X System”
“Configuring the Operating System on the System Admin Controller (SAC) Node”
“(Conditional) Configuring External Domain Name Service (DNS) Servers ”
“(Optional) Configuring the Intel Data Center Manager (DCM) Software Development Kit (SDK)”
“(Conditional) Installing the SGI Management Center License Key ”
“Completing the Intel Data Center Power Manager (DCM) Software Development Kit (SDK) Configuration”
“Synchronizing the Software Repository, Installing Software Updates, and Cloning the Images”
“Configuring the Rack Leader Controllers (RLCs) and Service Nodes with the discover Command”
“(Optional) Configuring a Backup Domain Name Service (DNS) Server ”
SGI installs operating system software on each SGI ICE X system before factory shipment occurs. The topics in this chapter include the additional procedures that you need to complete in order to configure the system for your site.
If you want to completely reinstall the operating system and all other software, the topics in this chapter enable you to complete that task. For example, you might need to reinstall the operating system to meet site requirements or to recover a system in case of a disaster.
Figure 3-1 depicts the software installation process.
Table 3-1 shows the installation and configuration procedures to follow if you want to install the SGI ICE X system from scratch. In this case, you reinstall the operating system on the nodes and configure everything yourself.
Table 3-1. SGI ICE X System Installation and Configuration Process
Step | Task | See |
|---|---|---|
1 | Plan the boot slots and partitions. | |
2 | Prepare to install the SGI ICE X software. | |
3 | (Conditional) Configure a static address for the baseboard management controller (BMC) on the system admin controller (SAC). Perform this step only if your site practices require a static IP on the BMC. | |
4 | (Optional) Configure a highly available system admin controller (SAC) or a highly available rack leader controller (RLC). | |
5 | Boot the system. | |
6 | Install the operating system on the system admin controller (SAC) node. You can install either the SUSE Linux Enterprise Server (SLES) or Red Hat Enterprise Server (RHEL) operating system. | “Configuring the Operating System on the System Admin Controller (SAC) Node” |
7 | Run the cluster configuration tool. Complete the initial cluster configuration tasks, which include the following:
| |
8 | (Conditional) Configure external domain name service (DNS). If you want to configure network address translation, you also need to configure an external DNS. | “(Conditional) Configuring External Domain Name Service (DNS) Servers ” |
9 | (Optional) Start the Intel Data Center Manager (DCM) Configuration | “(Optional) Configuring the Intel Data Center Manager (DCM) Software Development Kit (SDK)” |
10 | Sync the repository updates, apply the latest patches to the newly installed software, and clone the images. | “Synchronizing the Software Repository, Installing Software Updates, and Cloning the Images” |
11 | (Conditional) Download the Intel Manycore Platform Software Stack (MPSS). | |
12 | Configure the switches. | |
13 | Use the discover command to install and configure the rack leader controller and service node software. | “Configuring the Rack Leader Controllers (RLCs) and Service Nodes with the discover Command” |
14 | (Optional) Configure a backup domain name service (DNS) server on a service node. | “(Optional) Configuring a Backup Domain Name Service (DNS) Server ” |
15 | Configure the InfiniBand subnetworks. | |
16 | Configure optional features. |
On a multiple-slot SGI ICE X system, the system admin controller (SAC), the rack leader controller (RLC), and the service nodes all have the same disk layout. If you insert an operating system installation disk, power-on the SAC, and type install at the boot: prompt, the default behavior is for the system to create two slots and to write the initial installation to slot 1. After the system is installed, you cannot increase the number of slots without destroying the data on the disks. You can configure up to five slots, for a total of five root/boot directory pairs.
One feature of a multiple-boot system is that you can install different operating systems, or different operating system versions, into different slots. You can boot the system with the operating system of your choice. This configuration might be useful if you want to test an operating system or other software. The following are some other characteristics of single-boot systems and multiple-boot systems:
| Multiple-boot | Single-boot |
| RLCs and service nodes boot from their own disk. Data is retained in the master boot record (MBR). | RLCs and service nodes boot from the boot partition in the slot that is currently configured as the boot slot. Only the SAC retains data in the MBR. |
| RLC and service node software is reinstalled from the SAC. | Software on the RLCs and service nodes is reinstalled over the network. |
| As you increase the number of slots, you decrease the amount of disk space per slot. SGI recommends a minimum of 100 GB per slot. | A single slot uses all available disk space. |
After an upgrade, your SAC might have a partition layout that differs from the layouts shown in this topic.
Procedure 3-1. To plan the boot slots and disk partitions
Select your boot parameters and plan the disk partitions.
The following topics describe the boot parameters you can specify, disk partitioning, and operations such as cloning:
Proceed to the following:
The SGI Tempo 2.9.0 release partition layout uses the GUID partition table (GPT) and a new boot system, which is GRUB version 2. In previous Tempo versions, the partition layout was the MSDOS layout, and the boot system was GRUB version 1. These changes affect system operations in the following ways:
If your SGI ICE X system is new, then the Tempo 2.9.0 software is the first Tempo version installed on this system. All system disks use the new partition layout. You can use the cadmin command, as shown in the SGI documentation, to manage all the slots.
If you upgrade a slot from a previous Tempo release to the Tempo 2.9.0 release, note the following:
If you upgrade any slot in the system admin controller (SAC) to Tempo 2.9.0, the SAC converts the boot loader installed in the master boot record (MBR) to GRUB version 2. This boot loader chooses a slot when you boot the SAC.
If you do not upgrade all the slots to Tempo 2.9.0, the only administrative action you can perform on the slots with the older Tempo software is to boot the slot. You can boot the slots that host older versions of Tempo at the console or by selecting the slot on the GRUB version 2 boot menu. Because the SAC MBR is now upgraded to GRUB version 2, you can no longer use the cadmin command to manage any slots that host older versions of Tempo.
You can no longer install Tempo releases older than Tempo 2.9.0 on any of the system's slots. If you perform any future, from-scratch installations, these installations must be to Tempo 2.9.0 or later. You can, however, use the updatetempo upgrade script to upgrade slots to releases other than Tempo 2.9.0.
Disks with the new GPT layout can exist on the system alongside disks with the MSDOS partition layout. On an SGI ICE X system newly installed with Tempo 2.9.0, the software configures the GPT layout on all system disks and all disks that you clear, except for the compute nodes. After you upgrade one or more slots to Tempo 2.9.0, if you add new system disks or you clear existing system disks, the Tempo software configures the GPT layout on system disks that you add and on all disks that you clear, except for the compute nodes.
For SAC, rack leader controller (RLC), and service nodes, when you install Tempo 2.9.0 with blank system disks, or with system disks that are cleared, Tempo 2.9.0 uses the new GPT layout. If the disks have the old MSDOS partition table format, Tempo continues to use the old MSDOS partition table format until you manually clear the disks and reinstall (if desired).
If you need to downgrade an SGI ICE X system from Tempo 2.9.0 to a previous release, see the following:
“Installing SGI Tempo Versions Older than 2.9.0” in Chapter 5
When you use the SGI system admin controller (SAC) installation DVD to boot an SGI ICE X system, you can specify parameters at the boot: prompt. By default, if you type install at the boot prompt and the installer detects a SAC with exactly one blank disk, the installer partitions the SAC with two slots, and the installer writes the initial installation to slot 1.
If you specify more than one boot parameter, use a space to separate each parameter.
The following list shows the result of pressing Enter at the boot prompt and shows the result of the different boot options:
| Parameter | ||
Effect | ||
| (None) | ||
If you do not specify any boot options, that is, if you press Enter at the boot: prompt without specifying any boot parameters, the installer creates two slots. It installs a root directory on only one slot, however. In this case, the software repository is not configured, so when you run the configure-cluster command, you need to use the GUI tool to manually enter the distribution ISO images in the /tftpboot directory on the SAC. You can use the second slot for a clone or preproduction testing. | ||
| console=specs | ||
Specify console characteristics. Use when you connect to the compute node through a serial console. By default, this is set to ttyS1,38400n8. The BIOS console and the IPMI console settings are configured in the SAC BIOS. If you configure the SAC BIOS to be different from the default, use this boot option to specify your console's characteristics. | ||
| destructive=1 | ||
Permits partitioning operations that are potentially destructive. Use this parameter only if you want to repartition a disk that contains data. Use in conjunction with the re_partition_with_slots parameter. If the system encounters data in a partition, nothing destructive happens unless you also specify the destructive=1 parameter. | ||
| install | ||
Creates two slots on each compute node and installs the operating system from the DVD to slot 1. Both slots contain a root directory, and the installer writes the software distribution ISO images to each slot. This is the recommended boot option for an initial configuration. | ||
| install_slot=slot | ||
Specifies the slot into which the operating system is installed first. Specify 1 (default), 2, 3, 4, or 5 for slot. By default, slot 1 is installed first. To direct the initial installation to a slot other than slot 1, specify this parameter. If you specify a slot that appears to have data on it, no repartitioning is performed unless you also specify destructive=1. Example: You specify install_slot=2 to direct the first installation into slot 2. In this case, after the boot completes, type the following cadmin command to set the default slot to slot 2:
In this example, if you do not specify the default slot (slot 2) during the first boot of the SAC, then the SAC attempts to boot from an empty slot 1. The boot fails. If this happens, restart the boot, and select the install slot during the boot. For information about how to manage or change the boot slot, see the SGI ICE X Administration Guide . | ||
| netinst=path | ||
Specifies the NFS path to an ISO image for a network installation. | ||
| re_partition_with_slots=slots | ||
Partitions the SAC system drive with between one and five slots. Specify 1, 2 (default), 3, 4, or 5 for slots. The SAC system drive must be blank in order for this parameter to have an effect. If the SAC system drive contains data and you want to reconfigure the system, also specify the destructive=1 parameter. For example, the following parameter creates five slots:
The installer creates partitions on the rack leader controllers (RLCs) and service nodes to mimic the SAC. If an RLC or service node is discovered to have a slot count that does not match the SAC, the system reinitializes the partitions on the RLC and service nodes. Likewise, if you change the number of slots on the SAC, the system updates the disk partitioning on the RLC and service nodes, too. | ||
| rescue=1 | ||
Creates a troubleshooting environment. | ||
| serial | ||
Configures the system for serial console operations for the installation and later operations. If you do not specify serial, the system sends output to the VGA. Also see vga in this list. | ||
| vga | ||
Configures the system so that you can use the VGA screen for the installation and later operations. If you press Enter at the boot prompt, the result is the same as if you had typed vga. Also see serial in this list. | ||
Table 3-2 shows the partition layout for a one-slot SGI ICE X system. This layout yields one boot partition. If you configure a single-slot system and later decide to add another partition, the addition process destroys all the data on your system.
Table 3-2. Partition Layout for a Single-boot SGI ICE X System
Partition | File System Type | File System Label | Notes |
|---|---|---|---|
1 | ext3 | sgidata | Contains slot information. On the SAC, contains GRUB version 2 data for choosing root slots at boot time. |
2 | swap | sgiswap | Swap partition. |
3-10 | N/A | N/A | Reserved for future use. |
11 | ext3 | sgiboot1 | Slot 1 /boot partition. |
12-30 | N/A | N/A | Reserved for future use. |
31 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot1 | Slot 1 / partition. |
Table 3-3 shows the partition layout for a two-slot SGI ICE X system. This layout yields two boot partitions.
Table 3-3. Partition Layout for a Dual-boot SGI ICE X System (Default Layout)
Partition | File System Type | File System Label | Notes |
|---|---|---|---|
1 | ext3 | sgidata | Contains slot information. On the SAC, contains GRUB version 2 data for choosing root slots at boot time. |
2 | swap | sgiswap | Swap partition. |
3-10 | N/A | N/A | Reserved for future use. |
11 | ext3 | sgiboot1 | Slot 1 /boot partition. |
12 | ext3 | sgiboot2 | Slot 2 /boot partition. |
13-30 | N/A | N/A | Reserved for future use. |
31 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot1 | Slot 1 / partition. |
32 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot2 | Slot 2 / partition. |
Table 3-4 shows the partition layout for a five-slot SGI ICE X system. This layout yields five boot partitions.
Table 3-4. Partition Layout for a Quintuple-boot SGI ICE X System
Partition | File System Type | File System Label | Notes |
|---|---|---|---|
1 | ? | sgidata | Contains slot information. On the SAC, contains GRUB version 2 for choosing root slots at boot time. |
2 | swap | sgiswap | Swap partition. |
3-10 | N/A | N/A | Reserved for future use. |
11 | ext3 | sgiboot1 | Slot 1 /boot partition. |
12 | ext3 | sgiboot2 | Slot 2 /boot partition. |
13 | ext3 | sgiboot3 | Slot 3 /boot partition. |
14 | ext3 | sgiboot4 | Slot 4 /boot partition. |
15 | ext3 | sgiboot5 | Slot 5 /boot partition. |
16-30 | N/A | N/A | Reserved for future use. |
31 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot1 | Slot 1 / partition. |
32 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot2 | Slot 2 / partition. |
33 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot3 | Slot 3 / partition. |
34 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot4 | Slot 4 / partition. |
35 | ext3 on SAC, service nodes. XFS on RLCs. | sgiroot5 | Slot 5 / partition. |
The following procedure explains the information you need to obtain before you begin working with the SGI ICE X system. Your installation session can proceed more quickly if you gather information before you begin.
Procedure 3-2. To prepare for an installation
Contact your site's network administrator to obtain network information.
Obtain the following information to use when you configure the baseboard management controller (BMC):
(Optional) The current IP address of the BMC on the system admin node (SAC). You can set the BMC address from a serial console if you do not have this information.
The address you want to set for the BMC.
The netmask you want to set for the BMC.
The default gateway you want to set for the BMC.
Your network administrator can provide an IP address, a hostname, or a fully qualified domain name (FQDN) for each of the preceding addresses.
Obtain the following information to use when you configure the network for the SGI ICE X system:
Hostname
Domain name
IP address
Netmask
Default route
Root password
Obtain the following information about your site's house network:
IP addresses of the domain name servers (DNSs)
Familiarize yourself with the boot parameters, and determine which boot parameters you want to use.
You can configure your SGI ICE X system to boot from one, two (default), three, four, or five partitions. This enables you to configure your SGI ICE X system as either a single-boot computer system or as a multiple-boot computer system. A multiple-boot computer system has two or more partitions, so it has more than one root directory (/) and more than one boot directory (/boot). In an SGI ICE X system, these root and boot directories are paired into multiple slots . A multiple-slot disk layout is also called a cascading dual-root layout or a cascading dual-boot layout.
The installation procedure explains how to create a default, two-slot SGI ICE X system and directs you to use the install boot parameter. If you want to create only one slot, or if you want to create three or more slots, you need to specify different boot parameters.
The installer creates the same disk layout on all nodes. For more information about boot parameters, disk layouts, and so on, see “Planning the Boot Slots and the Disk Partitions”.
(Optional) Obtain the MAC file for your system from your SGI representative.
The MAC file contains MAC address information for the nodes. If you have these addresses, the node discovery process can complete more quickly.
Perform the procedure in this topic only if your site practices require a static IP address for the BMC.
When you set the IP address for the BMC on the SAC, you ensure access to the SAC when the site DHCP server is inaccessible. This procedure is required if you want to enable high availability. If you want to configure a highly available SAC, make sure to perform this topic's procedure on the BMCs on each of the two SACs.
The following procedures explain how to set a static IP address.
Procedure 3-4. Method 2 -- To change the IP address from the SAC
Log into the SAC as the root user.
Type the following command to retrieve the current network settings:
# ipmitool lan print 1 |
In the output from the preceding command, look for the IP Address Source line and the IP Address line.
For example:
IP Address Source : DHCP Address IP Address : 128.162.244.59 |
Note the IP address in this step and decide whether or not this IP address is acceptable. The rest of this procedure explains how to keep this IP address or to set a different static IP address.
Type the following command to specify that you want the BMC to have a static IP address:
# ipmitool lan set 1 ipsrc static |
This step specifies that the IP address on the BMC is a static IP address, and this step sets the IP address to the IP address that is currently assigned to the BMC. If you want to set the IP address to a different IP address, proceed to the following step. If the current IP address is acceptable, you do not need to perform the next step.
(Optional) Set a different IP address.
Perform this step if you want to set the static IP address to be different from the IP address that is set currently.
Type ipmitool commands in the following format:
ipmitool lan set 1 ipaddr ip_addr ipmitool lan set 1 netmask netmask ipmitool lan set 1 defgw gateway |
The arguments are as follows:
| Argument | Specification |
| ip_addr | The IP address you want to assign to the BMC. |
| netmask | The netmask you want to assign to the BMC. |
| gateway | The gateway you want to assign to the BMC. |
For example, you can type the following commands to set the IP address to 100.100.100.100:
# ipmitool lan set 1 ipaddr 100.100.100.100 # ipmitool lan set 1 netmask 255.255.255.0 # ipmitool lan set 1 defgw 128.162.244.1 |
Proceed to one of the following:
If you want to configure a highly available SAC, proceed to the following:
If you want to configure a traditional SAC, proceed to the following:
SGI supports the ability to configure the SAC and rack leader controllers (RLCs) as highly available nodes in an SGI ICE X system. If you want to enable high availability (HA) on the SAC or on the RLCs, contact your SGI representative.
The following procedure explains how to boot the system and begin the installation.
Procedure 3-5. To boot the system
Power-on the SAC.
As Figure 3-2 shows, the power-on button is on the right of the SAC.
(Optional) Configure the system so that you can perform the installation from a VGA screen and can perform later operations from a serial console.
If you want to enable this capability, perform the following steps:
Use a text editor to open file /boot/grub/menu.lst .
Search the file for the word kernel at the beginning of a line.
Add the following to the kernel line: console=type.
For example:
kernel /boot/vmlinuz-2.6.16.56-0.12-smp root=/dev/disk/by-label/sgiroot console=ttyS1,38400n8 splash=silent showopts |
Add the console=type parameter to the end of every kernel line. By default, this is set to ttyS1,38400n8. You might have ttys2, for example.
Later, if you want to access the SAC from only a VGA, you can remove the console= parameters.
Insert the SGI Admin Node Autoinstallation DVD into the DVD drive on the SAC.
The autoinstallation message appears, and at the end is the boot: prompt.
At the boot: prompt, type install and, optionally, other boot parameters.
“Planning the Boot Slots and the Disk Partitions” explains the other, optional, boot parameters.
Monitor the installation. This can take several minutes.
Remove the operating system installation DVD.
At the # prompt, type reboot .
This is the first boot from the SAC's hard disk.
(Optional) Suppress log messages.
If you want to suppress the SAC's log message output to the screen during the boot, edit file /etc/syscontrol.conf and add the following line to the top of the file (line 1):
kernel.printk = 2 4 1 7 |
In the preceding kernel.printk line, the spaces between the numbers 2 4 1 7 are Tab characters.
Proceed to the following:
“Configuring the Operating System on the System Admin Controller (SAC) Node”
The SGI ICE X platform supports both the Red Hat Enterprise Linux (RHEL) operating system and the SUSE Linux Enterprise Server (SLES) and operating system. Use one of the following procedures to install your operating system software on the SAC node:
This section describes how to configure Red Hat Enterprise Linux 6 on the SAC.
Procedure 3-6. To install RHEL 6 on an SGI ICE X SAC
Connect to the system admin controller (SAC) by one of the following methods:
Through the intelligent platform management interface (IMPI) tool
Through the console attached to the SGI ICE X system
Through a separate keyboard, video display terminal, and mouse
Use a text editor, such as vi or vim, to open file /etc/sysconfig/network-scripts/ifcfg-eth0 .
Add lines for the IPADDR, NETMASK , and NETWORK values appropriate for your public (house) network to file /etc/sysconfig/network-scripts/ifcfg-eth0 .
For example:
IPADDR=128.162.244.88 NETMASK=255.255.255.0 NETWORK=128.162.244.0 |
Save and close file /etc/sysconfig/network-scripts/ifcfg-eth0 .
Use a text editor to create file /etc/sysconfig/network .
Add the following three lines to file /etc/sysconfig/network :
NETWORKING=yes HOSTNAME=SAC_hostname GATEWAY=gateway_IP_address |
For SAC_hostname, type the hostname you want to assign to the SAC.
For gateway_IP_address, type the IP address of the gateway for your house network.
For example:
NETWORKING=yes HOSTNAME=my-system-admin GATEWAY=128.162.244.1 |
Save and close file /etc/sysconfig/network.
Use a text editor to open file /etc/hosts.
Add a line in the following format to file /etc/hosts :
SAC_IP SAC_FQDN SAC_hostname |
The variables in the preceding line are as follows:
For SAC_IP, type the IP address of the SAC.
For SAC_FQDN, type the fully qualified domain name (FQDN) of the SAC.
For SAC_hostname, type the hostname of the SAC.
For example, add the following line:
128.162.244.88 my-system-admin.domain-name.mycompany.com my-system-admin |
Save and close file /etc/hosts.
Type the following command to set the SAC hostname:
# hostname SAC_hostname |
For SAC_hostname, type the hostname of the SAC.
For example:
# hostname my-system-admin |
Use a text editor to create file /etc/resolv.conf .
Add lines to file /etc/resolv.conf that specify the search domain and the domain name service (DNS) servers at your site.
Later in the configuration process, when you run the cluster configuration tool, the tool uses the DNS servers you specify in this step for its defaults.
Specify lines with the following format:
search search_domain nameserver name_server_IP nameserver name_server_IP |
The following is an example resolv.conf file:
search mydomain.com nameserver 192.168.0.1 nameserver 192.168.0.25 |
Type the following nscd(8) command to force the invalidation of the name service cache daemon:
# nscd -i hosts |
Type the following commands, in the order shown, to restart services:
# /etc/init.d/network restart # /etc/init.d/rpcbind start # /etc/init.d/nfslock start |
Type the following command to retrieve the SAC's current time zone information:
# strings /etc/localtime | tail -1 CST6CDT,M3.2.0,M11.1.0 |
The previous output shows the SAC set to US Central time. If the output you see is not correct for this SGI ICE X system, perform the following steps:
Type the following command to change to the directory that contains the time zone configuration files:
# cd /usr/share/zoneinfo |
Select a file from that directory that describes the time zone for the SAC.
Type the following commands to enable the new time zone configuration file.
For example:
# /bin/cp -l /usr/share/zoneinfo/time_zone_file /etc/localtime.$$ # /bin/mv /etc/localtime.$$ /etc/localtime |
For time_zone_file, type the name of the time zone file that you need from the /usr/share/zoneinfo directory.
For example, type the following commands to change the SAC's time zone to US Pacific time:
# /bin/cp -l /usr/share/zoneinfo/PST8PDT /etc/localtime.$$ # /bin/mv /etc/localtime.$$ /etc/localtime |
Type the following command to confirm the time zone:
# strings /etc/localtime | tail -1 PST8PDT,M3.2.0,M11.1.0 |
(Conditional) Edit file /etc/ntp.conf to direct requests to the network time protocol (NTP) server at your site.
Complete the following steps if you want to direct requests to your site's NTP server instead of to the public time servers of the pool.ntp.org project:
Use a text editor to open file /etc/ntp.conf .
Insert a pound character (#) into column 1 of each of each line that includes rhel.pool.ntp.org.
| Note: Do not edit or remove entries that serve the cluster networks. |
At the end of the file, add a line that points to your site's NTP server.
The following is an example of a correctly edited file:
# Use public servers from the pool.ntp.org project. # Please consider joining the pool (http://www.pool.ntp.org # server 0.rhel.pool.ntp.org # server 1.rhel.pool.ntp.org # server 2.rhel.pool.ntp.org server ntp.mycompany.com |
The preceding output has been truncated at the right for inclusion in this guide.
Type the following command to restart the NTP server:
# /etc/init.d/ntpd restart |
(Conditional) Type a tilde character (~) and then a period character (.) to exit from the IPMI tool.
Perform this step if you connected to the system through the IPMI tool.
(Optional) Configure the system so that you can perform the installation from a VGA screen and can perform later operations from a serial console.
If you want to enable this capability, perform the following steps:
Use a text editor to open file /boot/grub/menu.lst .
Search the file for the word kernel at the beginning of a line.
Add the following to the kernel line: console=type.
For example:
kernel /boot/vmlinuz-2.6.16.56-0.12-smp root=/dev/disk/by-label/sgiroot console=ttyS1,38400n8 splash=silent showopts |
Add the console=type parameter to the end of every kernel line. By default, this is set to ttyS1,38400n8. You might have ttys2, for example.
Later, if you want to access the SAC from only a VGA, you can remove the console= parameters.
Proceed to the following:
The SLES YaST2 interface enables you to install the SLES operating system on the SGI ICE X system. To navigate the YaST2 modules, use key combinations such as the following:
The Tab key moves the cursor forward, and the Shift + Tab keys move the cursor backward.
The arrow keys move the cursor up, down, left, and right.
To use shortcuts, press the Alt key + the highlighted letter.
Press Enter to complete or confirm an action.
Press Ctrl + L to refresh the screen.
For more information about navigation, see Appendix A, “YaST2 Navigation”.
The following procedure explains how to use YaST2 to install SLES 11 on an SGI ICE X system.
Procedure 3-7. To install SLES 11 on an SGI ICE X SAC
Connect to the system admin controller (SAC) by one of the following methods:
Through the intelligent platform management interface (IMPI) tool
Through the console attached to the SGI ICE X system
Through a separate keyboard, video display terminal, and mouse
On the Language and Keyboard Layout screen, complete the following steps:
Select your language
Select your keyboard layout
Select Next.
On the Welcome screen, select Next.
On the Hostname and Domain Name screen, complete the following steps:
Type the hostname for this SGI ICE X system.
Type the domain name.
Clear the box next to Change Hostname via DHCP. The box appears with an X in it by default, but you need to clear this box.
Select Assign Hostname to Loopback IP. Put an X in this box.
Select Next.
On the Network Configuration screen, complete the following steps:
Select Change. A pop-up window appears.
On the pop-up window, choose Network Interfaces.
On the Network Settings screen, complete the following steps:
Highlight the first network interface card that appears underneath Name.
Select Edit.
On the Network Card Setup screen, specify the SAC's house/public network interface.
Figure 3-3 shows the Network Card Setup screen.
Complete the following steps:
Select Statically Assigned IP Address. SGI recommends a static IP address, not DHCP, for the SAC.
In the IP Address field, type the system's IP address.
In the Subnet Mask field, type the system's subnet mask.
In the Hostname field, type the system's fully qualified domain name (FQDN). SGI requires you to type an FQDN, not the system's shorter hostname, into this field. For example, type mysystem-admin.mydomainname.com. Failure to supply an FQDN in this field causes the configure-cluster command to fail.
Select Next.
You can specify the default route, if needed, in a later step.
On the Network Settings screen, complete the following steps:
Select Hostname/DNS.
In the Hostname field, type the system's fully qualified domain name (FQDN).
In the Domain Name field, type the domain name for your site.
Put an X in the box next to Assign Hostname to Loopback IP.
In the Name Servers and Domain Search List , type the name servers for your house network.
Back at the top of the screen, select Routing .
The Network Settings > Routing screen appears.
In the Default Gateway field, type your site's default gateway.
Select OK.
On the Network Configuration screen, click Next.
The Saving Network Configuration screen appears and saves your configuration.
On the Clock and Time Zone screen, complete the following steps:
Select your region.
Select your time zone.
(Optional) In the Hardware Clock Set To field, choose Local Time or accept the default of UTC.
Select Next.
This step synchronizes the time in the BIOS hardware with the time in the operating system. Your choice depends on how the BIOS hardware clock is set. If the clock is set to GMT, which corresponds to UTC, your system can rely on the operating system to switch from standard time to daylight savings time and back automatically.
On the Password for System Administrator “root” screen, complete the following steps:
In the Password for root User field, type the password you want to use for the root user.
This password becomes the root user's password for all the nodes on the ICE X system. These nodes are as follows:
SAC
Rack leader controller (RLC)
Service nodes
Compute nodes (blades)
In the Confirm password field, type the root user's password again.
In the Test Keyboard Layout field, type a few characters.
For example, if you specified a language other than English, type a few characters that are unique to that language. If these characters appear in this plain text field, you can use these characters in passwords safely.
Select Next.
On the User Authentication Method screen, select one of the authentication methods and select Next.
Typically, users accept the default (Local).
On the New Local User screen, create additional user accounts or select Next.
If you do not create additional users, select Yes on the Empty User Login warning pop-up window, and select Next.
On the Installation Completed screen, select Finish.
Type a tilde character (~) and then a period character (.) to exit from the IPMI tool.
Log into the SAC, open file /etc/hosts within a text editor, and verify that the SAC's fully qualified domain name (FQDN) and hostname are entered correctly.
For example, the following /etc/hosts file entry contains the correct data in the three required fields and is correct for a SAC with an IP address of 100.100.100.100, an FQDN of mysystem-admin.mydomain.com, and a hostname of mysystem-admin:
100.100.100.100 mysystem-admin.mydomain.com mysystem-admin |
Make sure that the /etc/hosts file on the SAC contains the required information. If it does not, edit the /etc/hosts file to contain the three required fields as the preceding example shows.
Confirm that the system is working as expected.
If necessary, restart YaST2 to correct settings.
(Optional) Configure the system so that you can perform the installation from a VGA screen and can perform later operations from a serial console.
If you want to enable this capability, perform the following steps:
Use a text editor to open file /boot/grub/menu.lst .
Search the file for the word kernel at the beginning of a line.
Add the following to the kernel line: console=type.
For example:
kernel /boot/vmlinuz-2.6.16.56-0.12-smp root=/dev/disk/by-label/sgiroot console=ttyS1,38400n8 splash=silent showopts |
Add the console=type parameter to the end of every kernel line. By default, this is set to ttyS1,38400n8. You might have ttys2, for example.
Later, if you want to access the SAC from only a VGA, you can remove the console= parameters.
Proceed to the following:
The cluster configuration tool enables you to configure, or reconfigure, your SGI ICE X system. The procedure in this topic explains the general, required configuration steps for SGI ICE X systems. If your SGI ICE X system includes optional components, or if your site has specific requirements, later procedures explain how to use the cluster configuration tool to create a more customized environment.
The following procedure explains how to complete the following required cluster configuration steps:
Create repositories for software installation files and updates.
Install the system admin node (SAC) cluster software.
Configure the cluster subdomain and examine other network settings. The cluster subdomain is likely to be different from the eth0 domain on the SAC itself.
Configure the NTP server.
Install the cluster's software infrastructure. This step can take 30 minutes.
Configure the house network's DNS resolvers.
Procedure 3-8. To run the cluster configuration tool
Locate your site's SGI software distribution DVDs or verify the path to your site's online software repository.
You can install the software from either physical media or from an ISO on your network.
From the VGA screen, or through an ssh connection, log into the system admin controller (SAC) as the root user.
SGI recommends that you run the cluster configuration tool either from the VGA screen or from an ssh session to the system admin controller (SAC). Avoid running the configure-cluster command from a serial console.
Type the following command to start the cluster configuration tool:
# /opt/sgi/sbin/configure-cluster |
On the cluster configuration tool's Initial Configuration Check screen, select OK on the initial window.
Figure 3-4 shows the initial window.
The cluster configuration tool recognizes a configured cluster. If you start the tool on a configured SGI ICE X system, it opens into the Main Menu.
On the Initial Cluster Setup screen, select OK on the screen.
Figure 3-5 shows the window.
On the Initial Cluster Setup screen, select R Repo Manager: Set Up Software Repos, and click OK.
Figure 3-6 shows the Initial Cluster Setup screen with the task menu. This procedure guides you through the tasks you need to perform for each of the menu selections on the Initial Cluster Setup screen.
The next few steps create software repositories for the initial installation packages and for updates. You need to create repositories for the following software:
The operating system software, either RHEL or SLES
SGI Foundation Suite
SGI Tempo
Additional software packages for which you hold licenses, such as the Message Passing Toolkit, the SGI Performance Suite, and any others
On the One or more ISOs were embedded on the ... screen, select Yes.
On the Repositories are created ... screen, press Enter.
On the You will now be prompted to add additional media ... screen, select OK.
On the Would you like to register media with Tempo? ... screen, select Yes.
On the Please either insert the media in your DVD drive ... screen, select either Insert DVD or Use Custom path/url.
Proceed as follows:
To install the software from DVDs, perform the following steps:
Insert a DVD.
Select Mount inserted DVD.
On the Media registered successfully with crepo ... screen, select OK, and eject the DVD.
On the Would you like to register media with Tempo? ... screen, select Yes if you have more software that you need to register.
If you select Yes, repeat the preceding tasks in this sequence for the next DVD.
If you select No, proceed to the next step.
To install the software from a network location, perform the following steps:
Select Use custom path/URL.
On the Please enter the full path to the mount point or the ISO file ... screen, type the full path in server_name: path_name/iso_file format. This field also accepts a URL or an NFS path. Select OK after typing the path.
On the Media registered successfully with crepo ... screen, select OK.
On the Would you like to register media with Tempo? ... screen, select Yes if you have more software that you need to register.
If you select Yes, repeat the preceding tasks in this sequence for the next DVD.
If you select No, proceed to the next step.
On the Initial Cluster Setup Tasks screen, select I Install Admin Cluster Software, and select OK.
This step installs the cluster software that you wrote to the repositories.
On the Initial Cluster Setup Tasks screen, select N Network Settings, and select OK.
On the Cluster Network Settings screen, select S Configure Subnet Addresses, and select OK.
On the Warning: Changing the subnet IP addresses ... screen, click OK.
Review the settings on the Subnet Network Addresses screen, and modify these settings only if absolutely necessary.
Figure 3-7 shows the Subnet Network Addresses screen. This screen displays the default networks and netmasks that reside within the SGI ICE X cluster.
If you accept the defaults, select OK.
If you do not accept the defaults, you can change the network settings. For example, it is possible that your site has existing networks or conflicting network requirements. For additional information about the IP address ranges, see Appendix B, “Subnetwork Information”. Complete the following steps if you need to change the network settings:
Highlight the setting you want to change, and select OK.
Type in a new IP address, and select OK.
Press Enter.
On the Update Subnet Addresses screen, the Head Network field shows the SAC's IP address. SGI recommends that you do not change the IP address of the SAC or rack leader controllers (RLCs) if at all possible. You can change the IP addresses of the InfiniBand network (IB0 and IB1) to match the IP requirements of the house network, and then select OK.
On the Cluster Network Settings screen, select D Configure Cluster Domain Name, and select OK.
On the Please enter the domain name for this cluster pop-up window, type the domain name, and select OK.
The domain you type becomes a subdomain to your house network.
For example, type ice.americas.sgi.com.
On the Cluster Network Settings screen, select Back.
On the Initial Cluster Setup screen, select T Configure Time Client/Server (NTP), and select OK.
Configure your NTP server.
On the subsequent screens, you set the SAC as the time server to the SGI ICE X system. For this step, the installer screens differ on RHEL platforms and SLES platforms.
On RHEL platforms, complete the following step:
On the A new ntp.conf has been put in to position ... screen, select OK.
On SLES platforms, complete the following steps:
On the A new ntp.conf has been put in to position ... screen, select OK.
Use the YaST interface and the SLES documentation to guide you through the NTP configuration.
On the This procedure will replace your ntp configuration file ... screen, select Yes.
On the Initial Cluster Setup Tasks menu, select S Perform Initial Admin Node Infrastructure Setup , and select OK.
On the A script will now perform the initial cluster ... screen, select OK.
This step runs a series of scripts that configure the SAC on the SGI ICE X system. The scripts also create the root images for the RLCs, service nodes, and compute nodes. The scripts run for approximately 30 minutes. At the end, the script issues a line that includes install-cluster completed in its output.
The final output of the script is as follows:
/opt/sgi/sbin/create-default-sgi-images Done! |
The output of the mksiimage commands are stored in a log file at the following location:
/var/log/cinstallman |
On the Initial Cluster Setup Complete window, select OK.
On the One or more ISOs were embedded on the admin install DVD and copied to ..., screen, select OK.
On the Initial Cluster Setup menu, select D Configure House DNS Resolvers, and select OK.
Figure 3-8 shows the Configure House DNS Resolvers screen.
The system autopopulates the values on the Configure House DNS Resolvers screen to match the DNS specifications on the SAC. The DNS resolvers you specify here enable the service nodes to resolve host names on your network. You can set the DNS resolvers to the same name servers used on the SAC itself.
Perform one of the following actions:
To accept these settings, select OK, and then select Yes.
To change the settings, type in different IP addresses, select OK, and then select Yes.
To disable house network resolvers, select Disable House DNS.
On the Setting DNS Forwarders to ... screen, select Yes.
On the Initial Cluster Setup screen, select Back.
This action returns you to the cluster configuration tool main menu.
On the Main Menu, select S Configure Switch Management Network (optional), and select OK.
On an SGI ICE X system, the switch management network enables the Ethernet switch to control all VLANs and trunking.
On the pop-up window that appears, select Y yes , and select OK.
Figure 3-9 shows the selection pop-up window:
(Conditional) On the Main Menu, select N Configure MCell Network (optional), and select OK.
Perform this step if your SGI ICE X system contains MCells.
(Conditional) On the screen that appears, select Y yes, and select OK.
Perform this step if your SGI ICE X system contains MCells.
Select Quit.
Type the cattr list -g command to verify the features you configured with the cluster configuration tool.
The following example output is appropriate for an SGI ICE X system with MCells. If your system does not include MCells, the mcell_network value should display no. The output is as follows:
# cattr list -g global redundant_mgmt_network : yes switch_mgmt_network : yes mcell_network : yes discover_skip_switchconfig : no max_rack_irus : 8 blademond_scan_interval : 120 my_sql_replication : no rack_vlan_start : 101 rack_vlan_end : 1100 head_vlan : 1 mcell_vlan : 3 replication_file : mysql-bin.000132 replication_position : 6580967 |
If you need to respecify any global values, start the cluster configuration tool again, and correct your specifications. To start the cluster configuration tool, type the following command:
# /opt/sgi/sbin/configure-cluster |
Proceed to one of the following:
To configure one or more external Domain Name Service (DNS) servers, proceed to “(Conditional) Configuring External Domain Name Service (DNS) Servers ”.
To enable the Intel Data Center Manager (DCM) Software Development Kit (SDK), proceed to “(Optional) Configuring the Intel Data Center Manager (DCM) Software Development Kit (SDK)”.
Perform the procedure in this section if you want to enable network address translation (NAT) gateways for the SGI ICE X system. A later procedure explains how to configure NAT on one of your service nodes. If you want to enable NAT, perform the procedure in this topic at this time.
When external DNS and NAT are enabled, the host names for the compute nodes (blades) in the cluster resolve through external DNS servers. The compute nodes need to be able to reach your house network.
| Note: You cannot configure this feature after you run the discover command. If you configure this feature after you run the discover command, the IP addresses assigned previously on the configured nodes remain. |
The following procedure explains how to configure external DNS servers.
Procedure 3-9. To configure external DNS servers
Obtain a large block of IP addresses from your network administrator.
This feature requires you to reserve a block of IP addresses on your house network. If you want to use external DNS servers, all nodes on the InfiniBand networks, both the ib0 and ib1 networks are included. The external DNS is enabled to provide addresses for all rack leader controllers (RLCs), all service nodes, and all compute nodes.
Through an ssh connection, log into the system admin controller (SAC) as the root user.
Type the following command to start the cluster configuration tool:
# /opt/sgi/sbin/configure-cluster |
Select E Configure External DNS Masters (optional) , and select OK.
On the This option configures SGI Tempo to look up the IP addresses for the InfiniBand networks from external DNS servers ... screen, select Yes.
On the Enter up to five external DNS master IPs screen, type the IP addresses of up to five external DNS servers on your house network, and select OK.
On the Setting external DNS masters to ip_addr, select Yes.
Proceed to one of the following:
To enable the Intel Data Center Manager (DCM) Software Development Kit (SDK), proceed to the following:
“(Optional) Configuring the Intel Data Center Manager (DCM) Software Development Kit (SDK)”.
To synchronize the software repository, proceed to the following:
“Synchronizing the Software Repository, Installing Software Updates, and Cloning the Images”.
The Intel DCM SDK enables you to monitor and manage energy consumption in your data center. This feature is easier to enable if you have not yet installed the SGI Management Center (SMC) license key because in this situation, the system creates DCM entries in the /opt/sgi/sgimc/etc/site-site.profile for you. If you want to enable DCM on a production system, however, you can use this procedure because it includes steps that explain how to enable DCM on a deployed system with installed license keys.
The following procedures explain how to configure the DCM SDK:
The following procedure explains how to verify your SGI ICE X environment and determine if you can enable the DCM SDK.
Procedure 3-10. To verify your environment
Verify that your SGI ICE X system supports the following:
The Intel Intelligent Power Node Manager (IPNM)-capable compute nodes. Consult your SGI technical support representative if you are unsure.
The SMC DCM plugin
If you are unsure about the preceding requirements, contact your SGI representative or review your hardware specifications. You can configure DCM at any time, but it is easier to configure at this point in the installation process because you have not yet installed the SGI Management Center (SMC) license key.
Verify that your SGI Management Center (SMC) license includes both of the following keys:
A premium license
A power opt license
If you are installing a completely new SGI ICE X system, there is no SMC license installed on the system at this time, and you need information about what your license includes, contact your SGI representative to verify your licenced features.
If you are enabling the DCM feature on a production SGI ICE X system that already has an SMC license installed upon it, use the lk_verify command to verify your system's license key. The following example shows these keys:
# lk_verify
1 /etc/lk/keys.dat:007 product=SGIMC, version=1.000, count=0, begDate=1374482126, \
expDate=1375765199, licenseID=0, key=Iym9+MynzDO2R0ABCcqJjgrXYqgu6rwp, \
info='SMC 1.7 PREM SVR/CL',attr='ST=CL NC=0 LC=24FF', \
vendor='Silicon Graphics International',ref_id='279071'
Verdict: SUCCESS.
2 /etc/lk/keys.dat:018 product=SGIMC, version=1.000, count=0, begDate=1373396727, \
expDate=1381208399, licenseID=b9e99f78, key=xybn6FAs4rx2UoC7WzclQElAQ7DVbc+6, \
info='SGIMC POWER OPT',attr='LC=0200', vendor='Silicon Graphics International', \
ref_id='278444'
Verdict: SUCCESS. |
In the preceding lk_verify output, license 1 is the premium license, and license 2 shows the power option in the info and attr fields.
(Conditional) Back up the current DCM configuration files on the system admin controller (SAC) and on all rack leader controller (RLC) nodes.
Perform this step if you already have the DCM SDK installed on this SGI ICE X system and you are reinstalling the DCM SDK. This can be the case if you are troubleshooting an incomplete installation.
The steps are as follows:
Log into the SAC or into one of the RLCs as the root user.
Type the following command to stop the DCM daemon:
# service mcdcm stop |
Type the following command to create a scratch directory and write the existing DCM configuration files to that directory:
# mkdir scratch; cp -a /opt/sgi/mcdcm/conf /scratch/mcdcm-conf |
Type the following command to remove the DCM package:
# rpm -e mcdcm |
Type the following command to remove any existing initialization scripts:
# rm -rf /etc/init.d/dcm* /etc/init.d/mcdcm* |
Type the following command to remove the mcdcm daemon:
# rm -rf /opt/sgi/mcdcm |
Repeat the preceding steps on the other computers in the SGI ICE X cluster.
Type the following commands to install the SMC DCM package:
# cinstallman --yum-node --node admin install mcdcm # cinstallman --yum-node --node r1lead install mcdcm |
If you have multiple RLCs, repeat the preceding command on other RLCs as needed.
On the SAC, type the following command to verify the installation:
# cat /opt/sgi/mcdcm/logs/dcmcmdstatus.log |
The command returns information such as the following if the installation was successful:
Success. You can now start the database server using: [...] waiting for server to start.... done server started |
On the SAC, type the following commands to change localhost to admin in the system files:
# sed -i 's/localhost/admin/g' \ /opt/sgi/mcdcm/conf/web.config.xml # sed -i 's/localhost/admin/g' \ /opt/sgi/mcdcm/external/apache-tomcat/webapps/DCMWsdl/internal.wsdl # sed -i 's/localhost/admin/g' \ /opt/sgi/mcdcm/external/apache-tomcat/webapps/DCMWsdl/dcm.wsdl |
Note that the preceding commands use the backslash character ( \) because the command continues to a second line.
On each RLC, type the following commands to change localhost to admin in the system files:
# pdsh -g leader 'sed -i "s/localhost/${HOSTNAME}/g" \
/opt/sgi/mcdcm/conf/web.config.xml'
# pdsh -g leader 'sed -i "s/localhost/${HOSTNAME}/g" \
/opt/sgi/mcdcm/external/apache-tomcat/webapps/DCMWsdl/internal.wsdl'
# pdsh -g leader 'sed -i "s/localhost/${HOSTNAME}/g" \
/opt/sgi/mcdcm/external/apache-tomcat/webapps/DCMWsdl/dcm.wsdl' |
Note that the preceding commands use the backslash character ( \) because the command continues to a second line.
Type the following commands to start the SMC DCM daemons on the SAC and on the RLCs:
# service mcdcm start # pdsh -g leader service mcdcm start |
Type the following commands on the SAC and on the RLCs, and observe the output:
# ps -ef | grep jsvc | grep -i "startup.Bootstrap" # ps -ef | grep jsvc | grep -i "DaemonImpl" |
In the output, make sure that the com.intel.dcm.daemon.DaemonImpl and org.apache.catalina.startup.Bootstrap daemons started.
On the SAC, and on each RLC, type the following command to make sure that the DCM daemon is started:
# grep "process started" /opt/sgi/mcdcm/logs/DatacenterManager.log Intel(R) Data Center Manager 2.x.xxxxx process started! |
On the SAC, type the following command to install the sgimc-dcm daemon:
# cinstallman --yum-node --node admin install sgimc-dcm |
On the SAC, use a text editor to open file /opt/sgi/sgimc/etc/dcm.properties , search for the string dcm.bmc, and modify the existing lines to be as follows:
dcm.bmc.user = ADMIN dcm.bmc.password = ADMIN |
The preceding lines are case sensitive.
On the SAC, use a text editor to open file /opt/sgi/sgimc/etc/tempo.properties , search for the string dcm.enabled, and modify the existing line (or add it) to be as follows:
dcm.enabled = true |
Proceed to one of the following:
If you need to install SMC license keys, for example on a new system, proceed to the following:
“(Conditional) Installing the SGI Management Center License Key ”
If the correct SMC license keys are already installed on your system, proceed to the following:
“Completing the Intel Data Center Power Manager (DCM) Software Development Kit (SDK) Configuration”
Perform the procedure in this topic if you need to install an SMC license key on your SGI ICE X system.
The SGI Management Center (SMC) software runs on the system admin controller (SAC). SMC provides a graphical user interface for system configuration, operation, and monitoring. For more information about using SMC, see SGI Management Center (SMC) System Administrator Guide. For more information about licensing, see the licensing FAQ on the following website:
http://www.sgi.com/support/licensing/faq.html
The following procedure explains how to obtain and install the license key for SMC.
Procedure 3-11. To license the SMC software
Use a text editor to open file /etc/lk/keys.dat.
Copy and paste the license key exactly as it was given to you.
There can be line breaks in the license key.
Save the file.
Type the following command to restart the SMC daemon:
# service mgr restart |
Type the following command to start SMC:
# mgrclient |
Proceed to “Completing the Intel Data Center Power Manager (DCM) Software Development Kit (SDK) Configuration”.
The following procedure explains how to complete the DCM enablement procedure.
| Note: Some output lines in this procedure were wrapped for inclusion in this documentation. |
Procedure 3-12. To complete the Intel DCM enablement procedure
Log into the system admin controller (SAC) and type the following command to restart the mgr daemon:
# service mgr restart |
On the SAC, use a text editor to open file /opt/sgi/sgimc/etc/perfhost_options.profile , and search for tempo.dcm.
Verify that lines similar to the following appear in the perfhost_options.profile file:
tempo.dcmBmc.r1i0n1=192.168.160.101 tempo.dcmBmc.r1i0n0=192.168.160.100 |
On the SAC, verify the contents of file /opt/sgi/sgimc/etc/site-site.profile .
At this time, the site-site.profile file should contain a line for each node that declares the rack leader controller (RLC) for each node. These lines are similar to the following:
PlatformManagementService.hostsettings.r1i0n0:BEGIN1,CPOWER,false,localhost,4,END1,BEGIN2,CONSERVER,false, localhost,admin,END2,BEGIN3,DCM,false,r1lead,400,0,,,4,IPNM2,END3,1,1,0,2,0 PlatformManagementService.hostsettings.r1i0n1:BEGIN1,CPOWER,false,localhost,4,END1,BEGIN2,CONSERVER,false, localhost,admin,END2,BEGIN3,DCM,false,r1lead,400,0,,,4,IPNM2,END3,1,1,0,2,0 |
If the site-site.profile file does not contain a line for each node, you need to add these lines manually. This can be the case if you installed the SMC license and are now using this procedure to enable DCM. You can use a loop, such as the following, to add these lines:
for node in `cnodes --compute --rack 1`; do echo PlatformManagementService.hostsettings.$node:BEGIN1,CPOWER,false,localhost,4,END1,BEGIN2,CONSERVER,false, localhost,admin,END2,BEGIN3,DCM,false,r1lead,400,0,,,4,IPNM2,END3,1,1,0,2,0 >> /opt/sgi/sgimc/etc/site-site.profile done |
(Optional) Add entries to the /opt/sgi/sgimc/etc/site-site.profile file for the RLCs and service nodes.
Perform this step if you want to enable DCM on nondefault nodes such as the RLCs and service nodes.
Edit the following configuration files to define DCM for the RLCs and service nodes:
Open file site-site.profile and add IP address information, such as the following examples show, to the file:
tempo.dcmBmc.r1lead=172.24.0.2 tempo.dcmBmc.service0=192.168.160.109 |
Open file /opt/sgi/sgimc/etc/site-site.profile and add information such as the following:
PlatformManagementService.hostsettings.r1lead:BEGIN1,CPOWER,false,localhost,4,END1,BEGIN2,CONSERVER,false, localhost,admin,END2,BEGIN3,DCM,false,admin,400,0,,,4,IPNM2,END3,1,1,0,2,0 PlatformManagementService.hostsettings.service0:BEGIN1,CPOWER,false,localhost,4,END1,BEGIN2,CONSERVER,false, localhost,admin,END2,BEGIN3,DCM,false,admin,400,0,,,4,IPNM2,END3,1,1,0,2,0 |
Type the mgrclient command to start the SGI Management Center GUI.
Click Edit > Preferences > Platform Management .
In the window that appears, select or set the following:
To the right of Management Device 3:, click Configure.
Select DCM as the third platform device.
Verify the defaults and change defaults if needed. The defaults are as follows:
DCM Primary Host Name: localhost Derated Power: 400 Nameplate Power 450 Unmanaged Power 100 Cooling Power Multipler: 0.5 Power Cost Per KWh: 1 Device Connector Type: IPNM2 Platform Management User: ADMIN |
For information about the defaults, see the SGI Management Center System Administrator's Guide.
Select the policy button.
Click OK.
Click OK.
On the SAC, type the following command to restart the mgr daemon:
# service mgr restart |
Wait for the changes to be applied to your SGI ICE X system.
On a large cluster, these changes can take 30 minutes or more to complete.
Type the mgrclient command to restart the SGI Management Center GUI.
In the SGI Management Center GUI, click Admin > Instrumentation > Power.
Visually inspect the information on the Power tab to verify that it presents data for the cluster and for the individual nodes.
Proceed to the following:
“Synchronizing the Software Repository, Installing Software Updates, and Cloning the Images”
For DCM enablment troubleshooting information, see “Repairing a Broken DCM Enablement” in Chapter 5.
The following procedure explains how to update the software in the repositories that you created with the cluster configuration tool. The following procedure assumes that the SGI ICE X system has a connection to the internet. If you need to perform this procedure on a secure SGI ICE X system, you need to modify this procedure. For a secure system, obtain the software updates from SGI SupportFolio manually and use the crepo command to install the software manually.
Procedure 3-13. To update the software
Through an ssh connection, log into the system admin controller (SAC) as the root user.
Type the following command to retrieve information about the network interface card (NIC) bonding method on the SAC:
# cadmin --show-mgmt-bonding --node admin |
If bonding has been set appropriately, the command returns 802.3ad.
If the command does not return 802.3ad, type the following commands to set the bonding appropriately and reboot the system:
# cadmin -set-mgmt-bonding -node admin 802.3ad # reboot |
Type the following command to retrieve the new images from SGI SupportFolio and the operating system vendor:
# sync-repo-updates |
For RHEL-based systems, make sure the system is subscribed as rhel-x86_64-server-6.
This step requires that the system be connected to the internet. Contact your SGI representative if this update method is not acceptable for your site.
Type the cinstallman --show-images command to retrieve the image names.
For example:
# cinstallman --show-images Image Name BT Path compute-rhel6.5 1 /var/lib/systemimager/images/compute-rhel6.5 service-rhel6.5 0 /var/lib/systemimager/images/service-rhel6.5 lead-rhel6.5 0 /var/lib/systemimager/images/lead-rhel6.5 |
The preceding output includes a line for the MPSS images if you have downloaded MPSS software for Intel Many Integrated Core Architecture (Intel MIC Architecture) based products.
(Optional) Clone the images.
Perform this step if you want to back up the current images before they are installed.
Type the following command:
cinstallman --create-image --clone --source src_image_name --image image |
For src_image_name, specify the name of the source image. For example: lead-rhel6.5.
For image, specify a file name for the copied file (the clone). For example: lead-rhel6.5.backup
For example:
# cinstallman --create-image --clone --source compute-rhel6.5 --image compute-rhel6.5.backup # cinstallman --create-image --clone --source service-rhel6.5 --image service-rhel6.5.backup # cinstallman --create-image --clone --source lead-rhel6.5 --image lead-rhel6.5.backup |
Type a series of cinstallman --update-image commands to update the software images.
For image, specify the software packages displayed in the previous step.
For example, to install the packages shows in Step 4, type the following commands:
# cinstallman --update-image --image compute-rhel6.5 # cinstallman --update-image --image service-rhel6.5 # cinstallman --update-image --image lead-rhel6.5 |
Proceed to one of the following:
If your SGI ICE X system contains Intel® Many Integrated Core Architecture (Intel MIC Architecture) devices on the compute nodes and/or the service nodes, proceed to “(Conditional) Downloading the Intel Manycore Platform Software Stack (MPSS) Software and Creating Images”.
If your SGI ICE X system does not contain MIC devices on the compute nodes and/or the service nodes, proceed to “Configuring the Switches”.
Perform the procedures in this topic if your SGI ICE X system is equipped with Intel Many Integrated Core Architecture (Intel MIC Architecture) based products. The Intel Many Integrated Core (MIC) devices are part of the Intel Manycore Platform Software Stack that runs on the Intel Xeon Phi Coprocessors found on SGI ICE X compute nodes and service nodes.
Intel Corporation provides software for its Intel MIC architecture products, and you need to download this software for use on your SGI ICE X system. The MPSS package that you download contains the software packages for the MIC devices on the compute blades and service nodes. The procedures in this topic explain how to download the RPMs from the Intel Corporation website and how to create images for compute nodes and service nodes that are equipped with MIC devices. Your system might have MIC devices on the compute nodes, on the service nodes, or both. Complete the procedures that are appropriate for your hardware configuration.
The following procedures explain how to obtain and deploy the MPSS software from Intel Corporation:
Complete the procedure in this topic if you have any MIC devices on your SGI ICE X system.
The following procedure explains how to download the MPSS software from Intel Corporation.
Procedure 3-14. To download the MPSS package
Open a browser, and navigate to the following website:
Click the Tools & Downloads tab.
Click the Software Drivers: Intel Manycore Platform Software Stack (Intel MPSS) link.
Follow the instructions on the website to download the Linux software version for your operating system platform. The download comes in the form of a tar(1) file.
Use the instructions from Intel to build the RPM files that you need.
A later procedure explains how to transfer these files to the SGI ICE X system and build new images.
Proceed to one of the following topics:
Perform both of the preceding procedures if you have MIC devices on both compute nodes and service nodes.
The following procedure explains how to create compute node images that include MIC device software.
Procedure 3-15. To create compute node images for compute nodes with MIC devices
On the SAC, use the mkdir(1) command, in the following format, to create a directory for the RPM repository:
mkdir -p /tftpboot/intel/mpss_repository_directory |
For mpss_repository_directory, type a name for the directory that is to contain the MPSS repository. For convenience, make sure to include an identifier for the MPSS release level you downloaded.
For example:
# mkdir -p /tftpboot/intel/mpss_u3-2.1.6720-19 |
Use operating system commands to copy the RPM files you downloaded to the /tftpboot/intel/mpss_repository_directory directory on the SAC node.
For example, use cp(1), ftp(1), rsync(1), scp(1), or another method.
Use the crepo command, in the following format, to specify a custom repository for the MPSS RPMs:
crepo --add rpm_repo_directory -custom rpm_repo_name |
The variables in this command are as follows:
For rpm_repo_directory, specify the full path to the directory that contains the RPM files.
For rpm_repo_name, create a name for the image. You can specify the same name for both rpm_repo_directory and rpm_repo_name. After the image is built, the cinstallman --show-images command returns this label in the Image Name column of its output.
For example:
# crepo --add /tftpboot/intel/mpss_u3-2.1.6720-19 --custom mpss_u3-2.1.6720-19 |
Type the following command to confirm that the MPSS image is in the correct repository:
# crepo --show |
Use the crepo command, in the following format, to add the custom repository to the generated RPM list in /etc/opt/sgi/rpmlists:
crepo --select rpm_repo_name |
For rpm_repo_name, create a name for the image. You can specify the same name for both rpm_repo_directory and rpm_repo_name. After the image is built, the cinstallman --show-images command returns this label in the Image Name column of its output. This is the same rpm_repo_name that you specified in the following step:
For example, the following command adds the custom repository and displays the content of the respository:
# crepo --select mpss_u3-2.1.6720-19 Selecting: mpss_u3-2.1.6720-19 Updating: /etc/opt/sgi/rpmlists/generated-compute-rhel6.5.rpmlist Updating: /etc/opt/sgi/rpmlists/generated-lead-rhel6.5.rpmlist Updating: /etc/opt/sgi/rpmlists/generated-service-rhel6.5.rpmlist |
Type the following command to confirm that you selected the new repository that contains the MPSS RPMs:
# crepo --show * mpss_u3-2.1.6720-19 : /tftpboot/intel/mpss_u3-2.1.6720-19 * Red-Hat-Enterprise-Linux-6.5 : /tftpboot/distro/rhel6.5 * SGI-MPI-1.7-rhel6 : /tftpboot/sgi/SGI-MPI-1.7-rhel6 * SGI-Management-Center-1.9-rhel6 : /tftpboot/sgi/SGI-Management-Center-1.9-rhel6 * SGI-Foundation-Software-2.9-rhel6 : /tftpboot/sgi/SGI-Foundation-Software-2.9-rhel6 * SGI-Tempo-2.9-rhel6 : /tftpboot/sgi/SGI-Tempo-2.9-rhel6 |
The asterisk character (*) in column 1 indicates an image that is selected.
Type the following command to display the images that are available for installation on the compute nodes:
# cinstallman --show-images Image Name BT Path lead-rhel6.5 0 /var/lib/systemimager/images/lead-rhel6.5 service-rhel6.5 0 /var/lib/systemimager/images/service-rhel6.5 compute-rhel6.5 1 /var/lib/systemimager/images/compute-rhel6.5 |
Use the cinstallman command, in the following format, to clone the current operating system image:
cinstallman --create-image --clone --source current_image --image new_image |
The variables in this command are as follows:
For current_image, type the name of the operating system image you want to use that is on the system right now. Choose one that appears in the output from the from the cinstallman --show-images command in the Image Name list. For example, compute-rhel6.5.
For new_image, type a new name for the operating system image that is to include the MPSS file RPMs. SGI recommends that you include information in the new name that can enable you to identify this image as the operating system image that includes MPSS software. For example, compute-rhel6.5-mic-6720-19 identifies the new image as a RHEL image that contains a revision of the MPSS MIC software.
For example:
# cinstallman --create-image --clone --source compute-rhel6.5 --image compute-rhel6.5-mic-6720-19 |
Type the following command to display the images and confirm that the new image appears in the list:
# cinstallman --show-images Image Name BT Path lead-rhel6.5 0 /var/lib/systemimager/images/lead-rhel6.5 service-rhel6.5 0 /var/lib/systemimager/images/service-rhel6.5 compute-rhel6.5 1 /var/lib/systemimager/images/compute-rhel6.5 compute-rhel6.5-mic-6720-19 1 /var/lib/systemimager/images/compute-rhel6.5-mic-6720-19 |
Use the cinstallman command, in the following format, to install the sgi-mic-compute package and the MPSS RPMs:
cinstallman --yum-image --image image_name install sgi-mic-compute |
For image_name, specify the new_image that you created in the following step:
For example:
# cinstallman --yum-image --image compute-rhel6.5-mic-6720-19 install sgi-mic-compute |
(Conditional) Enable the SLES operating system to load the MPSS package.
Perform this step if you are installing the MPSS packages on a SLES platform.
Complete the following steps:
Open file /etc/modprobe.d/unsupported-modules with a text editor.
Add the following line at the end of the file:
allow_unsupported_modules 1 |
Save and close the file.
Type the following command to display all the images:
# cimage --list-images
image: compute-rhel6.5
kernel: 2.6.32-358.el6.x86_64
image: compute-rhel6.5-mic-6720-19
kernel: 2.6.32-358.el6.x86_64 |
The preceding output shows the newly installed image, compute-rhel6.5-mic-6720-19.
Use the cimage command, in the following format, to set the default image for the compute nodes:
cimage --set-default --file_system compute-rhel6.5-mic-6720-19 kernel |
The variables in this command are as follows:
For file_system, type either nfs or tmpfs, according to your site practice.
For kernel, type the kernel associated with the new image, as shown in the output from the cimage --list-images command in the following step:
For example:
# cimage --set-default --compute-rhel6.5-mic-6720-19 2.6.32-358.el6.x86_64 |
Type the following command to confirm that the new image is the default image:
# cimage --show-default compute-rhel6.5-stout7-mic-6720-15 2.6.32-358.el6.x86_64 tmpfs |
Proceed to one of the following:
If your SGI ICE X system contains MIC devices on the service nodes, proceed to “Creating Images for the Service Nodes That Include MIC Devices ”.
If your SGI ICE X system does not contain MIC devices on the service nodes, proceed to “Configuring the Switches”.
The following procedure explains how to create service node images that include MIC device software.
Procedure 3-16. To create service node images for service nodes with MIC devices
On the SAC, use the mkdir(1) command, in the following format, to create a directory for the RPM repository:
mkdir -p /tftpboot/intel/mpss_repository_directory |
For mpss_repository_directory, type a name for the directory that is to contain the MPSS repository. For convenience, make sure to include an identifier for the MPSS release level you downloaded.
For example:
# mkdir -p /tftpboot/intel/mpss_u3-2.1.6720-19 |
Use operating system commands to copy the RPM files you downloaded to the /tftpboot/intel/mpss_repository_directory directory on the SAC node.
For example, use cp(1), ftp(1), rsync(1), scp(1), or another method.
Use the crepo command, in the following format, to specify a custom repository for the MPSS RPMs:
crepo --add rpm_repo_directory -custom rpm_repo_name |
The variables in this command are as follows:
For rpm_repo_directory, specify the full path to the directory that contains the RPM files.
For rpm_repo_name, create a name for the image. You can specify the same name for both rpm_repo_directory and rpm_repo_name. After the image is built, the cinstallman --show-images command returns this label in the Image Name column of its output.
For example:
# crepo --add /tftpboot/intel/mpss_u3-2.1.6720-19 --custom mpss_u3-2.1.6720-19 |
Type the following command to confirm that the MPSS image is in the correct repository:
# crepo --show |
Use the crepo command, in the following format, to add the custom repository to the generated RPM list in /etc/opt/sgi/rpmlists:
crepo --select rpm_repo_name |
For rpm_repo_name, create a name for the image. You can specify the same name for both rpm_repo_directory and rpm_repo_name. After the image is built, the cinstallman --show-images command returns this label in the Image Name column of its output. This is the same rpm_repo_name that you specified in the following step:
For example, the following command adds the custom repository and displays the content of the respository:
# crepo --select mpss_u3-2.1.6720-19 Selecting: mpss_u3-2.1.6720-19 Updating: /etc/opt/sgi/rpmlists/generated-compute-rhel6.5.rpmlist Updating: /etc/opt/sgi/rpmlists/generated-lead-rhel6.5.rpmlist Updating: /etc/opt/sgi/rpmlists/generated-service-rhel6.5.rpmlist |
Type the following command to confirm that you selected the new repository that contains the MPSS RPMs:
# crepo --show * mpss_u3-2.1.6720-19 : /tftpboot/intel/mpss_u3-2.1.6720-19 * Red-Hat-Enterprise-Linux-6.5 : /tftpboot/distro/rhel6.5 * SGI-MPI-1.7-rhel6 : /tftpboot/sgi/SGI-MPI-1.7-rhel6 * SGI-Management-Center-1.9-rhel6 : /tftpboot/sgi/SGI-Management-Center-1.9-rhel6 * SGI-Foundation-Software-2.9-rhel6 : /tftpboot/sgi/SGI-Foundation-Software-2.9-rhel6 * SGI-Tempo-2.9-rhel6 : /tftpboot/sgi/SGI-Tempo-2.9-rhel6 |
The asterisk character (*) in column 1 indicates an image that is selected.
Type the following command to display the images that are available for installation on the service nodes:
# cinstallman --show-images Image Name BT Path lead-rhel6.5 0 /var/lib/systemimager/images/lead-rhel6.5 service-rhel6.5 0 /var/lib/systemimager/images/service-rhel6.5 compute-rhel6.5 1 /var/lib/systemimager/images/compute-rhel6.5 |
Use the cinstallman command, in the following format, to clone the current operating system image:
cinstallman --create-image --clone --source current_image --image new_image |
The variables in this command are as follows:
For current_image, type the name of the operating system image you want to use. Choose one that appears in the output from the cinstallman --show-images command in the Image Name list. For example, service-rhel6.5.
For new_image, type a new name for the operating system image that is to include the MPSS file RPMs. SGI recommends that you include information in the new name that can enable you to identify this image as the operating system image that includes MPSS software. For example, service-rhel6.5-mic-6720-19 identifies the new image as a RHEL image that contains a revision of the MPSS MIC software.
For example:
# cinstallman --create-image --clone --source service-rhel6.5 --image service-rhel6.5-mic-6720-19 |
Type the following command to display the images and confirm that the new image appears in the list:
# cinstallman --show-images Image Name BT Path lead-rhel6.5 0 /var/lib/systemimager/images/lead-rhel6.5 service-rhel6.5 0 /var/lib/systemimager/images/service-rhel6.5 compute-rhel6.5 1 /var/lib/systemimager/images/compute-rhel6.5 service-rhel6.5-mic-6720-19 1 /var/lib/systemimager/images/service-rhel6.5-mic-6720-19 |
Use the cinstallman command, in the following format, to install the sgi-mic-service package and the MPSS RPMs:
cinstallman --yum-image --image image_name install sgi-mic-service |
For image_name, specify the new_image that you created in the following step:
For example:
# cinstallman --yum-image --image service-rhel6.5-mic-6720-19 install sgi-mic-service |
(Conditional) Enable the SLES operating system to load the MPSS package.
Perform this step if you are installing the MPSS packages on a SLES platform.
Complete the following steps:
Open file /etc/modprobe.d/unsupported-modules with a text editor.
Add the following line at the end of the file:
allow_unsupported_modules 1 |
Save and close the file.
Type the following command to display all the images:
# cimage --list-images
image: compute-rhel6.5
kernel: 2.6.32-358.el6.x86_64
image: compute-rhel6.5-mic-6720-19
kernel: 2.6.32-358.el6.x86_64
image: service-rhel6.5-mic-6720-19
kernel: 2.6.32-358.el6.x86_64 |
The preceding output shows the newly installed image, service-rhel6.5-mic-6720-19.
Use the cimage command, in the following format, to set the default image for the service nodes:
cimage --set-default --file_system service-rhel6.5-mic-6720-19 kernel |
The variables in this command are as follows:
For file_system, type either nfs or tmpfs, according to your site practice.
For kernel, type the kernel associated with the new image, as shown in the output from the cimage --list-images command in the following step:
For example:
# cimage --set-default --service-rhel6.5-mic-6720-19 2.6.32-358.el6.x86_64 |
Type the following command to confirm that the new image is the default image:
# cimage --show-default service-rhel6.5-stout7-mic-6720-15 2.6.32-358.el6.x86_64 tmpfs |
Proceed to the following:
The discover command initializes and configures the system components for the SGI ICE X system. The switch configuration procedures explain how to use the discover command to configure the SGI ICE system's management switches first. After you configure the management switches, if you have MCells, you configure the cooling equipment on the MCell network's switch ports.
The SGI ICE X configuration requires that the same IP address be assigned to the system's head gateway and to the first management switch, usually mgmtswitch0. The procedures in this topic assume that you want to use the default IP address, which is 172.23.0.254, for both components, but the procedures include example commands that show how to configure an alternate IP address.
Switch configuration can proceed much more quickly if you have a media access control (MAC) file. For new SGI ICE X systems, you can obtain a MAC file from your SGI representative.
The MAC file shows the MAC addresses of the components in your environment. Switch discovery and configuration can complete more quickly if you obtain this file. Without this file, you need to power cycle each switch manually.
The following is an example MAC file:
r1lead 00:30:48:9e:f2:59 00:30:48:F2:7E:A2 r2lead 00:25:90:01:4e:3c 00:25:90:01:6c:cc service0 00:25:90:00:3b:8f 00:25:90:01:6e:9e mgmtsw0 00:26:f3:c3:7a:40 00:26:f3:c3:7a:40 |
The content of a MAC file is as follows:
| Column | Content |
| 1 | The component's system ID. |
| 2 | For nodes, column 2 contains the MAC address of baseboard management controller (BMC) on the component. For switches, column 2 contains the MAC address of the first network interface card (NIC), eth0. Switches do not have a BMC. |
| 3 | The MAC address of the first network interface card (NIC), eth0. For switches, columns two and three are identical in the MAC file. |
The following list explains the procedures you need to follow to configure the switches:
| Procedure | Circumstance |
| “Verifying the Switch Cabling” | All switch configuration circumstances. Perform this procedure regardless of the presence of a MAC file or MCells. |
| “Configuring Management Switches With a MAC File” | If you have a MAC file. |
| “Configuring Management Switches Without a MAC File” | If you do not have a MAC file. |
| “(Conditional) Configuring the Cooling Racks and Cooling Distribution Units (CDUs) on the MCell Network's Switch Ports” | (Conditional) If you have MCells. This extra procedure configures the MCell switches separately from the rest of the ICE X switches. |
The following figures show example switch cabling. Depending on the switch configuration procedure you use, you might have to plug cables out from and into switch ports during the configuration process. Regardless of the procedure you use, the cables on your switches eventually become cabled as shown in the following figures.
Figure 3-10 shows a switch stack with two switches. In this switch stack, the two switches constitute the primary switch stack. One is the master switch and the other is the slave switch.
Figure 3-11 shows a switch stack with multiple switches. The first two switches constitute the primary switch stack, and the other switches constitute the secondary switch stack.
The following procedure explains how to inspect your switches and prepare for the configuration procedure.
Procedure 3-17. To verify your switches
Visually inspect your system.
Note the types of switches you have and their identifiers. Switches are configured in stacks of two switches per stack. In each stack, the top switch is typically the master switch, and the bottom switch is typically the slave switch. At a minimum, you have one primary switch stack. The system admin controller (SAC) connects to the master switch in the primary switch stack. You might have additional secondary switch stacks.
Within each stack, each switch is labeled MSW xx. In the primary switch stack, the master switch is labeled MSW0A, and the slave switch is labeled MSW0B. In the first secondary switch stack, the master switch is labeled MSW1A, and the slave switch is labeled MSW1B. The A and B on the switch labels identify the master switch and slave switch in the stack. The switchconfig set command operates on a switch stack (not just one switch), so you need to note only the characters on the label that precede the A and B when you provide information to the switchconfig set command. Use the following table to determine the value you need to use for switch on the switchconfig commands:
| switch | Label | |
| mgmtsw0 | MSW0A or MSW0B | |
| mgmtsw1 | MSW1A or MSW1B | |
| mgmtsw2 | MSW2A or MSW2B | |
| mgmtsw3 | MSW3A or MSW3B | |
| mgmtsw4 | MSW4A or MSW4B | |
| mgmtsw5 | MSW5A or MSW5B | |
| mgmtsw6 | MSW6A or MSW6B | |
| mgmtsw7 | MSW7A or MSW7B | |
| mgmtsw8 | MSW8A or MSW8B | |
| mgmtsw9 | MSW9A or MSW9B |
Make sure that only the SAC is plugged in and that all rack circuit breakers are powered off.
If you have a MAC file, connect all other nodes and switches to a power source, but do not power them on. That is, make sure that all chassis management controllers (CMCs), all rack leader controllers (RLCs), all service nodes, all switches, and so on, are not powered on.
If you do not have a MAC file, unplug all other nodes and switches. The SAC should be the only component that is plugged in.
(Conditional) Remove the cascade cables that connect the slaves switches together.
Perform this step only if you have two or more switch stacks.
Locate the cascading cables that connect the switch stacks to each other, and unplug the cascading cable end from the lower switch in the neighboring switch stack. On the front of the switch stack, unplug the cascading cables that plug into the ports on the slave switches. When you unplug one end of each cable, you prevent a switching loop.
| Note: Do not unplug the stacking cables in the rear of the switch stack. The installation procedure instructs you to plug or unplug only the cascading cables in the front of the switch stack. |
For example, if you have one secondary switch stack, locate the cascading cable that runs from the slave switch on the primary switch to the slave switch on the secondary switch. Unplug that cable from the port on the slave switch.
Proceed as follows:
If you have a MAC file, proceed to the following:
If you do not have a MAC file, proceed to the following:
The following procedure explains how to configure your switches when you have each switch's MAC information in a MAC file.
Procedure 3-18. To configure switches -- with a MAC file
Through an ssh connection, log in as root to the SAC, and write the MAC file to a location on your SAC.
For example, write it to /var/tmp/mac_file.
Plug in all the switches.
(Conditional) Specify a site-specific IP address for the head gateway.
Perform this step if you need to set a site-specific, nondefault IP address for the primary switch stack (head gateway).
By default, SGI sets the IP address for the primary switch stack (head gateway) to 172.23.0.254. Type the following command to set a site-specific IP address:
# cadmin -set-head-gateway IP_addr |
For IP_addr, specify your site-specific IP address.
Use the discover command, in the following format, to configure the primary switch stack (the switches labeled MSW0A and MSW0B), which is attached to the SAC:
# discover --mgmtswitch 0,gateway=yes --macfile path |
For path, type the full path to the location of the MAC file.
This step assigns an IP address to the primary switch stack. The primary switch stack becomes the head gateway.
(Conditional) Type additional discover commands for each secondary switch stack.
Complete this step if you have more than one switch.
The formats for these additional commands are as follows:
# discover --mgmtswitch num --macfile path |
The arguments are as follows:
| Argument | Specification |
| num | The identifier for the switch stack. |
| path | The full path to the location of the MAC file. |
For example:
# discover --mgmtswitch 1 --macfile /var/tmp/mac_file |
(Conditional) On the front of the switch stack, plug the cascading cables into the ports on the slave switches.
Perform this step only if you have two or more switch stacks.
This step is the opposite of the following step, in which you unplugged the cascading cables from the ports on the slave switches:
Type the following command to retrieve information about the switches that you discovered, and examine the output for errors:
# cnodes --mgmtswitch |
Use the ssh(1) command to open a console window to the SAC.
Type the following command, and monitor the power-on process in the SAC's console window:
# tail -f /var/log/cmcdetectd.log |
Flip the power breakers on the SGI ICE X's CMCs, one rack at a time.
The cmcdetectd daemon runs on the SAC. It configures the top level switches so that the CMCs are on the appropriate rack VLAN. After you power on rack one, the cmcdetectd daemon detects the rack and adds the rack to the switch. After the CMCs for rack one appear on the switch, power on the CMCs for rack two.
Use the switchconfig command, in the following format, to list the CMCs associated with each switch:
switchconfig list -s switch |
For switch, specify the system ID for a switch, for example mgmtsw0, mgmtsw1, and so on.
Issue one of these commands for each switch in your configuration and examine the output. If the output includes all the CMCs in your SGI ICE X system, you can proceed with your configuration. If the output contains errors or does not include all expected CMCs, contact your SGI representative for troubleshooting information.
Save the configuration to the nonvolatile memory (flash) on the switches.
| Note: This step is very important. In the event of a power outage or other interruption, the switch stack boots with the saved configuration. |
Type the switchconfig command in the following format:
# switchconfig save_running_config -s mgmtsw0[,mgmtsw1,mgmtsw2,...] |
Include the parameters mgmtsw1, mgmtsw2 , and so on, only if there are switches in addition to the primary switch (mgmtsw0).
Use the switchconfig command, in the following format, to back up the switch configuration to a file on the SAC:
switchconfig pull_switch_config -s switch_ID -f file [--debug] |
The arguments are as follows:
| Argument | Specification |
| switch_ID | The switch system ID. For example, this could be mgmtsw0, mgmtsw1, mgmtsw2, and so on. The output from the cnodes command includes this information. |
| file | The name of the file to receive the switch configuration information. The command writes the file to the /tftpboot/ file.cfg. If your file specification ends in .cfg, the command does not append another .cfg string to the file name. |
The --debug parameter is optional. When specified, the command writes debugging information to /var/log/switchconfig .
For example, the following command writes the configuration file for mgmtsw0 to file /tftpboot/mgmtsw0_startup1.cfg on the SAC:
switchconfig pull_switch_config -s mgmtsw0 -f mgmtsw0_startup1 --debug |
Issue one of these commands per switch.
In the future, if you need to replace the switch, you can save configuration time if you push out this configuration file from the SAC to the new switch.
After all management switches have been configured, proceed as follows::
If you have MCells, proceed to the following:
If you do not have MCells, proceed to the following:
“Configuring the Rack Leader Controllers (RLCs) and Service Nodes with the discover Command”
The following procedure explains how to configure your switches when you do not have the switch information in a MAC file.
Procedure 3-19. To configure switches -- without a MAC file
(Conditional) Specify a site-specific IP address for the head gateway.
Perform this step if you need to set a site-specific, nondefault IP address for the primary switch stack (head gateway).
By default, SGI sets the IP address for the primary switch stack (head gateway) to 172.23.0.254. Type the following command to set a site-specific IP address:
# cadmin -set-head-gateway IP_addr |
For IP_addr, specify your site-specific IP address.
Through an ssh connection, log in to the SAC as the root user, and type the following command:
# discover --mgmtswitch 0,gateway=yes |
When prompted, connect the primary switch stack to a power source.
To complete this step, plug in the master switch and then the slave switch so that the entire primary switch stack is powered up. In this way, the master switch boots just a few seconds before the slave switch.
The discover command configures the MAC address of the switch after you connect the primary switch stack to a power source.
(Conditional) Configure each secondary switch stack.
Perform this step if you have secondary switch stacks.
Complete the following steps for each secondary switch stack:
Plug in the switch stack when prompted by the system.
Type a discover command for the switch in the following format:
# discover --mgmtswitch num |
For num, type the identifier for the switch.
For example:
# discover --mgmtswitch 1 |
(Conditional) On the front of the switch stack, plug the cascading cables into the ports on the slave switches.
Perform this step only if you have secondary switch stacks.
This step is the opposite of the following step, in which you unplugged the cascading cables from the ports on the slave switches:
Type the following command to retrieve information about the switches that you configured, and examine the output for errors:
# cnodes --mgmtswitch |
Use the ssh(1) command to open a console window to the SAC.
Type the following command, and monitor the power-on process in the SAC's console window:
# tail -f /var/log/cmcdetectd.log |
Flip the power breakers on the SGI ICE X's CMCs, one rack at a time.
The cmcdetectd daemon runs on the SAC. It configures the top level switches so that the CMCs are on the appropriate rack VLAN. After you power on rack one, the cmcdetectd daemon detects the rack and adds the rack to the switch. After the CMCs for rack one appear on the switch, power on the CMCs for rack two.
Use the switchconfig command, in the following format, to list the CMCs associated with each switch:
switchconfig list -s switch |
For switch, specify the system ID for a switch, for example mgmtsw0, mgmtsw1, and so on.
Issue one of these commands for each switch in your configuration and examine the output. If the output includes all the CMCs in your SGI ICE X system, you can proceed with your configuration. If the output contains errors or does not include all expected CMCs, contact your SGI representative for troubleshooting information.
Save the configuration to the nonvolatile memory (flash) on the switches.
| Note: This step is very important. In the event of a power outage or other interruption, the switch stack boots with the saved configuration. |
Type the switchconfig command in the following format:
# switchconfig save_running_config -s mgmtsw0[,mgmtsw1,mgmtsw2,...] |
Include the parameters mgmtsw1, mgmtsw2 , and so on, only if there are switches in addition to the primary switch (mgmtsw0).
For each switch you configured, use the switchconfig command, in the following format, to back up the switch configuration to a file on the SAC:
switchconfig pull_switch_config -s switch_ID -f file [--debug] |
The arguments are as follows:
| Argument | Specification |
| switch_ID | The switch system ID. For example, this could be mgmtsw0, mgmtsw1, mgmtsw2, and so on. The output from the cnodes command includes this information. |
| file | The name of the file to receive the switch configuration information. The command writes the file to the /tftpboot/ file.cfg. If your file specification ends in .cfg, the command does not append another .cfg string to the file name. |
The --debug parameter is optional. When specified, the command writes debugging information to /var/log/switchconfig .
For example, the following command writes the configuration file for mgmtsw0 to file /tftpboot/mgmtsw0_startup1.cfg on the SAC:
switchconfig pull_switch_config -s mgmtsw0 -f mgmtsw0_startup1 --debug |
Issue one of these commands per switch.
In the future, if you need to replace a switch, you can save configuration time if you push out this configuration file from the SAC to the new switch.
Use the discover command, in the following format, to save the MAC addresses to a MAC file:
# discover --show-macfile > path |
For path, type the full path to the location of the MAC file. For example, /var/tmp/mac_file.
You can use this MAC file if you ever have to configure the switches again.
After all management switches have been configured, proceed as follows:
If you have MCells, proceed to the following:
If you do not have MCells, proceed to the following:
“Configuring the Rack Leader Controllers (RLCs) and Service Nodes with the discover Command”
Perform the procedure in this topic if you have an SGI ICE X system that includes MCells.
An SGI ICE X system contains CDUs and cooling rack controllers (CRCs). The CDUs and CRCs have statically assigned IP addresses. These IP addresses are critical to associating the individual rack units (IRUs) with specific CDUs or CRCs. For information about these IP addresses, see the following:
Appendix C, “MCell Network IP Addresses”
The following procedure explains how to configure the switches attached to the MCells.
Procedure 3-20. To configure MCell switches
Gather information about the MCell switches in your SGI ICE X system.
Visually inspect your system. Note the switches identifiers, and note the port identifiers.
Log in as the root user to the system admin controller (SAC).
Type following command to retrieve information about the virtual local area networks (VLANs) that are configured at this time:
# cattr list -g mcell_vlan global mcell_vlan : 3 |
The preceding output shows that the MCell VLAN is VLAN 3.
Use the switchconfig set command, in the following format, to configure the ports on which the CDUs and the CRCs are connected to the MCells:
switchconfig set -v num=vlan_num -b=none -d vlan_num -p ports -s switch |
Type an individual switchconfig set command for each switch on the SGI ICE X system network.
The arguments are as follows:
| Argument | Specification |
| vlan_num | The VLAN number of the MCell network. For vlan_num, use the output from the cattr list command as shown earlier in this procedure. The default is 3, and SGI recommends that you do not change this value. This argument appears in two places in the switchconfig command. |
| ports | Specify the target ports. The command configures both the target ports and the corresponding redundant ports. |
| switch | The ID number of the management switch to which the CDU or CDC is attached. For example: mgmtsw0. To determine this value, you need to visually inspect the switch, as follows:
|
Example 1. The following command configures VLAN 3 on management switch 0 for target ports 1/31, 1/32, and 1/33 and for redundant ports 2/31, 2/32, and 2/33:
switchconfig set -v num=3 -b=none -d 3 -p 1/31,1/32,1/33 -s mgmtsw0 |
Example 2. The following command configures VLAN 3 on management switch 0 for target ports 2/31, 2/32, and 2/33 and for redundant ports 1/31, 1/32, and 1/33:
switchconfig set -v num=3 -b=none -d 3 -p 2/31,2/32,2/33 -s mgmtsw0 |
Note: If you make a mistake in your configuration, you can disable
the ports from the VLANs you configured. The following example command
removes the configuration of VLAN 3 from the target ports and the redundant
ports:
|
Repeat the following step for each CDU and each CRC attached to your system:
If you encounter errors, issue a switchconfig set command again.
Save the configuration to the nonvolatile memory (flash) on the switches.
| Note: This step is very important. In the event of a power outage or other interruption, the switch stack boots with the saved configuration. |
Type the switchconfig command in the following format:
switchconfig save_running_config -s mgmtsw0[,mgmtsw1,mgmtsw2,...] |
Include the parameters mgmtsw1, mgmtsw2 , and so on, only if there are switches in addition to the primary switch (mgmtsw0).
Type the following command to back up the switch configuration to a file on the SAC:
switchconfig pull_switch_config -s switch_ID -f file |
The arguments are as follows:
| Argument | Specification |
| switch_ID | The switch system ID. For example, this could be mgmtsw0, mgmtsw1, mgmtsw2, and so on. The output from the cnodes command includes this information. |
| file | The name of the file to receive the switch configuration information. The command writes the file to the /tftpboot/ file.cfg. If your file specification ends in .cfg, the command does not append another .cfg string to the file name. |
For example, the following command writes the configuration file for mgmtsw0 to file /tftpboot/mgmtsw0_startup1.cfg on the SAC:
switchconfig pull_switch_config -s mgmtsw0 -f mgmtsw0_startup1 [--debug] |
The --debug parameter is optional. When specified, the command writes debugging information to /var/log/switchconfig .
In the future, if you need to replace the switch, you can save configuration time if you push out this configuration file from the SAC to the new switch.
After all switches have been discovered, proceed to the following:
“Configuring the Rack Leader Controllers (RLCs) and Service Nodes with the discover Command”
The discover command finds and configures rack leader controllers (RLCs), service nodes, and external switches. If you have a media access control file (MAC file), this procedure can complete more quickly. The procedure includes configuration steps that explain how to complete the procedure both with and without a MAC file.
The following procedure explains how to configure the RLCs and service nodes.
Procedure 3-21. To configure the RLCs and service nodes
Visually inspect your SGI ICE X system and note the labels on the RLCs and service nodes.
RLCs are numbered starting with 1.
Service nodes are numbered starting with 0.
Check the power cords on all RLCs and services nodes, as follows:
If you have a MAC file to specify as input to the discover command, make sure all RLCs and service nodes are plugged in.
Do not power-on the nodes at this time. When the node is plugged in and connected to a power source, the baseboard management controller (BMC) is started, and that is all that is required at this time.
If you do not have a MAC file, make sure that all RLCs and service nodes are unplugged from their power sources.
Through an ssh connection, log into the system admin controller (SAC) as the root user.
Type the following command to retrieve the option code that is in use:
# cadmin --show-dhcp-option |
The RLCs and service nodes determine the integrated Ethernet devices by accepting DHCP leases that belong only to the SGI ICE X system. SGI ICE X systems use DHCP option code 149 by default. In rare situations, a house network DHCP server could be configured to use this option code. In this case, nodes that are connected to the house network can mistake a house DHCP server as belonging to the SGI ICE X DCHP server, which can lead to an installation failure. Change this option code only if absolutely necessary.
To change the dhcp option code number used for this operation, type a command such as the following:
# cadmin --set-dhcp-option 150 |
This command sets the DHCP option code to 150.
(Conditional) Plug in all the racks and service nodes.
Perform this step if you plan to specify a MAC file as input to the discover command.
A MAC file contains the MAC addresses for RLCs and service nodes. If you use a MAC file, the configuration process can complete more quickly. Contact your SGI representative to find out if a MAC file is available. For more information about MAC files, see “Configuring the Switches”.
Use one or more discover commands, in the following format, to configure the RLCs and service nodes on your SGI ICE X system:
discover --rack[set] count[,mic=mic_num] --service[set] count[,mic=mic_num] [--macfile macfile] |
The arguments are as follows:
| Argument | Specification | ||||
| count | Specify the ID number for the rack (or racks) or service node (or service nodes) that you want to configure. If you want to configure one rack or one service node, specify --rack or --service and the system ID number that corresponds to that rack or service node. If you want to configure a range of racks or a range of service nodes, specify --rackset or --serviceset, the starting system ID number, a comma (,), and the ending system ID number. Examples:
| ||||
| mic_num | Specify the number of Intel® Many Integrated Core (MIC) devices that reside on each blade or service node. By default, the discover command assumes zero (0). If you have MIC devices on your compute nodes, specify the number you have, which can be 1 or 2. For service nodes, mic_num can be 1, 2, 3, or 4. Specify the ,mic=mic_num parameter only if your SGI ICE X system includes MIC devices. | ||||
| macfile | Specify the full path to the media access control (MAC) input file. Specify this option if you have a MAC file. |
To retrieve more information about the discover command, type discover --h.
Example 1. The following command uses a MAC file to configure rack 1 and service node 0:
# discover --rack 1 --service 0 --macfile mymacfile |
Example 2. If you have one rack and one service node, type the following command:
# discover --rack 1 --service 0 |
Example 3. If you have five racks and three service nodes, type the following command:
# discover --rackset 1,5 --serviceset 1,3 |
Example 4. If you have one rack, one service node, two MIC devices attached to each blade on the rack, and two MIC devices attached to the service node, type the following command:
# discover --rack 1,mic=2 --service 0,mic=2,image=mic_serv_image |
In example 4, for mic_serv_image, specify the label (name) of the service node image that you created in “Creating Images for the Service Nodes That Include MIC Devices ”.
(Conditional) When prompted to do so by the system, plug in each individual rack or service node.
Perform this step if you did not use a MAC file as input to the discover command.
The system prompt for this action is as follows:
At this time, please turn on the power to this service node. Do not turn the system on. |
The blue light on each component turns on when configuration is complete.
You can use the console(1) command if you want to watch the installation progress. The sessions are also logged.
Type the following commands to update the configuration files:
# update-configs # update-switch-routing |
Type the following commands to save the configuration to the nonvolatile memory (NVM) on the switches:
# switchconfig save_running_config -s mgmtsw0[,mgmtsw1,mgmtsw2,...] |
Include the parameters mgmtsw1, mgmtsw2 , and so on, only if there are switches in addition to management switch 0.
(Conditional) Confirm the status of the MIC devices.
Perform this step if your SGI ICE X system has MIC devices.
Complete the following steps:
Type the following command to make sure that the MIC devices came online:
# cexec --all 'micctrl --status' |
The following example output shows that the MIC devices came online correctly:
************************* rack_1 ************************* ************************* rack_1 ************************* --------- r1i0n6--------- mic0: online (mode: linux image: /lib/firmware/mic/uos.img) mic1: online (mode: linux image: /lib/firmware/mic/uos.img) --------- r1i0n7--------- mic0: online (mode: linux image: /lib/firmware/mic/uos.img) mic1: online (mode: linux image: /lib/firmware/mic/uos.img) --------- r1i0n8--------- mic0: online (mode: linux image: /lib/firmware/mic/uos.img) mic1: online (mode: linux image: /lib/firmware/mic/uos.img) |
Type the following command to verify the IP addresses and MTU size of the MIC devices:
# cexec --all 'micctrl --config | grep -E \"MIC IP|Bits|MtuSize\"' |
The following example output shows that all the MIC devices have addresses on the 10.157.1.0/24 and the 10.158.1.0/24 networks and that the MIC devices use the correct MTU size of 9000:
************************* rack_1 *************************
************************* rack_1 *************************
--------- r1i0n4---------
MIC IP: 10.157.1.6
Net Bits: 24
MtuSize: 9000
MIC IP: 10.158.1.6
Net Bits: 24
MtuSize: 9000
--------- r1i0n5---------
MIC IP: 10.157.1.7
Net Bits: 24
MtuSize: 9000
MIC IP: 10.158.1.7
Net Bits: 24
MtuSize: 9000
--------- r1i0n6---------
MIC IP: 10.157.1.8
Net Bits: 24
MtuSize: 9000
MIC IP: 10.158.1.8
Net Bits: 24
MtuSize: 9000
--------- r1i0n7---------
MIC IP: 10.157.1.9
Net Bits: 24
MtuSize: 9000
MIC IP: 10.158.1.9
Net Bits: 24
MtuSize: 9000
--------- r1i0n8---------
MIC IP: 10.157.1.10
Net Bits: 24
MtuSize: 9000
MIC IP: 10.158.1.10
Net Bits: 24
MtuSize: 9000 |
Proceed as follows:
If you want to configure a backup domain name service (DNS) server, proceed to the following:
“(Optional) Configuring a Backup Domain Name Service (DNS) Server ”
To configure the InfiniBand subnetworks, proceed to the following:
Typically, the DNS on the system admin controller (SAC) provides name services for the SGI ICE X system. When you configure a backup DNS, however, the compute nodes can use a service node as a secondary DNS server if the SAC is down, being serviced, or is otherwise not available. You can configure a backup DNS only after you run the discover command to configure the cluster. This is an optional feature.
The following procedure explains how to configure a service node to act as a DNS.
Procedure 3-22. To enable a backup DNS
Through an ssh connection, log into the system admin controller (SAC) as the root user.
Type the following command to retrieve a list of available service nodes:
# cnodes --service |
The service node you want to use as a backup DNS must be configured in the system already. That is, you must have run the discover command to configure the service nodes.
Type the following command to start the cluster configuration tool:
# /opt/sgi/sbin/configure-cluster |
On the Main Menu screen, select B Configure Backup DNS Server (optional), and select OK.
On screen that appears, type the identifier for the service node that you want to designate as the backup DNS, and select OK.
Figure 3-12 shows how to specify service0 as the backup DNS.
To disable this feature, select Disable Backup DNS from the same menu and select Yes to confirm your choice.
The InfiniBand network on SGI ICE X systems uses Open Fabrics Enterprise Distribution (OFED) software. For information about OFED, see the following website:
For more information about the InfiniBand fabric implementation on SGI ICE X systems, see the SGI ICE X Administration Guide.
Each SGI ICE X system has two InfiniBand fabric network cards, ib0 and ib1. Each subnetwork has a subnet manager, which runs on a rack leader controller (RLC). The following procedure explains how to specify the RLC that you want to configure as the master and the RLC that you want to configure as the stand-by.
Procedure 3-23. To configure the InfiniBand network
Through an ssh connection, log into the system admin controller (SAC) as the root user.
Type the following command to disable InfiniBand switch monitoring:
% cattr set disableIbSwitchMonitoring true |
The system sometimes issues InfiniBand switch monitoring errors before the InfiniBand network has been fully configured. The preceding command disables InfiniBand switch monitoring.
Use one of the following methods to access the InfiniBand network configuration tool:
Type the following command to start the cluster configuration tool:
# /opt/sgi/sbin/configure-cluster |
After the cluster configuration tool starts, select F Configure InfiniBand Fabric, and select OK.
Type the following command to start the InfiniBand management tool:
# tempo-configure-fabric |
Both of the preceding methods lead you to the same InfiniBand configuration page. On the InfiniBand configuration pages, Quit takes you to the previous screen.
Select A Configure InfiniBand ib0, and select Select.
On the Configure InfiniBand screen, select A Configure Topology, and select Select.
On the Topology screen, select the topology your system uses, and select Select.
The menu selections are as follows:
H HYPERCUBE
E EHYPERCUBE (Enhanced Hypercube)
F FAT TREE
G BFTREE
On the Configure InfiniBand screen, select B Master / Standby, and select Select.
On the Master / Standby screen, type the RLC identifiers for the master (primary) and the standby (backup, secondary) subnetwork, and select Select.
If you have only one rack leader controller (RLC), type r1lead in the MASTER field, and leave the STANDBY field blank.
If you have more than one RLC, specify different RLCs in the MASTER and STANDBY fields.
For example, Figure 3-13 shows a completed screen.
On the Configure InfiniBand screen, select Commit.
Wait for the comfirmatory messages to appear in the window before you continue.
The next few steps repeat the preceding steps, but this time you configure the ib1 interface.
On the InfiniBand Management Tool main menu screen, select B Configure InfiniBand ib1, and select Select.
On the Configure InfiniBand screen, select A Configure Topology, and select Select.
On the Topology screen, select the topology your system uses, and select Select.
Select the topology that exists on your system. The menu selections are as follows:
H HYPERCUBE
E EHYPERCUBE (Enhanced Hypercube)
F FAT TREE
G BFTREE
On the Configure InfiniBand screen, select B Master / Standby, and select Select.
On the Master / Standby screen, type the RLC identifiers for the master (primary) and the standby (backup, secondary) subnetwork, and select Select.
If you have only one rack leader controller (RLC), type r1lead in the MASTER field, and leave the STANDBY field blank.
If you have two RLCs, you can flip the specifications for ib1. For example, assume that for ib0, you specified MASTER as r1lead and STANDBY as r2lead. For ib1, you can specify MASTER as r2lead and STANDBY as r1lead.
If you have three or more RLCs, specify different RLCs in the MASTER and STANDBY fields.
On the Configure InfiniBand screen, select Commit.
Wait for the comfirmatory messages to appear in the window before you continue.
On the InfiniBand Management Tool main menu screen, select C Administer Infiniband ib0, and select Select .
On the Administer InfiniBand screen, select Start, and select Select.
On the Start SM master_ib0 on ib0 succeeded! screen, select OK.
Select Quit to return to the InfiniBand Management Tool main menu screen.
The next few steps repeat the preceding steps, but this time you start the ib1 interface.
On the InfiniBand Management Tool main menu screen, select D Administer Infiniband ib1, and select Select .
On the Administer InfiniBand screen, select Start, and select Select.
On the Start SM master_ib1 on ib1 succeeded! screen, select OK.
On the Administer InfiniBand screen, select Status, and select Select.
The Status option returns information similar to the following:
Master SM Host = r1lead Guid = 0x0002c9030006938b Fabric = ib0 Topology = hypercube Routing Engine = dor OpenSM = running |
Wait for the status messages to stop, and press Enter .
Select Quit on the menus that follow to exit the configuration tool.
Type the following command to retrieve the list of rack leader controller (RLC) IDs:
SAC:~ # cnodes --leader r1lead r2lead |
In the next few steps, you verify that the InfiniBand network is working.
Through an ssh(1) connection, log into a leader node.
For example:
SAC:~ # ssh r2lead |
Type the following command to retrieve the IDs of the compute nodes (blades):
r2lead:~ # cnodes --compute r2i0n0 r2i0n1 r2i0n2 r2i0n3 |
Type a ping(8) command to make sure that the RLC can reach its compute nodes.
For example:
r2lead:~ # ping -c1 r2i0n0 |
If the ping(8) is successful, the InfiniBand network is configured properly.
Type the following command to reenable InfiniBand switch monitoring:
% cattr unset disableIbSwitchMonitoring |
(Optional) Configure additional features.
The SGI ICE X system supports several optional features, for example, networking features such as network address translation. For information about how to configure optional features, see the following: