Step 3: Configure the cluster

fter you have deployed the head node (or head nodes) of your HPC cluster, you must configure the cluster by following the Deployment To-do List in HPC Cluster Manager. This section provides procedures for completing the required deployment tasks and several optional tasks that are needed for adding on-premises nodes. For information about other deployment tasks, see later sections of this guide and the topics in Configuration: HPC Cluster Manager.

At a minimum you must complete the required deployment tasks. Depending on how you will be adding nodes to your cluster and the types of jobs you will run, you might need to complete one or more of the optional deployment tasks.

3.1: Configure the HPC cluster network

The HPC cluster network configuration is the first step in the configuration process of your head node. The HPC cluster network is configured by following the Network Configuration Wizard in HPC Cluster Manager. When configuring the HPC cluster network, choose the network topology that you have planned for your cluster, as described in Step 1: Prepare for your deployment, earlier in this guide.

Important

Before you start configuring the HPC cluster network in HPC Cluster Manager, ensure that the head node and the computers that you will add as nodes to the cluster are physically connected according to the network topology that you have chosen for your cluster. Also, ensure that you are able to identify to which network each one of the network adapters in the head node is connected.

To configure the HPC cluster network

  1. If HPC Cluster Manager is not already open on the head node, open it.

  2. In the Deployment To-do List, click Configure your network. The Network Configuration Wizard appears.

  3. On the Network Topology Selection page, click the topology that you have chosen for your cluster, and then click Next.

  4. On the Enterprise Network Adapter Selection page, in the Network adapter list, click the name of the network adapter that is physically connected to your enterprise network, and then click Next.

    Important

    To ensure that you are selecting the correct network adapter, use the information displayed on this wizard page after you select a network adapter from the list. Use the IP address, domain information, and Media Access Control (MAC) address of each adapter as a reference.

  5. If you chose topology number 5 for your cluster, skip to step 9 in this procedure. Otherwise, repeat step 4 for the private network adapter.

  6. On the Private Network Configuration page, type a static IP address and a subnet mask for the head node. Optionally, select network services for that network:

    a. To give access to resources on the enterprise network to nodes that are connected to this network, select the Enable network address translation (NAT) on the head node check box.

    b. To enable DHCP services for the nodes that are connected to this network, select the Enable DHCP and define a scope check box, and then type the starting and ending IP addresses for the DHCP scope. If the Gateway and DNS server IP addresses have not been automatically detected, type each of these addresses.

    For more information about enabling NAT and DHCP on your cluster network, see "HPC network services" in Appendix 1: HPC Cluster Networking.

  7. Click Next after you are done configuring the private network.

  8. Repeat steps 4, 6, and 7 for the application network adapter. Click Next after you are done configuring the application network.

  9. On the Firewall Setup page, select the firewall setting for the cluster:

    a. To apply firewall settings automatically to the nodes on each network, click ON for that network.

    b. To disable the firewall on a network, click OFF.

    c. If you do not want to change any firewall settings, click Do not manage firewall settings.

    For more information about firewall settings for your cluster, see "HPC network services" in Appendix 1: HPC Cluster Networking.

  10. On the Review page, verify your settings and click Configure. If you want to change any of the settings, navigate to the appropriate wizard page by clicking it on the navigation pane or by clicking Previous.

  11. After the network configuration process is completed, on the Configuration Summary page, review the list of configuration items. If you want to save a report of the network configuration, click Save the configuration report.

  12. To close the wizard, click Finish.

3.2: Provide installation credentials

Note

iSCSI node deployment is not supported from HPC Pack 2016 on.

Installation credentials must be provided to add new on-premises nodes to the cluster. These credentials will be used when installing the operating system, installing applications, and adding nodes to the Active Directory domain.

To provide installation credentials

  1. In the Deployment To-do List, click Provide installation credentials. The Installation Credentials dialog box opens.

  2. Type the user name, including the domain (DOMAIN\User), and then the password for the domain user account you will use to deploy nodes. Note the following considerations for this account:

    • The account must be a domain account with enough privileges to create Active Directory computer accounts for the nodes and to join the nodes to the domain. If the policies of your organization restrict you from using a domain account that can add new computers to the domain, you will need to ask your domain administrator to pre-create the computer objects for you in Active Directory Domain Services before you deploy your nodes.
    • If part of your deployment requires access to resources on the enterprise network, the account should have the necessary permissions to access those resources.
    • If you want to restart nodes remotely by using HPC Cluster Manager, the account must be a member of the local Administrators group on the head node. This requirement is only necessary if you do not have scripted power control tools that you can use to remotely restart the nodes.
    • Active Directory Domain Services limits the number of computers that an authenticated user can join to the domain concurrently (the default is 10 computers). If you need to increase this number to configure new nodes, you (or your domain administrator) can use the Adsiedit tool to edit the ms-DS-MachineAccountQuota property in the domain. For information about how to do this, see article 24337 in the Microsoft Knowledge Base.
  3. To save the specified credentials, click OK.

3.3: Configure the naming of new nodes

If you will deploy Azure IaaS nodes with an Azure IaaS node template, or deploy nodes from bare metal, and you are not using a node XML file to import nodes to the cluster, computer names are automatically generated for the new nodes that are deployed. You must specify how those names will be generated, by defining a naming series.

The naming series is defined by selecting a root name and the starting number that will accompany that name. The starting number is enclosed in percentage signs (%).

For example, if you deploy three nodes after specifying the following naming series: Node-%100%, those nodes will be assigned these names:

  • Node-100
  • Node-101
  • Node-102

Considerations for the naming series

  • Node names are limited to 15 characters. When specifying the node naming series, consider the number of nodes in your deployment and ensure that the series that you specify will not generate names that exceed 15 characters. For example, if your deployment will consist of 1,000 nodes, and your starting number is 1, your root name cannot have more than 12 characters; otherwise, your node number 1,000 will need a name that consists of 16 characters.
  • Unlike the naming series for compute nodes that are deployed from bare metal, the naming series for Azure "burst" nodes cannot be configured. Azure nodes are numbered consecutively in a naming series that begins with the root name AzureCN. To avoid potential confusion, do not configure the naming series for compute nodes with the root name AzureCN.

To specify the node naming series

  1. In the Deployment To-do List, click Configure the naming of new nodes. The Specify Node Naming Series dialog box opens.
  2. Type the naming series that you want to use. The preview in the Specify Node Naming Series dialog box helps you to see an example of how the naming series will be applied to the names of the nodes.
  3. To save the node naming series that you have specified, click OK.

3.4: Import or create a certificate for deployment

You need to import or create a certificate to either deploy nodes from bare metal or manually add nodes to the cluster. In the Deployment To-do List, click Import a certificate for deployment.

  • If you plan to use one single certificate across all HPC nodes or you have already prepared the Certificate for other nodes in Step 1.8, select Import the certificate from a PFX file, and Browse to select the PFX file and input the Protection Password, and then click OK to import the certificate.

  • If you don't have the Certificate for other nodes yet, you can select Create a new self-signed certificate to generate a new self-signed certificate.

The certificate will be placed in the Certificates folder in your installation share with the file name HpcCnCommunication.pfx.

3.5: Create a node template (optional)

Node templates define the necessary tasks for configuring and adding nodes to your cluster. For example, with a compute node template, you can deploy an operating system image, add specific drivers and software to nodes, add a preconfigured node to your cluster, or apply software updates to your nodes.

Node template types

HPC Pack 2019 has three built-in node templates which cannot be modified:

  • HeadNode Template - All the head nodes are automatically assigned to this template.
  • LinuxNode Template - All the Linux nodes are automatically assigned to this template.
  • NonDomain ComputeNode Template - All the non-domain joined compute nodes are automatically assigned to this template.

Because you might have more than one type of node, or you may be adding nodes to your cluster in different ways, you can create different templates that apply to different nodes or situations.

Depending on the type of nodes that you want to add to your cluster, you can create four types of node templates:

  • Compute node template - Use to add compute nodes to the cluster. HPC Pack will create a default compute node template named Default ComputeNode Template.
  • Broker node template - Use to add broker nodes to the cluster.
  • Workstation node and unmanaged server node template - Use to add workstation nodes or unmanaged server nodes and to specify if you want these nodes to be brought online and offline according to a weekly availability policy or manually.
  • Azure node template - Use to add Azure nodes (worker role instances) to expand your cluster capacity, using accounts and features you have created or added to your Azure subscription. You specify if you want these nodes to be brought online and offline according to a weekly availability policy or manually.
  • Azure Batch Pool template - Use to add an Azure Batch pool to expand your cluster capacity.
  • Azure IaaS node template - Use to add Azure IaaS nodes to expand your cluster capacity if your cluster is entirely in Azure or is a hybrid cluster with a site-to-site VPN between Azure and your organization's network.

Compute node templates and broker node templates are further divided into two types, depending on how you want to add those types of nodes to your cluster:

  • With an operating system image (not currently supported) - This type of template includes a step to deploy an operating system on the compute nodes or broker nodes. Use this type of template when adding nodes from bare metal.
  • Without an operating system image - This type of template is used to add preconfigured compute nodes and broker nodes to the cluster.

To create a node template

  1. In the Deployment To-do List, click Create a node template. The Create Node Template Wizard appears.

  2. On the Choose Node Template Type page, choose the type of node template that you want to create. The remaining steps in this procedure assume that you choose one of the following: Compute node template, Broker node template, or Workstation node or unmanaged server node template. (If you want to create an Azure node template, see the topics in Configuring an Azure Node Template for Microsoft HPC Pack.)

  3. On the Specify Template Name page, type a descriptive name for the template, and, optionally, a description, and then click Next.

  4. If you are creating a compute node template or a broker node template, do the following:

    a. On the Select Deployment Type page, click Without operating system, and then click Next.

    b. On the Specify Windows Updates page, specify if you want to add a step in the template to download and install updates by using Microsoft Update or the enterprise Windows Server Update Services (WSUS). Also, you can specify specific updates to be added to the template. Click Next to continue. Go to the last step in this procedure.

  5. If you are creating a workstation node or unmanaged server node template, on the Configure Availability Policy page, select how you want workstation nodes to be brought online and offline:

    a. If you want workstation nodes to be brought online and offline manually, select that option, and then click Next. Go to the last step in this procedure.

    b. If you want workstation nodes to be brought online and offline automatically, select that option, and then click Configure Availability Policy. The Configure Workstation Availability Policy dialog box appears.

    c. In the Configure Workstation Availability Policy dialog box, on the Availability Policy tab, click and drag the mouse to select the days and hours at which nodes are Online (available to run cluster jobs) and Offline (not available to run cluster jobs).

    d. Optionally, specify the number of minutes before the workstation nodes are taken offline, when no new jobs will be started on the workstation nodes.

    e. Optionally, on the User Activity Detection page, configure user activity detection settings to restrict the workstation nodes and unmanaged server nodes that are brought online during an online time block.

    f. To save your settings, click OK, and then click Next.

  6. On the Review page, click Create.

Tip

The node templates that you create with the Create Node Template Wizard include the most common deployment and configuration tasks. You can add more tasks to the node templates that you create, or you can modify the template settings by using the Node Template Editor. For more information, see Appendix 3: Node Template Tasks and Properties.

Important

If you edit a node template at the same time that nodes are being deployed with that node template, the changes that you make will be implemented immediately after you save your changes, affecting any nodes that are waiting to be deployed with that template. For that reason, before you open a node template for editing, determine if there are any nodes being deployed or waiting to be deployed, and if there are, ensure that none of those nodes use the template that you want to edit.

3.5: Add users (optional)

If you will be giving cluster access to other members of your organization, you need to add their domain user accounts or domain groups to the cluster in a role such as HPC cluster users or HPC cluster administrators. For example, to submit jobs to the cluster or to perform diagnostic tests, a domain user must be at least an HPC cluster user. HPC cluster administrators have permissions to manage all aspects of the cluster. For information about other roles, see Managing Cluster Users.

Considerations for adding users

  • As a best practice, you can create domain groups to manage the different cluster roles. Then, add each domain group to the appropriate cluster role.
  • An HPC cluster administrator is also added as a member of the local Administrators group and the HPCAdminMirror group on the head node computer.
  • An HPC cluster user is also added as a member of the local HPCUsers group on the head node computer.

To add cluster users and administrators

  1. In the Deployment To-do List, under Optional deployment tasks, click Add or remove users.

  2. To add an HPC cluster user:

    a. In the Actions pane, click Add User. The Select Users or Groups dialog box appears.

    b. Type the name of the domain user or group that you want to add as an HPC cluster user, and then click Check Names. For more information, in the Select Users or Groups window, click examples.

    c. Repeat the previous step for all HPC cluster users that you want to add.

    d. After you are done adding HPC cluster users, click OK.

  3. To add an HPC cluster administrator:

    a. In the Actions pane, click Add Administrator. The Select Users or Groups dialog box appears.

    b. Type the name of the domain user or group that you want to add as an HPC cluster administrator, and then click Check Names. For more information, in the Select Users or Groups window, click examples.

    c. Repeat the previous step for all HPC cluster administrator that you want to add.

    d. After you are done adding HPC cluster administrators, click OK.

Next step