Microsoft Windows System Resource Manager Management Pack Guide

Archived content. No warranty is made as to technical accuracy. Content may contain URLs that were valid when originally published, but now link to sites or pages that no longer exist.

Published: October 1, 2003

Management Pack Guide

Developed by: Windows System Resource Manager Group

Click here to download a copy of this guide

On This Page

Acknowledgements
Recommended Knowledge and Skills
Overview of Windows System Resource Manager Management Pack
Configuration
Operations Tasks
Recommendations for the Network Operations Desk
Reference

Acknowledgements

Program Manager: Chris Stackhouse

Writer: Vilas Vyas

Technical Reviewers: Aruna Koka, Padmapriya A., Brenda Carter

Editor: Liz Bjorkman

To deploy and operate the Microsoft® Windows System Resource Manager (WSRM) Management Pack, it is recommended that you possess the following knowledge and skills:

  • Possess a working knowledge of the Microsoft Operations Manager (MOM) monitoring features.

  • Be familiar with the recommended process for deploying Management Packs as described in Chapter 7, “Deploying Management Packs” of the Microsoft Operations Manager 2000 SP1 Deployment Guide.

  • Be familiar with the basic concepts in Windows System Resource Manager (WSRM) such as process matching criteria, resource allocation policies, calendar, and accounting. For more information, refer to the WSRM Online Help.

Overview of Windows System Resource Manager Management Pack

The Windows System Resource Manager Management Pack for Microsoft Operations Manager 2000 (MOM) SP1 provides a monitoring and management system for the Windows System Resource Manager.

Overview of WSRM

WSRM is available to use with Microsoft Windows Server™ 2003, Enterprise Edition and Microsoft Windows Server™ 2003, Datacenter Edition. WSRM allows you to manage CPU and memory utilization on a per-process basis. An administrator sets targets for the amount of hardware resources that running applications or users are allowed to consume. This means that you can allocate resources among multiple applications on a server according to your business priorities.

Using WSRM, a system administrator can do the following:

  • Set CPU and memory allocation policies on processes. This includes selecting processes to be managed and setting resource usage targets or limits.

  • Manage CPU utilization.

  • Set CPU Affinity of Processes.

  • Limit the process working set size (physical resident pages in use).

  • Manage committed memory (pagefile usage).

  • Apply policies to users or groups on a server.

  • Apply policies on a date/time schedule.

  • Generate, store, view, and export resource-utilization accounting records that can be used to monitor resource usage, compare actual and expected performance, and provide the basis for charge-back accounting.

Overview of WSRM Management Pack

With WSRM Management Pack, MOM provides central monitoring and automatic problem notification for large networks, continuously monitoring the WSRM components. By using the WSRM Management Pack, an administrator can centrally monitor WSRM service running on various servers in an enterprise. You can monitor important indicators to discover and resolve minor problems before they develop into potentially lengthy service outages. This Management Pack helps administrators monitor the health of WSRM by providing public views and alerts.

  • Public views give information about the effectiveness of WSRM resource management.

  • Alerts notify the administrator of the failure of any configuration setting such as failure in setting a resource allocation policy, or failure to manage working set size.

For more information about MOM and other Management Packs, including installation and configuration, see www.microsoft.com/mom.

Windows System Resource Manager Management Pack Monitoring Scenarios

The WSRM Management Pack monitors events that WSRM places in the Application event log and generates alerts to notify the administrator of critical operational issues such as:

  • Low resource conditions

  • WSRM database issues

  • WSRM service errors

  • WSRM functionality errors

  • WMI errors

  • Errors in backing up the XML files that WSRM uses

  • Registry errors

The WSRM Management Pack also monitors key performance metrics such as Actual Managed CPU and Target Managed CPU to monitor the overall health of WSRM.

Windows System Resource Manager Components

This section provides an overview of WSRM components and how they are deployed.

WSRM Management Pack Deployment Topology

WSRM includes a server component and a client component. The WSRM Management Pack is deployed only on the server component. No other Management Packs are required to monitor WSRM.

WSRM Server Component

The WSRM server component includes the following sub-components:

  • WSRM service – A Microsoft Windows NT service that manages the CPU and memory usage based on the resource allocation policy specified.

  • XML data store – Configuration data such as process matching criteria, resource allocation policies, and calendar settings are stored in XML files as part of the XML data store.

  • Accounting store – Stores information about resource usage by processes. An administrator can use this information to generate reports of resource usage by user, by process, or by other properties of interest.

WSRM Client Component

The WSRM client component includes the graphical user interface and the command-line interface. The various actions that a system administrator can take, such as creating policies, identifying and classifying the processes to be managed into Process Matching Criteria, and scheduling different policies to be used with calendaring are exposed in both interfaces.

The WSRM client component can be used locally on the same server or by using a Remote connection with the client on a remote machine.

Terms and Definitions

This section includes a description of WSRM concepts and additional terms related to resource management. These terms are important to understand when you are working with WSRM.

WSRM Concepts

It is useful to understand the following main concepts in WSRM functionality when you deploy the WSRM Management Pack:

Process-matching criteria

Process matching is the mechanism that WSRM uses to match processes and then aggregate them into groups to be managed.

A process-matching criterion creates an association between a group of one or more matched processes and a resource-allocation policy. WSRM then manages all of the matched processes by using the resource allocations that are specified in the resource-allocation policy. If a CPU allocation is specified, the CPU resources are distributed evenly among all the processes in the group. If a memory limit is specified, the limit is applied on a per-process basis to all the processes in the group.

Resource-allocation policies

A resource-allocation policy is the mechanism that specifies the resource usage for a process.

After you have defined a process-matching criterion, you can create a resource-allocation policy to determine the resource usage for all processes that are matched by the process-matching criterion. A resource-allocation policy includes one or more resource allocations. A resource allocation consists of a process-matching criterion and one or more allocations, such as a CPU consumption target, memory limit, or processor affinity. The allocations that are specified apply to all processes that match the process-matching criterion associated with the resource allocation.

The default group

All running processes that are not matched by any of the process-matching criteria in the managing resource-allocation policy are grouped in the default group.

The default group is allocated any CPU bandwidth that is unaccounted for by the managing policy. This bandwidth is distributed on an equal-shares basis. This ensures that all processes in the default group get an equal proportion of the total CPU bandwidth that is available to the default group. WSRM enforces this allocation, which prevents one or more applications in the default group from using more than their share of CPU bandwidth.

WSRM calendar

You can use the WSRM calendar to schedule the automatic activation of a resource-allocation policy. You can specify that this be done on a one-time or recurring basis. You can also use a resource-allocation policy schedule to manage multiple resource-allocation policies and to activate different resource-allocation policies at different, predefined times over a 24-hour period.

When the calendar is enabled, the default calendar resource-allocation policy is in effect. You can configure the calendar default policy to be a user-defined resource-allocation policy or use the default calendar default resource-allocation policy. If no specific policy is scheduled, WSRM applies the default calendar resource-allocation policy. You can also configure the calendar to use a resource-allocation policy other than the default policy for times when no other policies are scheduled.

WSRM accounting

WSRM provides a mechanism you can use to store and retrieve records of the behavior of managed processes. With this information, you can easily generate reports of process-resource usage on a per-user basis, per-application basis, or other relevant basis. You can also use the accounting data to provide charge-back accounting.

Accounting information is stored in an accounting database for easy administration and retrieval. This information is logged in the accounting database at a default interval of ten minutes. You can modify this interval. The interval determines the frequency with which the data for all managed processes on the computer managed by WSRM is logged to the accounting database.

Additional Resource Management Terms

It is useful to know the following terms and definitions when you deploy the WSRM Management Pack:

Committed memory

Memory that either resides in the working set of a process or maps to a disk file, such as a page file or other memory-mapped file.

CPU affinity

On systems with more than one processor, CPU affinity specifies the set of processors on which the process is allowed to run exclusively. Also known as processor affinity.

CPU allocation

An allocation of CPU bandwidth that is specified in a resource-allocation. The CPU bandwidth is then allocated to the managed processes matched by the process-matching criterion in the resource-allocation.

CPU bandwidth

The rate of usage of the CPU over time. WSRM represents CPU resources as a percentage from 0 through 100.

CPU consumption target

Specifies a target CPU bandwidth for the resource allocation. WSRM uses a dynamic process priority management algorithm to try to limit the CPU usage of a process to the target specified in the matching resource-allocation policy of that process.

Managed CPU

The total CPU resources being used by managed processes.

Managed process

All running processes, except those on the system-defined exclusion list or the user-defined exclusion list.

Managing policy

The resource-allocation policy that controls how the computer is managed when WSRM is running in the managing state. If no user-defined resource-allocation policy is specified, WSRM uses the default resource-allocation policy.

Process

An object created by Windows to represent a running program. This includes the virtual address space and the control information necessary for the program to run.

Process priority

A value for each process that helps determine how CPU resources are used. WSRM dynamically changes the priority of a process so the CPU usage matches the targeted CPU allocation for that process.

Resource allocation

The part of a resource-allocation policy that specifies the set of resources to be allocated to the processes that match a given process-matching criterion.

Working set

The amount of physical memory assigned to a process by the operating system.

Configuration

This section provides some notes about agent configuration. It also provides an option for filtering events to reduce the impact of the Management Pack if it is being used over slow network connections or for other scenarios.

Agent Log File Settings

The general recommendations for configuring agent logs (included in Chapter 6, “Deploying Agents”, of the Microsoft Operations Managemer 2000 SP1 Deployment Guide) are sufficient for use with the WSRM Management Pack.

Configuring the Filtering of WSRM Monitoring Data

There are several cases in which you might decide not to collect warnings, performance data, and miscellaneous noncritical events. These include:

  • Deployments across satellite links.

  • Large branch office deployments.

  • Deployments with very slow WAN links.

  • Deployments where alerts are forwarded to a global network operations center.

  • Warnings and informational messages that are not needed.

You can filter events that you do not want to be notified about. First, you must create a folder to hold the new filter rules, and then you must add the filter rules.

In addition, you might decide to disable certain performance data to decrease traffic. After making changes, you must commit changes to the system.

**Note :**WSRM Management Pack public views do not operate if performance data gathering is disabled.

If you have not already created a folder to hold new rules, you can do it now.

To create a folder for filter rule

  1. In the MOM Administrative console, navigate to Rules\Processing Rule Groups.

  2. Right-click Microsoft Windows System Resource Manager (enabled), click New, and then click New Processing Rule Group.

  3. Enter a name and description for the new folder.

  4. Ensure that the Enabled check box is selected.

  5. Click Next, and then click Finish.

  6. When a dialog box appears with the message, “Would you like to deploy the processing rules in this newly created processing rule group to a group of computers?” click Yes.

  7. On the Processing Rule Group Properties dialog box, click Add.

  8. Select Microsoft Windows System Resource Manager, and then click OK.

To create filtering rules

  1. In the MOM Administrative console, navigate to Rules\Processing Rule Groups\Microsoft Windows System Resource Manager.

  2. Expand the processing rule group you just created.

  3. Right-click Event Processing Rules, and then click Filter Events

  4. Verify that the provider name is Application, and then click Next.

  5. Click Advanced.

  6. In the Advanced Criteria dialog box, set the following parameters:

    • In the Field box, select Event Type.

    • In the Condition box, select is less than.

    • In the Value box, select Error.

  7. Click Add to list

  8. Click Close, and then click Next

  9. In the Filter Processing Rule Properties - Schedule dialog box, in the drop-down box click Always process data, and then click Next.

  10. In the Filter Processing Rule Properties - Filter dialog box, select Do not evaluate further processing rules, nor insert matching events into the database (Pre-Filter), and then click Next.

  11. In the Company Knowledge Base dialog box, click Next.

  12. In the Filter Processing Rule Properties - General dialog box, in the Name box type a name for the filter (for example, Filter non-error Application events).

  13. Click Finish.

To apply new filtering rules

  1. In the MOM Administration console, navigate to Rules\Processing Rule Groups.

  2. Right-click Rules, and then click Commit. These changes take approximately 10 minutes to take effect

To disable performance data

  1. In the MOM Administration console, navigate to Rules\Processing Rule Groups\Microsoft Windows System Resource Manager (enabled)\Windows System Resource Manager (enabled)\Generic (enabled).

  2. Click Performance Processing Rules.

  3. For each rule of type Measuring in the right pane, perform the following steps to disable the rule:

    • Double-click the rule.

    • Make sure the Enabled check box is cleared, and then click OK.

  4. In the MOM Administration console, navigate to Rules\Processing Rule Groups.

  5. Right-click Rules, and then click Commit. These changes take approximately 10 minutes to take effect.

Operations Tasks

You must triage all alerts on a daily basis. In addition, you must perform other tasks on a regular basis, depending on your environment. Many important problems do not cause alerts, but they still require periodic attention.

You can perform the daily and weekly tasks as specified in this section, but you must adjust the frequency of these tasks to meet the needs of your particular environment.

Daily Tasks

On a daily basis, perform the following tasks:

  • Review all open alerts.

  • Verify that all domain controllers are communicating with the MOM Administrator console.

  • Review warnings. (Optional)

Reviewing All Open Alerts

Triage all new alerts in the following order of priority:

  • Critical errors

  • Errors

  • Warnings (optional)

  • Information (optional)

To review open alerts

  1. In the MOM Administrator console, click Monitor, All Open Alerts.

  2. Review all alerts that are older than 24 hours to make sure they are addressed in a timely manner.

Verifying That All Domain Controllers Are Communicating with the MOM Administrator Console

Communication failure between the domain controller and the monitoring infrastructure prevents you from receiving alerts.

To verify that domain controllers are communicating with the MOM Administrator console

  1. In the MOM Administrator console, click Monitor, All Agents.

  2. In the details pane, click the Last Contact column header. This sorts all computers based on the last contact time. If the last contact time is greater than five minutes, troubleshoot why the computer is not communicating with MOM.

Weekly Tasks

To determine the effectiveness of WSRM in managing the server resources (CPU and memory), review the alerts generated during the week and categorize them into the following areas:

  • Accounting

  • Configuration

  • Generic

  • Low resource conditions

This also provides an indication of the common error conditions occurring.

Recommendations for the Network Operations Desk

The Network Operations Desk can participate in monitoring WSRM by performing the daily and weekly tasks that are described in the “Operations Tasks” section of this guide. Although all alerts and error conditions require the attention of a WSRM administrator, the Network Operations Desk can try to resolve the problem based on the Knowledge Base information provided in the WSRM Management Pack for that alert or error condition.

Based on your monitoring requirements, you can reduce the volume of monitoring data that is sent to the Network Operations Desk by filtering monitoring data. For more information, see the “Configuring the Filtering of WSRM Monitoring Data” section earlier in this guide.

Reference

This section describes some of the contents of the WSRM Management Pack.

Reports

The WSRM Management Pack does not provide reports.

Computer Groups

The WSRM Management Pack has the following computer group:

Microsoft Windows System Resource Manager

This computer group has the following processing rule group(s) associated with it:

Microsoft Windows System Resource Manager \ Windows System Resource Manager

With this computer group to processing rule group association all the rules that are in the Microsoft Windows System Resource Manager \ Windows System Resource Manager processing rule group and child processing rule groups are deployed to computers that are running Windows System Resource Manager on Windows Server 2003.

Processing Rules

The WSRM Management Pack contains the following processing rule groups. These are contained in the following location:

Rules \ Processing Rule Groups \ Microsoft Windows System Resource Manager \ Windows System Resource Manager

Processing Rule Group: Accounting

The Accounting processing rule group contains Event processing rules for errors related to Accounting.

Table 1 Event Processing Rules in the Accounting Processing Rule Group

Rule

Default alert severity

WSRM - Database location contains non-English characters

Critical Error

WSRM - Error Initializing Accounting thread

Critical Error

WSRM - Error Initializing database

Critical Error

WSRM - Running low on disk space

Alert Disabled

WSRM - Unable to log accounting information

Alert Disabled

WSRM - WMI inaccessible

Critical Error

Processing Rule Group: Configuration

The Configuration processing rule group contains Event processing rules for Configuration related errors.

Table 2 Event Processing Rules in the Configuration Processing Rule Group

Rule

Default alert severity

WSRM - Backup Operation Failed

Alert Disabled

WSRM - Backup XML files missing

Alert Disabled

WSRM - Calendar Default Policy not found

Alert Disabled

WSRM - Error accessing xml file

Critical Error

WSRM - Managing policy not found

Alert Disabled

WSRM - Registry Access Error

Alert Disabled

WSRM - Registry Value Error

Alert Disabled

WSRM - Resource Allocation Policy could not be set as managing policy

Critical Error

WSRM - Unable to start WSRM

Critical Error

WSRM - User Name not logged

Alert Disabled

WSRM - WSRM could not start

Critical Error

WSRM - WSRM not supported

Security Breach

WSRM - XML files missing

Alert Disabled

Processing Rule Group: Generic

The Generic processing rule group contains Event processing rules for generic errors.

Table 3 Event Processing Rules in the Generic Processing Rule Group

Rule

Default alert severity

WSRM - Cannot locate user name

Alert Disabled

WSRM - Could not apply maximum working set size

Alert Disabled

WSRM - Process exceeded committed memory limit and terminated

Alert Disabled

WSRM - Unable to start WSRM Calendar

Critical Error

WSRM - Unable to terminate process

Alert Disabled

WSRM - WSRM could not start and timed out

Alert Disabled

WSRM - WSRM managing processes when there is no CPU contention

Alert Disabled

The Generic processing rule group also contains the following Performance processing rules:

Table 4 Performance Processing Rules in the Generic Processing Rule Group

Rule

Enabled/disabled

WSRM: PMC - Actual Managed CPU

Enabled

WSRM: PMC - Page File Size

Enabled

WSRM: PMC - Process Count

Enabled

WSRM: PMC - Target Managed CPU

Enabled

WSRM: PMC - Working Set Size

Enabled

Processing Rule Group: Low Resource Conditions

The Low Resource Conditions processing rule group contains Event processing rules for events related to Low Resource Conditions.

Table 5 Event Processing Rules in the Low Resource Conditions Processing Rule Group

Rule

Default alert severity

WSRM - Unable to allocate memory

Critical Error

WSRM - WSRM counters cannot function properly

Critical Error

Scripts

The WSRM Management Pack does not include any scripts.

Default Views

The WSRM Management Pack provides a default set of public views. The public view taxonomy is:

Public Views

Microsoft Windows System Resource Manager

Alerts

Attributes

Computer Groups

Computers

Events

Performance

WSRM: PMC - Actual Managed CPU

WSRM: PMC - Page File Size

WSRM: PMC - Process Count

WSRM: PMC - Target Managed CPU

WSRM: PMC - Working Set Size