Chapter 7: Support
Overview
Services are perceived by the user through the lens of the desktop. At the same time, desktop operating systems and their related applications generate the largest number of support calls. Therefore, providing both a quality desktop experience and a high level of customer support is critical to maintaining IT's alignment to the business.
The IT manager and the Service Desk Manager should work together to ensure that the support desk is capable of meeting service level agreements (SLAs) and that customer expectations are satisfied. To accomplish this, the following actions need to occur:
- Ensure support SLAs are in place.
- Ensure proper escalation procedures and triggers are in place.
- Identify training and awareness requirements for staff and users as a means to reduce call volume.
- Conduct frequent reviews of support processes and call volumes to identify trends.
Scenario...
Now that the Windows Vista desktops have been successfully deployed, Gurinder Singh, the Service Desk manager at Woodgrove Bank, is aware that her team will need to focus on providing proactive and comprehensive support for the business.
Providing Effective Windows Vista Support
Scenario...
April Reagan is the first line support for end users at Woodgrove National Bank. Due to the excessive call volume, many service requests are forwarded to the desktop support manager in New York corporate headquarters. Insufficient call details result in having to contact customers repeatedly to collect more information. Moreover, VIP on-site support is an issue since there is a staff shortage. The service desk manager, Gurinder Singh, has received many customer complaints regarding the level of support and delays in fulfilling VIPs' requests. Gurinder has assigned April the task of improving the process of handling incidents and providing effective support by initiating a service improvement project to be completed by the next SLA Review.
To operate Windows Vista-based desktops as a service, processes should be in place to handle daily service requests and incident calls. This is taken care of by the Microsoft Operations Framework (MOF) Incident Management and Problem Management processes. Normal service operation is defined as a service operation within service level agreement (SLA) limits.
The objectives of Incident Management are to:
- Restore service as quickly as possible, even if it requires a Windows Vista restart or desktop replacement.
- Minimize the adverse impact of incidents on business operations.
- Maintain the best possible levels of service quality and availability.
- Ensure that all incidents and service requests are processed consistently.
- Direct support resources where they are most needed.
Repeatedly occurring incidents should be analyzed and addressed using the MOF Problem Management service management function. The goal of problem management is to minimize the adverse impact on business operations due to incidents and problems caused by errors within infrastructure and to prevent the recurrence of these incidents. In order to achieve this goal, problem management seeks to establish the root cause of incidents and then initiate actions to improve or correct the situation.
The objectives of Problem Management are to:
- Identify and take ownership of problems affecting the service.
- Proactively take steps to reduce the impact of incidents and problems.
- Identify the root cause of problems.
- Initiate activity aimed at establishing workarounds or permanent solutions.
- Perform trend analysis, using recorded problem and incident data:
- To predict future problems.
- To prioritize problem management activity.
Table 7.1. Summary of Incident and Problem Management Processes
Incident Management |
Problem Management |
|
|
Relationships Between Incident and Problem Management Processes
The following figure shows the relationships between Incident Management and Problem Management processes and how they interact with each other.
Figure 7.1. Relationship between Incident Management and Problem Management processes
The table below summarizes the inputs required to perform major tasks in Incident Management and Problem Management processes and their related outputs.
Table 7.2. Inputs and Outputs in Incident Management and Problem Management Processes
Inputs
Process
Outputs
- Incident details reported by Service Desk
- Configuration details from SMS Server 2003
- Response from incident matching against problems and known errors
- Resolution details
- A central repository of desktops using Event Forwarding and Subscription features of Windows Vista
- Completed changes on Windows Vista desktop service as a result of a Request for Change (RFC)
- Service Level Agreement Details (SLA)
- Incident Management
Updated Incident Record with the solution or workaround
Resolved and closed incidents
Communication to customers
Management information/reports
Possible Request for Change (RFC) to be processed by Problem Management
Incident details
Configuration details
Any defined workarounds (Incident Management)
Proactive known errors (KEs) from the Release Responsibility cluster
Problem Management
- Known errors (knowledge base)
- Request for Change
- Updated Problem Record
- Closed problem record for a resolved problem
- Response from incident matching to problems and known errors
- Management information
- Needs identification (training, documentation, and so on)
Incident Management in Windows Vista
Typical incidents could include:
- A desktop service being unavailable.
- Software corruption.
- A hardware failure.
- The detection of a virus.
- An update (patch) causing failures in the service operation.
Incident Management Resolution Process
Figure 7.2 illustrates the steps in the incident management resolution process.
Figure 7.2. Windows Vista Desktop incident management process
Detection, Self-Service, and Recording
This process deals with the initial detection of incidents through a variety of different methods. Windows Vista includes built-in diagnostics that can automatically detect and diagnose common support problems and then help users resolve the problems themselves. Some problems that Windows Vista diagnostics address are:
- Failing disks
- Degraded performance
- Lack of network connectivity
- Failure to shut down properly
Figure 7.3 is an example of a Windows Vista proactive alert to prevent information loss due to low memory status. For more information about Windows Vista built-in diagnostics, please refer to https://www.microsoft.com/windows/products/windowsvista/features/details/builtindiagnostics.mspx.
Figure 7.3. Resource exhaustion prevention
This alert will be logged in the Windows Vista Events repository, and the Events forwarding feature can be configured to forward them to a central server/repository. The problem manager in turn can proactively analyze the recurrence of such events and their related applications and produce an action plan for resolving them permanently.
Classification and Initial Support
The classification process categorizes the incident and uses information on impact and urgency to determine the priority of the incident.
Investigation and Diagnosis
This process deals with the investigation of the incident and the gathering of diagnostic data. The goals of the process are to identify how the incident can be resolved as quickly as possible, based on SLA targets. This is achieved by checking incidents against known errors, existing problems, and previous incidents in order to identify documented workarounds.
IT departments can add custom content to User Assistance—the Windows Vista version of help files—to provide answers to questions about custom applications and internal network resources. With good user assistance content, most reported incidents (or service requests) can be either resolved by users themselves or by first-tier of support, thereby eliminating escalation.
Examples include:
- Tips and tricks on various Windows, Office, and LOB applications.
- How to create an Outlook calendar meeting request.
- How to add the closest printer to your location.
- How to download hardware drivers for your desktop from internal resources.
User Assistance can also be customized to link users directly to an internal support center. For more information on how to customize help content to meet your needs, visit http://technet2.microsoft.com/Windows Vista/f/?en/library/bcffdbb6-82df-4a0f-b215-759a1ecfbdd61033.mspx.
Windows Vista Error Reporting is a feature that allows Microsoft to track and address errors relating to the operating system, Windows features, and applications. This feature gives users the opportunity to send data about errors to Microsoft and to receive information about solutions. Solution information can include the following:
- A link to the Windows Update Web site.
- A link to updated drivers and patches.
- A link to Microsoft Knowledge Base articles.
Resolution and Recovery
This process deals with finding and applying the required fix to resolve the incident. Following successful resolution of an incident, such as replacement of a faulty hard disk, there may be recovery actions that need to be taken, such as the restoration of user data.
Closure
This process ensures that the customer is satisfied that the incident has been resolved prior to closing the incident record. This is also where the incident record is fully updated and assigned a closure category. Finally, to ensure customer satisfaction with the support experience, a follow-up survey should be sent to users to evaluate the overall experience.
Scenario...
Now that April understands how to apply Incident Management processes to the service improvement project, she works within her team to identify the responsibilities for each team member to align with MOF Incident Management best practices.
Principal roles and their associated responsibilities for Windows Vista incident management have been defined according to industry best practices. It is important to remember that these are roles rather than job descriptions.
The roles required within the incident management process are:
Incident manager. Responsible for the end-to-end ownership of the Windows Vista desktop incident management process. Usually, the Service Desk manager takes on this role.
Service desk analyst. Responsible for many tasks including (but not limited to) incident recording, initial support and classification, escalating, resolution and recovery of incidents, and closure. The call screeners normally take on this role.
Support technicians. Responsible for providing assistance to end users.
Problem Management in Windows Vista
A difficult area of Problem Management for most organizations is the root cause analysis of problems. Often, visual techniques are employed in assisting support personnel with their investigation.
Performing Root Cause Analysis for Windows Vista
To perform root cause analysis
Define the problem (Effect).
Gather data/evidence (Categorize).
Identify issues that contributed to the problem (Primary/secondary causes).
Find root cause (Process of elimination).
Develop solution recommendation (Change Request).
Implement solution (Change Implemented).
Fishbone Diagram
One tool that is useful in visually diagramming this process is the Ishikawa, or fishbone diagram, shown below.
Figure 7.4. Fishbone diagram
Example Problem: Display Flickering
Scenario...
A high call volume support issue has been identified. A number of Linda's users with CRT monitors have been complaining that their monitors have been flickering intermittently after upgrading to Windows Vista, and most often when waking up from Sleep or Hibernate modes. Linda suspects that it may be a video card issue, but since it is intermittent, she decides to dig deeper to ensure proper resolution.
After gathering additional data, Linda compiles a list of likely issues, listed below in no particular order:
- Electrical interference from other devices
- Monitor driver
- Video card driver
- Faulty cabling
- Out-of-date video card driver
- Dirtypower
- Motherboard incompatibility
- Faulty video card
- Faulty monitor
- Unsupported monitor resolution
- Refresh rate mismatch
After evaluating the list, Linda concludes that these possible problems seem to fall into three separate categories.
Table 7.3. Problem Categories
Video Card |
Monitor |
Other |
Resolution unsupported |
"Dirty" power |
Electrical interference |
Refresh rate mismatch |
Faulty cabling |
Motherboard incompatibility |
Driver faulty/out of date |
Monitor drive faulty/out of date |
|
Video card faulty |
Monitor faulty |
Finding Root Cause
Scenario...
Now that Linda has a pretty good idea of what may be contributing to the monitor flickering issue, she decides to start with the most plausible root cause, the Video Card category. She moves along the fishbone to research the following primary causes:
- Refresh rate mismatch
- Unsupported resolution
- Driver
- Faulty video card
Since the problem is intermittent, she decides to set up a Data Collector with the new Windows Vista Reliability Monitor and Performance Monitor tools.
To create a Data Collector Set from Performance Monitor
Start Performance Monitor, and add counters to create a custom view you want to save as a Data Collector Set.
Right-click anywhere in the Performance Monitor display pane, point to New, and click Data Collector Set. The Create New Data Collector Set Wizard starts. The Data Collector Set created will contain all of the data collectors selected in the current Performance Monitor view.
Type a name for your Data Collector Set, and then click Next.
The Root Directory will contain data collected by the Data Collector Set. Change this setting if you want to store your Data Collector Set data in a different location than the default. Browse to and select the directory, or type the directory name.
Note If you enter the directory name manually, you must not enter a backslash at the end of the directory name.
Click Next to define a user for the Data Collector Set to run as, or click Finish to save the current settings and exit
Note If you are a member of the Performance Log Users group, you must configure Data Collector Sets that you create to run under your own credentials.
Click Finish to return to Windows Vista Performance Monitor.
To view the properties of the Data Collector Set or make additional changes
Select Open properties for this data collector set. You can get more information about the properties of Data Collector Sets by clicking the Help button in the Properties page.
To start the Data Collector Set immediately (and begin saving data to the location specified in Step 4)
Click Start this data collector set now.
To save the Data Collector Set without starting collection
Click Save and close.
Scenario...
After taking a look at the Data Collector set she set up, Linda determines that both the refresh rate and resolution settings are in line with what the hardware supports. One thing she did notice while looking at the Reliability Monitor is that there were a number of failures in the Application Failures section.
Figure 7.5. Reliability Monitor report
Scenario...
Linda also notes from the graph that the errors are becoming more frequent and happen on days in which she received the most complaints from her users. The application failures showed that indeed there are recurring issues with the video card software not recognizing the card properly.
Since all the cards in question are an IT standard, she tests the latest driver and doesn't notice any problems on her test computer. She notes this and submits a change request to update the drivers on the affected computers.
In working through this problem, Linda has identified many steps of Incident Management and Problem Management in her new service improvement project and is now ready to incorporate these improvements at the next SLA Review.
Once marked as a known error pending resolution, a Request for Change (RFC), as driven by the Service Desk, is communicated to the affected users and they are told when to expect the problem to be resolved. When the RFC is approved and pushed to the desktops, the Service Desk follows up to make sure that the incidents have been resolved, and the tickets are closed.
Fault Tree Analysis
Fault Tree Analysis is another visual technique used to assist with root cause analysis. It is a top-down approach to identify all potential causes leading to a defect. In the final stage of diagnosis, the root cause is identified and the problem moved from unknown state to known. See the example shown in Figure 7.6.
Figure 7.6. FTA analysis for a problem
Proactive Analysis and Event Resolution
Proactive analysis activities are concerned with identifying and resolving problems and known errors before incidents occur, thus minimizing the adverse impact on the service and business as a whole. This can be accomplished by reviewing the following:
- The current incident database.
- All forwarded events stored in a central repository.
- Corporate Error Report data.
- Knowledge Base with unknown errorstate.
The basis for selecting which problems to attempt to proactively resolve is based on several factors including:
- Cost to the business.
- Customers affected.
- Volume, duration, and cost of the incidents.
- Cost of implementation.
- Likelihood of success.
Figure 7.7. Targeting Preventative Action
Using these factors, an algorithm can be created and used to calculate the business impact of support events. This can be a useful, cost-effective way to determine which problems to address.
Exhibit 7.1. Sample Incident Business Impact
Access this content as part of theWVSLM download package.
Periodic Tasks
The most common tasks to be executed on a periodical basis in order to support the Windows Vista desktop service effectively and efficiently are listed in the table below.
Table 7.4. Common Periodic Tasks
Task
Frequency
Owner
Incident detection and recording in IT Service Management service desk tool
As needed (when an incident is open)
Service Desk Analyst
Classification and initial support
Use the Windows Vista Knowledge Base or User Assistance features to search for similar symptoms and offer possible solutions
Windows Vista Remote Desktop and User Assistance
As needed
Service Desk Analyst
Resolution and Recovery
Windows Vista help and support features
Use Virtual PC 2007 as a free download for enterprise customers to reproduce the issue
Windows Vista Hardware Development Center
Troubleshooting and Support
Patch Management Diagnostics and Recovery toolset available in the Desktop Optimization kit
As needed
Support Technician
Major incident manager (in high severity incidents)
Incident closure
As needed (after getting customer/user approval)
Service Desk Analyst
Incident ownership, monitoring, tracking, and communication
Daily
Bi-daily (for low severity incident)
Service Desk Analyst
OR
Desktop Support Technician
(depending on the Service Desk Model implemented)
Review to drive the efficiency and effectiveness of Incident Management process
Monthly
Incident Manager/Service Desk Manager (Doris Krieger)
Producing Service Management information
Quarterly for SLA Reviews
Incident Manager/Service Desk Manager
Managing escalations
As needed
Service lead
Problem recording
As needed
Monthly
Problem Manager
Problem control and root cause determination
Fault Tree Analysis (FTA) explained above
As needed
Problem Analyst (A second assistant is Support Specialist
Error control
As needed/scheduled
Quick Fix Engineering team (subset of Development team)
Proactive prevention of problems
Windows Vista Scripts to collect selective data
Windows Vista Corporate Error Reporting
Problem Reports and Solutions
Auto Event forwarding and subscription
Business Impact Calculation
Monthly: Proactive analysis for the Windows Vista Desktops Events forwarded or Corporate Windows Vista Error Reports
Calculates the pain value to narrow down the focus to top five problems
Problem Manager
Major Problem Reviews
As needed
Problem Manager
Essential Tasks of the Support Role
The following table illustrates some of the essential tasks required by this capability to drive higher customer satisfaction and ensure consensus between the IT service desk team and business. It also describes the objective, frequency, and responsible party for each task.
Table 7.5. Support Role Essential Tasks
Task
Description/Why
Frequency
Responsibility Role
Produce business-related metrics reports
These reports will be a great input during service review meetings and internal operational reviews. Most IT Service Management tools offer some reporting features that include the above mentioned ones.
Weekly
Service Desk lead
Internal support review meetings
Ensure clearing any outstanding issues from previous week.
Weekly
Service Desk Manager
Service Review Meeting with IT MGMT
Communicate any feedback or complaints from business side and explore ways to fix them.
Monthly
IT Manager
Service Desk Manager
Attend SLA Review
Review support SLA targets, discuss areas to be accountable for successful achievement and responsibility for any future service improvement opportunities
Quarterly (or at whatever was agreed on during SLA negotiations)
IT Manager
Service Desk Manager
Partner underpinning contracts (UC) review
Periodic review should be conducted with partners to ensure they are on track and meeting all targets.
Monthly
Partner Account Manager and Service Desk Manager
Critical Success Factors and Key Performance Indicators
Sample Scorecard
Use a scorecard such as the sample below to measure performance against KPIs and to track trends.
Table 7.6. Scorecard Sample
Exhibit 7.2. Sample Scorecard
Access this content as part of the WVSLM download package.
Exhibits
Exhibit 7.3. Sample Incident Survey
Sample customer survey to be used after closing out an incident.
Access this content as part of the WVSLM download package.
Exhibit 7.4. Sample Major Problem Review
It is a learning opportunity (instead of a blaming session) to discuss what went wrong, what went right, and what corrective and preventative actions shall be taken and by who and when.
Access this content as part of the WVSLM download package.
Technical Guidance
- Incident Management service management function in MOF https://www.microsoft.com/technet/solutionaccelerators/cits/mo/smf/smfincmg.mspx
- Problem Management service management function in MOF https://www.microsoft.com/technet/solutionaccelerators/cits/mo/smf/smfprbmg.mspx
- Help and Support Features http://technet2.microsoft.com/Windows Vista/en/library/8531e599-f2b0-4943-a572-d8527fd466e61033.mspx?mfr=true
- Walkthrough: Customize Help and Support http://technet2.microsoft.com/Windows Vista/f/?en/library/bcffdbb6-82df-4a0f-b215-759a1ecfbdd61033.mspx
- Windows Error Reporting and the Problem Reports and Solutions Feature in Windows Vista http://technet2.microsoft.com/Windows Vista/f/?en/library/03a97dba-6b24-42c7-81f0-fb751f6e839d1033.mspx
- How to: Configure Microsoft Error Reporting https://msdn2.microsoft.com/en-us/library/bb219076.aspx
- Event Log, Event Viewer, and Task Scheduler in Windows Vista https://www.microsoft.com/technet/community/events/vista/dsk-07.mspx
- Event Forwarding and Subscription https://www.microsoft.com/technet/technetmag/issues/2006/11/EventManagement/default.aspx
- Microsoft Diagnostics and Recovery Toolset https://download.microsoft.com/download/e/4/4/e4442c9f-30d9-41f3-9876-82bbfc5aa4e6/datasheet-drt.pdf