Basic Data Mining Tutorial

Welcome to the Microsoft SQL Server 2008 Analysis Services (SSAS) Basic Data Mining Tutorial. Microsoft SQL Server provides an integrated environment for creating and working with data mining models. In this Basic Data Mining Tutorial, you will complete a scenario for a targeted mailing campaign in which you create three models for analyzing customer purchasing behavior and targeting potential buyers. The tutorial demonstrates how to use the data mining algorithms, mining model viewers, and data mining tools that are included in Microsoft SQL Server Analysis Services. The fictitious company, Adventure Works Cycles, is used for all examples.

When you are comfortable using the data mining tools, we recommend that you also complete the Intermediate Data Mining Tutorial, which demonstrates how to use forecasting, market basket analysis, time series, association models, nested tables, and sequence clustering.

Tutorial Scenario

In this tutorial, you are an employee of Adventure Works Cycles who has been tasked with learning more about the company's customers based on historical purchases, and then using that historical data to make predictions that can be used in marketing. The company has never done data mining before, so you must create a new database specifically for data mining and set up several data mining models.

What You Will Learn

This tutorial teaches you how to create and work with several different types of data mining models. It also teaches you how to create a copy of a mining model and apply a filter to it. You then process the new model and evaluate it by using a lift chart. After the model is complete, you use drillthrough to retrieve additional data from the underlying mining structure.
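To make the idea of a lift chart concrete before you start, the following is a minimal Python sketch of the calculation behind one. It is not how Analysis Services builds its chart; the function name `cumulative_gains` and the sample scores are hypothetical and chosen only for illustration. The sketch ranks cases by predicted probability and reports what percentage of actual buyers is captured in each top slice of the population.

```python
def cumulative_gains(scored_cases, steps=10):
    """Return (percent_of_population, percent_of_buyers_captured) points.

    scored_cases is a list of (predicted_probability, actually_bought) pairs.
    This is an illustrative calculation, not the Analysis Services algorithm.
    """
    # Rank cases from most to least likely to buy, according to the model.
    ranked = sorted(scored_cases, key=lambda pair: pair[0], reverse=True)
    total_buyers = sum(1 for _, bought in ranked if bought)
    if total_buyers == 0:
        raise ValueError("at least one actual buyer is required")

    points = []
    for step in range(1, steps + 1):
        cutoff = len(ranked) * step // steps  # size of the top slice
        captured = sum(1 for _, bought in ranked[:cutoff] if bought)
        points.append((100 * step // steps, 100.0 * captured / total_buyers))
    return points


# Hypothetical scored test cases: (predicted probability, actually bought).
scored = [
    (0.90, True), (0.80, True), (0.70, False), (0.60, True), (0.50, False),
    (0.40, False), (0.30, True), (0.20, False), (0.10, False), (0.05, False),
]
gains = cumulative_gains(scored, steps=5)
for population_pct, buyers_pct in gains:
    print(population_pct, buyers_pct)
```

In this made-up data, mailing the top 20 percent of ranked customers reaches half of the buyers, whereas a random mailing of the same size would be expected to reach about 20 percent of them. That gap is what a lift chart visualizes.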

In SQL Server 2008, Microsoft provides several new features that help you develop custom data mining models and use the results more effectively.

  • Holdout Test Sets - When you create a mining structure, you can now divide the data in the mining structure into training and testing sets.

  • Mining model filters - You can now attach a filter to a mining model, and the filter is applied during both training and testing.

  • Drillthrough to Structure Cases and Structure Columns - You can now easily move from the general patterns in the mining model to actionable detail in the data source.
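The first two features can be pictured with a short conceptual sketch in Python. It only illustrates the ideas of a holdout partition and a model filter and does not use any Analysis Services API. The helper names `split_holdout` and `apply_filter`, the 30 percent holdout, and the sample customer records are all assumptions made for this example.

```python
import random


def split_holdout(cases, holdout_percent=30, seed=0):
    """Randomly partition cases into (training, testing) lists."""
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    shuffled = list(cases)
    rng.shuffle(shuffled)
    n_test = len(shuffled) * holdout_percent // 100
    return shuffled[n_test:], shuffled[:n_test]


def apply_filter(cases, predicate):
    """Keep only the cases that satisfy the filter condition."""
    return [case for case in cases if predicate(case)]


# Hypothetical customer cases; the field names are illustrative only.
customers = [
    {"id": i, "age": 20 + i % 50, "bought_bike": i % 3 == 0}
    for i in range(100)
]

# Holdout test set: reserve 30 percent of the cases for testing.
training, testing = split_holdout(customers, holdout_percent=30, seed=42)


def is_over_30(case):
    """Example filter condition: customers older than 30."""
    return case["age"] > 30


# Model filter: the same condition is applied to both partitions.
filtered_training = apply_filter(training, is_over_30)
filtered_testing = apply_filter(testing, is_over_30)

print(len(training), len(testing))
```

Applying one condition to both partitions mirrors the point made in the list above: a filtered model is trained and tested on the same subset of cases, so its accuracy is measured on the population it was built for.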

This tutorial is divided into the following lessons:

Requirements

Make sure that the following are installed:

  • Microsoft SQL Server 2008 

  • Microsoft SQL Server Analysis Services 

  • The AdventureWorks2008 database

To enhance security, the sample databases are not installed with SQL Server 2008. To install the official databases for Microsoft SQL Server, visit Microsoft SQL Sample Databases (https://msdn.microsoft.com/en-us/library/ms143804(v=sql.100)).

Note

When you are working through a tutorial, you might find it easier to move back and forth between the steps if you add the Next topic and Previous topic buttons to the document viewer toolbar. For more information, see Adding Next and Previous Buttons to Help.