Specifying a Testing Data Set for the Structure (Basic Data Mining Tutorial)
Applies To: SQL Server 2016 Preview
In the final few screens of the Data Mining Wizard you will split your data into a testing set and a training set. You will then name your structure and enable drillthrough on the model.
Separating data into training and testing sets when you create a mining structure makes it possible to easily assess the accuracy of the mining models that you create later. For more information on testing sets, see Training and Testing Data Sets.
To specify the testing set
On the Create Testing Set page, for Percentage of data for testing, leave the default value of 30.
For Maximum number of cases in testing data set, type 1000.
Drillthrough can be enabled on models and on structures. The checkbox in this dialog box enables drillthrough on the named model. After the model has been processed, you will be able to retrieve detailed information from the training data that were used to create the model.
If the underlying mining structure has also been configured to allow drillthrough, you can retrieve detailed information from both the model cases and the mining structure, including columns that were not included in the mining model. For more information, see Drillthrough Queries (Data Mining).
To name the model and structure and specify drillthrough
On the Completing the Wizard page, in Mining structure name, type Targeted Mailing.
In Mining model name, type TM_Decision_Tree.
Select the Allow drill through check box.
Review the Preview pane. Notice that only those columns selected as Key, Input or Predictable are shown. The other columns you selected (e.g., AddressLine1) are not used for building the model but will be available in the underlying structure, and can be queried after the model is processed and deployed.