Adding a Logistic Regression Model to the Call Center Structure (Intermediate Data Mining Tutorial)
Applies To: SQL Server 2016 Preview
In addition to analyzing the factors that might affect call center operations, you were also asked to provide some specific recommendations on how the staff can improve service quality. In this task, you will use the same mining structure that you used to build the exploratory model and add a mining model that will be used for creating predictions.
In Analysis Services, a logistic regression model is based on the neural networks algorithm, and therefore provides the same flexibility and power as a neural network model. However, logistic regression is particularly well-suited for predicting binary outcomes.
For this scenario, you will use the same mining structure that you used for the neural network model. However, you will customize the new model to target your business questions. You are interested in improving service quality and determining how many experienced operators you need, so you will set up your model to predict those values.
To ensure that all the models based on the call center data are as similar as possible, you will use the same seed value as before. Setting the seed parameter ensures that the model processes the data from the same starting point, and minimizes variations caused by artifacts in the data.
In SQL Server Data Tools (SSDT), in Solution Explorer, right-click the mining structure, Call Center Binned, and select Open Designer.
In Data Mining Designer, click the Mining Models tab.
Click Create a related mining model.
In the New Mining Model dialog box, for Model name, type Call Center - LR. For Algorithm name, select Microsoft Logistic Regression.
The new mining model is displayed in the Mining Models tab.
In the column for the new mining model, Call Center - LR, leave Fact CallCenter ID as the key.
Change the value of ServiceGrade and Level Two Operators to Predict.
These columns will be used both as input and for prediction. In essence, you are creating two separate models on the same data: one that predicts the number of operators, and one that predicts the service grade.
Change all other columns to Input.
In the Mining Model tab, right-click the column for the model named Call Center - LR, and select Set Algorithm Parameters.
In the row for the HOLDOUT_SEED parameter, click the empty cell under Value, and type 1. Click OK.
The value that you choose as the seed does not matter, as long as you use the same seed for all related models.
In the Mining Models menu, select Process Mining Structure and All Models. Click Yes to deploy the updated data mining project to the server.
In the Process Mining Model dialog box, click Run.
Click Close to close the Process Progress dialog box, and then click Close again in the Process Mining Model dialog box.