Classified Columns (Data Mining)

Article
04/01/2012

When you define a classified column, you create a relationship between the current column and another column in the mining structure. The data in the mining structure column that you designate as the classified column contains categorical information that describes the values in another column in the mining structure.

For example, suppose you have two columns with numerical data: one column, [Yearly Purchases], contains the total yearly purchases per customer for a specific calendar year, and the other column, [Standard Deviations], contains the standard deviations for those values. In this case you could designate the [Yearly Purchases] column as the classified column, and the model would be able to use this relationship in analysis.

Note

The algorithms provided in Analysis Services do not support the use of classified columns; this feature is provided for use in creating custom algorithms.

Defining a Classified Column

The data type of a classified column must be either Long or Double.

The following list describes the content types that Analysis Services supports for classified columns.

PROBABILITY
The value in the column is the probability of the associated value, and is a number between 0 and 1.
VARIANCE
The value in the column is the variance of the associated value.
STDEV
The value in the column is the standard deviation of the associated value.
PROBABILITY_VARIANCE
The value in the column is the variance of the probability for the associated value.
PROBABILITY_STDEV
The value in the column is the standard deviation of the probability for the associated value.
SUPPORT
The value in the column is the weight, or case replication factor, of the associated value.

Classified Columns (Data Mining)

Defining a Classified Column

See Also

Concepts

Additional resources