Classified Columns (Data Mining)

When you define a classified column, you create a relationship between the current column and another column in the same mining structure. The column that you designate as the classified column contains categorical information that describes the values in that other column.

For example, suppose you have two columns with numerical data: one column, [Yearly Purchases], contains the total yearly purchases per customer for a specific calendar year, and the other column, [Standard Deviations], contains the standard deviations for those values. In this case, you could designate the [Yearly Purchases] column as the classified column, and the model would be able to use this relationship in analysis.
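As a rough illustration of why pairing the two columns is useful, the following Python sketch keeps each [Yearly Purchases] value together with its [Standard Deviations] value and derives a plausible range from the pair. This is not Analysis Services code: the `CustomerCase` class, the `purchase_range` function, and the sample numbers are all hypothetical and exist only to show how a custom algorithm might consume the relationship.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CustomerCase:
    """One case (row); field names are hypothetical stand-ins for the two columns."""
    customer_id: int
    yearly_purchases: float    # value from [Yearly Purchases]
    standard_deviation: float  # paired value from [Standard Deviations]


def purchase_range(case: CustomerCase, k: float = 1.0) -> tuple[float, float]:
    """Return (low, high): the purchase value plus or minus k standard deviations."""
    spread = k * case.standard_deviation
    return (case.yearly_purchases - spread, case.yearly_purchases + spread)


# Made-up sample data, for illustration only.
cases = [
    CustomerCase(customer_id=1, yearly_purchases=1200.0, standard_deviation=150.0),
    CustomerCase(customer_id=2, yearly_purchases=300.0, standard_deviation=40.0),
]

for case in cases:
    low, high = purchase_range(case, k=2.0)
    print(f"customer {case.customer_id}: {low:.2f} to {high:.2f}")
```

Without the declared relationship, an algorithm would see two unrelated numeric columns; with it, the second column can be read as a qualifier of the first.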

Note

The algorithms provided in Analysis Services do not support classified columns. This feature is provided for use in creating custom algorithms.

Defining a Classified Column

The data type of a classified column must be either Long or Double.

The following list describes the content types that Analysis Services supports for classified columns.

  • PROBABILITY
    The value in the column is the probability of the associated value, and is a number between 0 and 1.

  • VARIANCE
    The value in the column is the variance of the associated value.

  • STDEV
    The value in the column is the standard deviation of the associated value.

  • PROBABILITY_VARIANCE
    The value in the column is the variance of the probability for the associated value.

  • PROBABILITY_STDEV
    The value in the column is the standard deviation of the probability for the associated value.

  • SUPPORT
    The value in the column is the weight, or case replication factor, of the associated value.
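The rules in this section can be summarized as a small validation routine. The Python sketch below is only an illustration of those rules, not part of any Analysis Services API: the function name `check_classified_column` and its parameters are assumptions made for this example. It checks the data type, the content type, and, for PROBABILITY, the 0 to 1 range stated above.

```python
# Data types and content types taken from the rules described in this section.
ALLOWED_DATA_TYPES = {"Long", "Double"}
ALLOWED_CONTENT_TYPES = {
    "PROBABILITY",
    "VARIANCE",
    "STDEV",
    "PROBABILITY_VARIANCE",
    "PROBABILITY_STDEV",
    "SUPPORT",
}


def check_classified_column(data_type, content_type, values):
    """Return a list of problems found; an empty list means no rule was violated.

    Hypothetical helper: it only mirrors the constraints listed in the text.
    """
    problems = []
    if data_type not in ALLOWED_DATA_TYPES:
        problems.append(f"data type {data_type!r} is not Long or Double")
    if content_type not in ALLOWED_CONTENT_TYPES:
        problems.append(f"content type {content_type!r} is not supported")
    # PROBABILITY values are described as numbers between 0 and 1.
    if content_type == "PROBABILITY":
        out_of_range = [v for v in values if not 0 <= v <= 1]
        if out_of_range:
            problems.append(f"probability values out of range: {out_of_range}")
    return problems


print(check_classified_column("Double", "PROBABILITY", [0.1, 0.75, 1.0]))
print(check_classified_column("Text", "PROBABILITY", [0.5, 1.4]))
```

The first call reports no problems; the second reports both an unsupported data type and an out-of-range probability.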

See Also

Concepts

Content Types (Data Mining)

Mining Structures (Analysis Services - Data Mining)

Data Types (Data Mining)