title: Classified Columns (Data Mining) | Microsoft Docs
description: Learn that when you define a classified column, you create a relationship between the current column and another column in the mining structure.
ms.date: 10/31/2023
ms.service: analysis-services
ms.custom: data-mining
ms.topic: conceptual
ms.author: kfollis
ms.reviewer: kfollis
author: kfollis

Classified Columns (Data Mining)

[!INCLUDEappliesto-sql2019-earlier]

[!INCLUDEdm-dep-banner]

When you define a classified column, you create a relationship between the current column and another column in the mining structure. The data in the mining structure column that you designate as the classified column contains categorical information that describes the values in another column in the mining structure.

For example, suppose you have two columns with numerical data: one column, [Yearly Purchases], contains the total yearly purchases per customer for a specific calendar year, and the other column, [Standard Deviations], contains the standard deviations for those values. In this case, you could designate the [Standard Deviations] column as the classified column, because its values describe the values in [Yearly Purchases], and the model would be able to use this relationship in analysis.

Note

The algorithms provided in [!INCLUDEssASnoversion] do not support the use of classified columns; this feature is provided for use in creating custom algorithms.

Defining a Classified Column

The data type of a classified column must be either Long or Double.

The following list describes the content types that [!INCLUDEssASnoversion] supports for classified columns.

PROBABILITY
The value in the column is the probability of the associated value, and is a number between 0 and 1.

VARIANCE
The value in the column is the variance of the associated value.

STDEV
The value in the column is the standard deviation of the associated value.

PROBABILITY_VARIANCE
The value in the column is the variance of the probability for the associated value.

PROBABILITY_STDEV
The value in the column is the standard deviation of the probability for the associated value.

SUPPORT
The value in the column is the weight, or case replication factor, of the associated value.
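
The rules in this section can be checked mechanically before a custom algorithm consumes a structure. The following Python sketch is only an illustration under stated assumptions: `ClassifiedColumn`, `definition_errors`, and `value_errors` are hypothetical names, not part of any Analysis Services API, and the sample columns (including [Customer Key]) are made up for the example. It checks the data type, the content type, that the described column exists in the structure, and the one value range this article states (PROBABILITY values lie between 0 and 1).

```python
from dataclasses import dataclass

# Data types and content types that this article lists for classified columns.
ALLOWED_DATA_TYPES = {"LONG", "DOUBLE"}
CLASSIFIED_CONTENT_TYPES = {
    "PROBABILITY", "VARIANCE", "STDEV",
    "PROBABILITY_VARIANCE", "PROBABILITY_STDEV", "SUPPORT",
}


@dataclass(frozen=True)
class ClassifiedColumn:
    """Hypothetical description of a classified column (not a product API)."""
    name: str          # the classified column itself
    data_type: str     # must be Long or Double
    content_type: str  # one of CLASSIFIED_CONTENT_TYPES
    classifies: str    # the column whose values this column describes


def definition_errors(column, structure_columns):
    """Return a list of problems with the definition; an empty list means none found."""
    errors = []
    if column.data_type.upper() not in ALLOWED_DATA_TYPES:
        errors.append(f"{column.name}: data type must be Long or Double")
    if column.content_type.upper() not in CLASSIFIED_CONTENT_TYPES:
        errors.append(f"{column.name}: unsupported content type {column.content_type}")
    if column.classifies not in structure_columns:
        errors.append(f"{column.name}: {column.classifies} is not in the mining structure")
    if column.classifies == column.name:
        errors.append(f"{column.name}: the relationship must point to another column")
    return errors


def value_errors(column, values):
    """Check the only range rule stated in the article: probabilities lie in [0, 1]."""
    if column.content_type.upper() != "PROBABILITY":
        return []
    return [f"{column.name}: {v} is outside [0, 1]" for v in values if not 0 <= v <= 1]


# Assumed sample structure, reusing the column names from the example above.
structure = {"Customer Key", "Yearly Purchases", "Standard Deviations"}
stdev_col = ClassifiedColumn("Standard Deviations", "DOUBLE", "STDEV", "Yearly Purchases")
bad_col = ClassifiedColumn("Purchase Note", "TEXT", "STDEV", "Yearly Purchases")

print(definition_errors(stdev_col, structure))
print(definition_errors(bad_col, structure))
```

Here `stdev_col` passes every check, while `bad_col` is rejected only because its data type is neither Long nor Double. Returning a list of messages instead of raising on the first failure lets a caller report all problems with a definition at once.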

See Also

Content Types (Data Mining)
Mining Structures (Analysis Services - Data Mining)
Data Types (Data Mining)