Skip to content

Latest commit

 

History

History
60 lines (47 loc) · 2.74 KB

File metadata and controls

60 lines (47 loc) · 2.74 KB
title Classified Columns (Data Mining) | Microsoft Docs
ms.custom
ms.date 06/13/2017
ms.prod sql-server-2014
ms.reviewer
ms.technology analysis-services
ms.topic conceptual
helpviewer_keywords
content types [data mining]
STDEV column
VARIANCE column
PROBABLILITY column
PROBABILITY_STDEV column
columns [data mining], classified
classified columns [data mining]
PROBABILITY_VARIANCE column
SUPPORT column
ms.assetid 68bf3b78-dc12-497c-898f-b43a45646123
author minewiskan
ms.author owend
manager craigg

Classified Columns (Data Mining)

When you define a classified column, you create a relationship between the current column and another column in the mining structure. The data in the mining structure column that you designate as the classified column contains categorical information that describes the values in another column in the mining structure.

For example, suppose you have two columns with numerical data: one column, [Yearly Purchases], contains the total yearly purchases per customer for a specific calendar year, and the other column, [Standard Deviations], contains the standard deviations for those values. In this case you could designate the [Yearly Purchases] column as the classified column, and the model would be able to use this relationship in analysis.

Note

The algorithms provided in [!INCLUDEssASnoversion] do not support the use of classified columns; this feature is provided for use in creating custom algorithms.

Defining a Classified Column

The data type of a classified column must be either Long or Double.

The following list describes the content types that [!INCLUDEssASnoversion] supports for classified columns.

PROBABILITY
The value in the column is the probability of the associated value, and is a number between 0 and 1.

VARIANCE
The value in the column is the variance of the associated value.

STDEV
The value in the column is the standard deviation of the associated value.

PROBABILITY_VARIANCE
The value in the column is the variance of the probability for the associated value.

PROBABILITY_STDEV
The value in the column is the standard deviation of the probability for the associated value.

SUPPORT
The value in the column is the weight, or case replication factor, of the associated value.

See Also

Content Types (Data Mining)
Mining Structures (Analysis Services - Data Mining)
Data Types (Data Mining)