Skip to content

Latest commit

 

History

History
83 lines (57 loc) · 7.46 KB

File metadata and controls

83 lines (57 loc) · 7.46 KB
title Basic Data Mining Tutorial | Microsoft Docs
ms.custom
ms.date 06/13/2017
ms.prod sql-server-2014
ms.reviewer
ms.technology
analysis-services
ms.topic conceptual
helpviewer_keywords
databases [Analysis Services], tutorials
data mining [Analysis Services], tutorials
tutorials [Data Mining]
ms.assetid 6602edb6-d160-43fb-83c8-9df5dddfeb9c
author minewiskan
ms.author owend
manager craigg

Basic Data Mining Tutorial

Welcome to the [!INCLUDEmsCoName] [!INCLUDEssASnoversion] Basic Data Mining Tutorial. [!INCLUDEmsCoName] [!INCLUDEssNoVersion] provides an integrated environment for creating data mining models and making predictions. In this tutorial, you will complete a scenario for a targeted mailing campaign in which you use machine learning to analyze and predict customer purchasing behavior. The tutorial demonstrates how to use three of the most important data mining algorithms: clustering, decision trees, and Naive Bayes. You will also learn how to analyze your findings using the mining model viewers, and to create predictions and accuracy charts using the data mining tools that are included in [!INCLUDEmsCoName] [!INCLUDEssNoVersion] [!INCLUDEssASnoversion]. The fictitious company, [!INCLUDEssSampleDBCoFull], is used for all examples.

When you are comfortable using the data mining tools, we recommend that you also complete the Intermediate Data Mining Tutorial (Analysis Services - Data Mining). The lessons demonstrate how to use forecasting, market basket analysis, time series, association models, nested tables, and sequence clustering.

Tutorial Scenario

In this tutorial, you are an employee of [!INCLUDEssSampleDBCoFull] who has been tasked with learning more about the company's customers based on historical purchases, and then using that historical data to make predictions that can be used in marketing. The company has never done data mining before, so you must create a new database specifically for data mining and set up several data mining models.

What You Will Learn

This tutorial teaches you how to create and work with several different types of machine learning methods. You will also learn how to create a copy of a mining model, and apply a filter to the input data to get different results. Afterwards you can compare the results of both models using a lift chart. Finally, you will use drillthrough to retrieve additional data from the underlying mining structure.

[!INCLUDEmsCoName] [!INCLUDEssASnoversion] Data Mining includes the following features that help you easily develop and compare multiple predictive models and then take actions on the results :

  • *Holdout Test Sets -*When you create a mining structure, you can now divide the data in the mining structure into training and testing sets. This lets you test models on similar data sets, and compare the accuracy of related models.

  • *Mining model filters -*You can now attach filters to a mining model, and apply the filter during both training and testing. This lets you easily build related models on different subsets of the data.

  • Drillthrough to Structure Cases and Structure Columns - You can now easily move from the general patterns in the mining model to actionable detail in the data source.

This tutorial is divided into the following lessons:

Lesson 1: Preparing the Analysis Services Database (Basic Data Mining Tutorial)
In this lesson, you will learn how to create a new [!INCLUDEssASnoversion] database, add a data source and data source view, and prepare the new database to be used with data mining.

Lesson 2: Building a Targeted Mailing Structure (Basic Data Mining Tutorial)
In this lesson, you will learn how to create a mining model structure that can be used as part of a targeted mailing scenario.

Lesson 3: Adding and Processing Models
In this lesson you will learn how to add models to a structure. The models you create are built with the following algorithms:

Lesson 4: Exploring the Targeted Mailing Models (Basic Data Mining Tutorial)
In this lesson you will learn how to explore and interpret the findings of each model using the Viewers.

Lesson 5: Testing Models (Basic Data Mining Tutorial)
In this lesson, you make a copy of one of the targeted mailing models, add a mining model filter to restrict the training data to a particular set of customers, and then assess the viability of the model.

Lesson 6: Creating and Working with Predictions (Basic Data Mining Tutorial)
In this final lesson of the Basic Data Mining Tutorial, you use the model to predict which customers are most likely to purchase a bike. You then drill through to the underlying cases to obtain contact information.

Requirements

Make sure that the following are installed:

To enhance security, the sample databases are not installed with [!INCLUDEssNoVersion]. To install the official databases for [!INCLUDEmsCoName] [!INCLUDEssNoVersion], visit the Microsoft SQL Sample Databases page and select [!INCLUDEssCurrent].

Note

When you are working through a tutorial, you might find it easier to move back and forth between the steps if you add the Next topic and Previous topic buttons to the document viewer toolbar.

See Also

Data Mining Solutions
Mining Model Tasks and How-tos
Creating and Querying Data Mining Models with DMX: Tutorials (Analysis Services - Data Mining)