Skip to content

Commit 37cbe81

Browse files
committed
Update for content perf
1 parent ad5853b commit 37cbe81

1 file changed

Lines changed: 26 additions & 36 deletions

File tree

Lines changed: 26 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: "What is PolyBase? | Microsoft Docs"
33
description: PolyBase enables your SQL Server instance to process Transact-SQL queries that read data from external data sources such as Hadoop and Azure Blob Storage.
4-
ms.date: 06/10/2019
4+
ms.date: 12/14/2019
55
ms.prod: sql
66
ms.technology: polybase
77
ms.topic: "overview"
@@ -15,38 +15,43 @@ helpviewer_keywords:
1515
- "Hadoop export"
1616
- "Hadoop export, PolyBase overview"
1717
- "Hadoop import, PolyBase overview"
18+
ms.custom: contperf-fy21q2
1819
author: MikeRayMSFT
1920
ms.author: mikeray
2021
ms.reviewer: ""
2122
monikerRange: ">=sql-server-2016||>=sql-server-linux-2017||>=aps-pdw-2016||=azure-sqldw-latest"
2223
---
24+
2325
# What is PolyBase?
2426

2527
[!INCLUDE[appliesto-ss-xxxx-asdw-pdw-md](../../includes/appliesto-ss-xxxx-asdw-pdw-md.md)]
2628

27-
<!--SQL Server 2016/2017-->
28-
::: moniker range="= sql-server-2016 || = sql-server-2017 || >= aps-pdw-2016 || = azure-sqldw-latest"
29+
PolyBase enables your SQL Server instance to process Transact-SQL queries that read data from external data sources. The same query can also access relational tables in your instance of SQL Server. PolyBase enables the same query to also join the data from external sources and and SQL Server.
2930

30-
PolyBase enables your SQL Server 2016 instance to process Transact-SQL queries that read data from Hadoop. The same query can also access relational tables in your SQL Server. PolyBase enables the same query to also join the data from Hadoop and SQL Server. In SQL Server, an [external table](../../t-sql/statements/create-external-table-transact-sql.md) or [external data source](../../t-sql/statements/create-external-data-source-transact-sql.md) provides the connection to Hadoop.
31+
To use PolyBase, in an instance of SQL Server, create:
3132

32-
![PolyBase logical](../../relational-databases/polybase/media/polybase-logical.png "PolyBase logical")
33+
- An [external data source](../../t-sql/statements/create-external-data-source-transact-sql.md)
3334

34-
PolyBase pushes some computations to the Hadoop node to optimize the overall query. However, PolyBase external access is not limited to Hadoop. Other unstructured non-relational tables are also supported, such as delimited text files.
35+
&
3536

36-
> [!TIP]
37-
> SQL Server 2019 introduces new connectors for PolyBase, including SQL Server, Oracle, Teradata, and MongoDB. For more information, see the [PolyBase documentation for SQL Server 2019](polybase-guide.md?view=sql-server-ver15)
37+
- An [external table](../../t-sql/statements/create-external-table-transact-sql.md)
3838

39-
::: moniker-end
40-
<!--SQL Server 2019-->
41-
::: moniker range=">= sql-server-linux-ver15 || >= sql-server-ver15 "
39+
Together, these provide the connection to the external data source.
4240

43-
PolyBase enables your SQL Server instance to process Transact-SQL queries that read data from external data sources. SQL Server 2016 and higher can access external data in Hadoop and Azure Blob Storage. Starting in SQL Server 2019, you can now use PolyBase to access external data in [SQL Server](polybase-configure-sql-server.md), [Oracle](polybase-configure-oracle.md), [Teradata](polybase-configure-teradata.md), and [MongoDB](polybase-configure-mongodb.md).
41+
SQL Server 2016 introduces PolyBase with support for connections to Hadoop and Azure Blob Storage.
4442

45-
The same queries that access external data can also target relational tables in your SQL Server instance. This allows you to combine data from external sources with high-value relational data in your database. In SQL Server, an [external table](../../t-sql/statements/create-external-table-transact-sql.md) or [external data source](../../t-sql/statements/create-external-data-source-transact-sql.md) provides the connection to Hadoop.
43+
SQL Server 2019 introduces additional connectors, including SQL Server, Oracle, Teradata, and MongoDB. For more information, see the [PolyBase documentation for SQL Server 2019](polybase-guide.md?view=sql-server-ver15)
4644

47-
PolyBase pushes some computations to the Hadoop node to optimize the overall query. However, PolyBase external access is not limited to Hadoop. Other unstructured non-relational tables are also supported, such as delimited text files.
45+
![PolyBase logical](../../relational-databases/polybase/media/polybase-logical.png "PolyBase logical")
4846

49-
::: moniker-end
47+
PolyBase pushes some computations to the external source to optimize the overall query. PolyBase external access is not limited to Hadoop. Other unstructured non-relational tables are also supported, such as delimited text files.
48+
49+
Examples of external connectors include:
50+
51+
- [SQL Server](polybase-configure-sql-server.md)
52+
- [Oracle](polybase-configure-oracle.md)
53+
- [Teradata](polybase-configure-teradata.md)
54+
- [MongoDB](polybase-configure-mongodb.md).
5055

5156
### Supported SQL products and services
5257

@@ -62,20 +67,20 @@ With the underlying help of PolyBase, T-SQL queries can also import and export d
6267

6368
## Why use PolyBase?
6469

65-
In the past it was more difficult to join your SQL Server data with external data. You had the two following unpleasant options:
70+
PolyBase allows you to join data from a SQL Server instance with external data. Prior to PolyBase to join data to external data sources you could either:
6671

67-
- Transfer half your data so that all your data was in one format or the other.
72+
- Transfer half your data so that all the data was in one location.
6873
- Query both sources of data, then write custom query logic to join and integrate the data at the client level.
6974

70-
PolyBase avoids those unpleasant options by using T-SQL to join the data.
75+
PolyBase allows you to simply use Transact-SQL to join the data.
7176

72-
To keep things simple, PolyBase does not require you to install additional software to your Hadoop environment. You query external data by using the same T-SQL syntax used to query a database table. The support actions implemented by PolyBase all happen transparently. The query author does not need any knowledge about Hadoop.
77+
PolyBase does not require you to install additional software to your Hadoop environment. You query external data by using the same T-SQL syntax used to query a database table. The support actions implemented by PolyBase all happen transparently. The query author does not need any knowledge about the external source.
7378

7479
### PolyBase uses
7580

7681
PolyBase enables the following scenarios in SQL Server:
7782

78-
- **Query data stored in Hadoop from SQL Server or PDW.** Users are storing data in cost-effective distributed and scalable systems, such as Hadoop. PolyBase makes it easy to query the data by using T-SQL.
83+
- **Query data stored in Hadoop from a SQL Server instance or PDW.** Users are storing data in cost-effective distributed and scalable systems, such as Hadoop. PolyBase makes it easy to query the data by using T-SQL.
7984

8085
- **Query data stored in Azure Blob Storage.** Azure blob storage is a convenient place to store data for use by Azure services. PolyBase makes it easy to access the data by using T-SQL.
8186

@@ -91,29 +96,14 @@ PolyBase enables the following scenarios in SQL Server:
9196

9297
- **Scale compute resources.** To improve query performance, you can use SQL Server [PolyBase scale-out groups](../../relational-databases/polybase/polybase-scale-out-groups.md). This enables parallel data transfer between SQL Server instances and Hadoop nodes, and it adds compute resources for operating on the external data.
9398

94-
<!--SQL Server 2016/2017-->
95-
::: moniker range="=sql-server-2016||=sql-server-2017"
96-
9799
## Next steps
98100

99101
Before using PolyBase, you must [install the PolyBase feature](polybase-installation.md). Then see the following configuration guides depending on your data source:
100102

101-
- [Hadoop](polybase-configure-hadoop.md)
102-
- [Azure Blob Storage](polybase-configure-azure-blob-storage.md)
103-
104-
::: moniker-end
105-
<!--SQL Server 2019-->
106-
::: moniker range=">= sql-server-linux-ver15||>= sql-server-ver15"
107-
108-
## Next steps
109-
110-
Before using PolyBase, you must [install the PolyBase feature](polybase-installation.md). Then see the following configuration guides depending on your data source:
111103
- [Hadoop](polybase-configure-hadoop.md)
112104
- [Azure Blob Storage](polybase-configure-azure-blob-storage.md)
113105
- [SQL Server](polybase-configure-sql-server.md)
114106
- [Oracle](polybase-configure-oracle.md)
115107
- [Teradata](polybase-configure-teradata.md)
116108
- [MongoDB](polybase-configure-mongodb.md)
117-
- [ODBC Generic Types](polybase-configure-odbc-generic.md)
118-
119-
::: moniker-end
109+
- [ODBC Generic Types](polybase-configure-odbc-generic.md)

0 commit comments

Comments
 (0)