Skip to content

Commit a5ef447

Browse files
20221115 1423 polybase aps and hadoop tweaks
1 parent e878dbf commit a5ef447

1 file changed

Lines changed: 7 additions & 9 deletions

File tree

docs/relational-databases/polybase/polybase-guide.md

Lines changed: 7 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: "Introducing data virtualization with PolyBase"
3-
description: PolyBase enables your SQL Server instance to process Transact-SQL queries that read data from external data sources such as Hadoop and Azure Blob Storage.
3+
description: PolyBase enables your SQL Server instance to process Transact-SQL queries that read data from external data sources such as Azure Blob Storage.
44
ms.date: 08/22/2022
55
ms.prod: sql
66
ms.technology: polybase
@@ -43,7 +43,7 @@ PolyBase provides these same functionalities for the following SQL products from
4343

4444
- [!INCLUDE[sssql16-md](../../includes/sssql16-md.md)] and later versions (Windows)
4545
- [!INCLUDE[sssql19-md](../../includes/sssql19-md.md)] and later versions (Windows and Linux)
46-
- [!INCLUDE[ssNoVersion](../../includes/ssnoversion-md.md)] [!INCLUDE[pdw](../../includes/sspdw-md.md)] (PDW), hosted in the Analytics Platform System (APS)
46+
- [!INCLUDE[ssNoVersion](../../includes/ssnoversion-md.md)] [!INCLUDE[pdw](../../includes/sspdw-md.md)]
4747
- [!INCLUDE[ssazuresynapse_md](../../includes/ssazuresynapse_md.md)]
4848

4949
> [!NOTE]
@@ -101,8 +101,6 @@ For an example using PolyBase in [!INCLUDE[sssql22-md](../../includes/sssql22-md
101101
1. Create an [external data source](../../t-sql/statements/create-external-data-source-transact-sql.md).
102102
1. Create an [external table](../../t-sql/statements/create-external-table-transact-sql.md).
103103

104-
105-
106104
### Azure integration
107105

108106
With the underlying help of PolyBase, T-SQL queries can also import and export data from Azure Blob Storage. Further, PolyBase enables [!INCLUDE[ssazuresynapse_md](../../includes/ssazuresynapse_md.md)] to import and export data from Azure Data Lake Store, and from Azure Blob Storage.
@@ -122,10 +120,10 @@ PolyBase does not require you to install additional software to your Hadoop envi
122120

123121
PolyBase enables the following scenarios in [!INCLUDE[ssNoVersion](../../includes/ssnoversion-md.md)]:
124122

125-
- **Query data stored in Hadoop from a [!INCLUDE[ssNoVersion](../../includes/ssnoversion-md.md)] instance or PDW.** Users are storing data in cost-effective distributed and scalable systems, such as Hadoop. PolyBase makes it easy to query the data by using T-SQL.
126-
127123
- **Query data stored in Azure Blob Storage.** Azure Blob Storage is a convenient place to store data for use by Azure services. PolyBase makes it easy to access the data by using T-SQL.
128124

125+
- **Query data stored in Hadoop from a [!INCLUDE[ssNoVersion](../../includes/ssnoversion-md.md)] instance or PDW.** Users are storing data in cost-effective distributed and scalable systems, such as Hadoop. PolyBase makes it easy to query the data by using T-SQL.
126+
129127
- **Import data from Hadoop, Azure Blob Storage, or Azure Data Lake Store.** Leverage the speed of Microsoft SQL's columnstore technology and analysis capabilities by importing data from Hadoop, Azure Blob Storage, or Azure Data Lake Store into relational tables. There is no need for a separate ETL or import tool.
130128

131129
- **Export data to Hadoop, Azure Blob Storage, or Azure Data Lake Store.** Archive data to Hadoop, Azure Blob Storage, or Azure Data Lake Store to achieve cost-effective storage and keep it online for easy access.
@@ -142,15 +140,15 @@ PolyBase enables the following scenarios in [!INCLUDE[ssNoVersion](../../include
142140

143141
## Upgrading to SQL Server 2022
144142

145-
Starting in [!INCLUDE [sssql22-md](../../includes/sssql22-md.md)] Hadoop is no longer supported. Azure Storage connectors must be changed based on the reference table below:
143+
Starting in [!INCLUDE [sssql22-md](../../includes/sssql22-md.md)] Hortonworks Data Platform (HDP) and Cloudera Distributed Hadoop (CDH) are no longer supported. Due to these changes, it is required to manually drop PolyBase external data sources created on previous versions of SQL Server that use `TYPE = HADOOP` or Azure Storage before migrating to [!INCLUDE [sssql22-md](../../includes/sssql22-md.md)]. Dropping external data sources also requires dropping the associated database objects, such as database scoped credentials and external tables.
144+
145+
Azure Storage connectors must be changed based on the reference table below:
146146

147147
| External Data Source | From | To |
148148
|:--|:--|:--|
149149
| Azure Blob Storage | wasb[s] | abs |
150150
| ADLS Gen 2 | abfs[s] | adls |
151151

152-
Due to these changes, it is required to manually drop PolyBase external data sources created on previous versions of SQL Server that use `TYPE = HADOOP` or Azure Storage before migrating to [!INCLUDE [sssql22-md](../../includes/sssql22-md.md)]. Dropping external data sources also requires dropping the associated database objects, such as database scoped credentials and external tables.
153-
154152
## Next steps
155153

156154
Before using PolyBase, you must [install PolyBase on Windows](polybase-installation.md) or [install PolyBase on Linux](polybase-linux-setup.md), and [enable PolyBase in sp_configure](polybase-installation.md#enable) if necessary. For more tutorials on creating external data sources and external tables to a variety of data sources, see [PolyBase Transact-SQL reference](polybase-t-sql-objects.md).

0 commit comments

Comments
 (0)