| title | Install PolyBase on Linux |
|---|---|
| titlesuffix | SQL Server |
| description | Learn how to install SQL Server PolyBase on Linux. PolyBase enables you to run external queries against remote data sources. |
| author | MikeRayMSFT |
| ms.author | mikeray |
| ms.reviewer | dakryze |
| ms.date | 8/18/2020 |
| ms.topic | conceptual |
| ms.prod | sql |
| ms.technology | linux |
| monikerRange | >= sql-server-linux-ver15 || >= sql-server-ver15 || =sqlallproducts-allversions |
[!INCLUDE sqlserver2019-linux]
The following steps install PolyBase (mssql-server-polybase and mssql-server-polybase-hadoop) on Linux. PolyBase enables you to run external queries against remote data sources.
Note
Before installing PolyBase, first install SQL Server 2019. This configures the keys and repositories that you use when installing the mssql-server-polybase and mssql-server-polybase-hadoop package.
Note
- PolyBase is not supported on SQL Server 2017 for Linux.
- Scale-out for PolyBase on Linux is currently unavailable.
Install PolyBase for your operating system:
Use the following command to install the mssql-server-polybase on Red Hat Enterprise Linux.
sudo yum install -y mssql-server-polybaseYou will be prompted to restart the SQL Server instance. Use the following command to do so.
sudo systemctl restart mssql-serverNote
After installation, you must enable the PolyBase feature.
Use the following command to install the mssql-server-polybase-hadoop.
sudo yum install -y mssql-server-polybase-hadoopThe PolyBase Hadoop package has dependencies on the following packages:
mssql-servermssql-server-polybasemssql-server-extensibilitymssql-zulu-jre-11.
Installation prompts to restart launchpadd. Use the following command to do so.
sudo systemctl restart mssql-launchpaddNote
After installation, you must set the Hadoop connectivity level.
If you need an offline installation, locate the PolyBase package download in the Release notes. Then use the same offline installation steps described in the article Install SQL Server.
Use the following command to install the mssql-server-polybase on Ubuntu.
sudo apt-get install mssql-server-polybaseYou will be prompted to restart the SQL Server instance. Use the following command to do so.
sudo systemctl restart mssql-serverNote
After installation, you must enable the PolyBase feature.
If you need an offline installation, locate the PolyBase package download in the Release notes. Then use the same offline installation steps described in the article Install SQL Server.
Use the following command to install the mssql-server-polybase-hadoop.
sudo apt-get install mssql-server-polybase-hadoopThe PolyBase Hadoop package has dependencies on the following packages:
mssql-servermssql-server-polybasemssql-server-extensibilitymssql-zulu-jre-11.
Installation prompts to restart launchpadd. Use the following command to do so.
sudo systemctl restart mssql-launchpaddNote
After installation, you must set the Hadoop connectivity level.
Use the following commands to install the mssql-server-polybase on SUSE Linux Enterprise Server.
sudo zypper install mssql-server-polybaseYou will be prompted to restart the SQL Server instance. Use the following command to do so.
sudo systemctl restart mssql-serverNote
After installation, you must enable the PolyBase feature.
If you need an offline installation, locate the PolyBase package download in the Release notes. Then use the same offline installation steps described in the article Install SQL Server.
After installation, PolyBase must be enabled to access its features. Connect to the installed SQL Server instance and use the following Transact-SQL command to enable.
exec sp_configure @configname = 'polybase enabled', @configvalue = 1;
RECONFIGURE WITH OVERRIDE;If you already have mssql-server-polybase installed, you can update to the latest version with the following commands:
sudo yum remove -y mssql-server-polybase-hadoop
sudo yum remove -y mssql-server-polybase
sudo yum check-update
sudo yum install -y mssql-server-polybase
sudo yum install -y mssql-server-polybase-hadoopYou will be prompted to restart the SQL Server instance. Use the following command to do so.
sudo systemctl restart mssql-server
sudo apt-get remove mssql-server-polybase-hadoop
sudo apt-get remove mssql-server-polybase
sudo apt-get update
sudo apt-get install mssql-server-polybase
sudo apt-get remove mssql-server-polybase-hadoopYou will be prompted to restart the SQL Server instance. Use the following command to do so.
sudo systemctl restart mssql-server
sudo zypper remove mssql-server-polybase
sudo zypper refresh
sudo zypper install mssql-server-polybaseYou will be prompted to restart the SQL Server instance. Use the following command to do so.
sudo systemctl restart mssql-server
Note
After installation, you must enable the PolyBase feature.
PolyBase on Linux can access the following data sources. Follow the provided links for more information on how to create an external table from these sources on PolyBase is enabled.
- SQL Server, SQL Database, Azure Synapse Analytics)
- Hadoop
- Azure Blob Storage
- Oracle
- Teradata
- MongoDB (& Cosmos DB)
For more information on how this is used, see Transact-SQL reference article for CREATE EXTERNAL TABLE.