| title | Install on SQL Server 2022 for Linux | ||
|---|---|---|---|
| titleSuffix | SQL Server Machine Learning Services | ||
| description | Learn how to install SQL Server 2022 Machine Learning Services on Linux: Red Hat, Ubuntu, and SUSE. | ||
| author | WilliamDAssafMSFT | ||
| ms.author | wiassaf | ||
| manager | rothja | ||
| ms.date | 05/24/2022 | ||
| ms.topic | how-to | ||
| ms.prod | sql | ||
| ms.technology | machine-learning-services | ||
| monikerRange | >=sql-server-ver16||>=sql-server-linux-ver16 | ||
| ms.custom |
|
[!INCLUDE SQL Server 2022 - Linux]
This article guides you in the installation of SQL Server Machine Learning Services on Linux. Python and R scripts can be executed in-database using Machine Learning Services.
You can install Machine Learning Services on Ubuntu. Currently, Red Hat Enterprise Linux (RHEL), SUSE Linux Enterprise Server (SLES) are unsupported.
You can install ML Services on a Docker container running a Linux distribution. Inside the Docker container, the steps would be the same as below.
For more information, see the Supported platforms section in the installation guidance for SQL Server on Linux.
Important
This article refers to [!INCLUDEsssql22-md]. For SQL Server 2019 on Linux, see to Install SQL Server 2019 Machine Learning Services (Python and R) on Linux.
-
Install SQL Server on Linux and verify the installation.
-
Check the SQL Server Linux repositories for the Python and R extensions. If you already configured source repositories for the database engine install, you can run the mssql-server-extensibility package install commands using the same repo registration.
-
You should have a tool for running T-SQL commands.
- You can use Azure Data Studio, a free database tool that runs on Linux, Windows, and macOS.
On an internet-connected device, packages are downloaded and installed independently of the database engine using the package installer for each operating system. The following table describes all available packages, but for R and Python, you specify packages that provide either the full feature installation or the minimum feature installation.
Available installation packages for SQL Server 2022 on Linux:
| Package name | Applies-to | Description |
|---|---|---|
| mssql-server-extensibility | All | Extensibility framework used to run Python and R. |
Refer to Quickstart: Install SQL Server and create a database on Ubuntu for SQL Server 2022 on Linux installation. Then, use the following steps to install SQL Server Machine Learning Services on Ubuntu:
The following commands register the repository providing the R language platform.
Tip
If possible, run apt-get update to refresh packages on the system prior to installation.
-
Begin installation as root.
sudo su
-
Optionally, if your system does not have the
https apt transportoption:apt-get install apt-transport-https
-
Set the location of the package repo the "prod" directory containing the distribution. This example specifies 20.04. Replace with 16.04 or 14.04 if you want those versions.
wget https://packages.microsoft.com/config/ubuntu/20.04/packages-microsoft-prod.deb
-
Register the repository.
dpkg -i packages-microsoft-prod.deb
-
Update packages on your system (required).
apt-get update
-
Install the SQL Server extensibility feature.
apt-get install mssql-server-extensibility
-
Download and install the version of R that is desired. Choose a version of R 4.2 or higher, available for download directly from cran.r-project.org. Follow the instructions for the desired runtime.
-
Install CompatibilityAPI and RevoScaleR dependencies. From the R terminal of the version you have installed, run the following:
# R Terminal install.packages("iterators") install.packages("foreach") install.packages("R6") install.packages("jsonlite")
-
Download the CompatibilityAPI for Linux.
-
Install CompatibilityAPI for Linux, specifying the absolute file path to the
.tar.gzfile.install.packages('/path/to/CompatibilityAPI.tar.gz', repos = NULL)
-
Download RevoScaleR Linux.
-
Install RevoScaleR for Linux, specifying the absolute file path to the
.tar.gzfile.install.packages('/path/to/RevoScaleR.tar.gz', repos = NULL)
-
Verify RevoScaleR installation from the R terminal.
library("RevoScaleR") -
Configure the installed R runtime with SQL Server for Linux, where
path/to/is the file path to the R binary, andRFolderVersionis the version-specific folder name for your installation of R runtime, for example,R4.2./opt/mssql/bin/mssql-conf set extensibility rbinpath /path/to/RFolderVersion/lib/R/bin/R /opt/mssql/bin/mssql-conf set extensibility datadirectories /path/to/RFolderVersion/
-
Restart the Launchpad service.
systemctl restart mssql-launchpadd.service
-
Configure SQL Server for Linux to allow external scripts using the
sp_configuresystem stored procedure.EXEC sp_configure 'external scripts enabled', 1; GO RECONFIGURE GO
-
Verify the installation by executing a simple T-SQL command to return the version of R:
EXEC sp_execute_external_script @script=N'print(R.version)',@language=N'R'; GO
-
Begin installation as root.
sudo su
-
Update packages on your system (required).
apt-get update
-
Install the SQL Server extensibility feature.
apt-get install mssql-server-extensibility
-
Download and install the version of Python that is desired. Choose a version of Python 3.10 or higher, available for download directly from python.org. Follow the instructions for the desired runtime.
-
Download revoscalepy for Linux.
-
Install revoscalepy for the root user, specifying the absolute file path to the
.whlfile.pip install /path/to/revoscalepy.whl
-
Verify the revoscalepy installation from the python terminal. Verify the library can be imported.
import revoscalepy
-
Configure the installed Python runtime with SQL Server, where
/path/tois the path to the python installation binary, andpythonFolderVersionis the desired version of Python installed, for example,python3.10. Use the following script with your actual installation path:/opt/mssql/bin/mssql-conf set extensibility pythonbinpath /path/to/pythonFolderVersion /opt/mssql/bin/mssql-conf set extensibility datadirectories /path/to/:/path/to/lib/pythonFolderVersion/dist-packages/
-
Restart the Launchpad service.
systemctl restart mssql-launchpadd.service
-
Configure SQL Server for Linux to allow external scripts using the
sp_configuresystem stored procedure.EXEC sp_configure 'external scripts enabled', 1; GO RECONFIGURE GO
-
Verify the installation by executing a simple T-SQL command to return the version of python:
EXEC sp_execute_external_script @script=N'import sys;print(sys.version)',@language=N'Python'; GO
Installation for SQL Server 2022 (Preview) Machine Learning Services for Red Hat Enterprise Linux (RHEL) is currently not supported.
Installation for SQL Server 2022 (Preview) Machine Learning Services for SUSE Linux Enterprise Server (SLES) v15 is currently not supported.
To install the Java language extension, see Install SQL Server Java Language Extension on Linux.
To validate installation:
-
Run a T-SQL script that executes a system stored procedure invoking Python or R using a query tool.
-
Execute the following SQL command to test R execution in SQL Server. Errors? Try a service restart,
sudo systemctl restart mssql-server.service.EXEC sp_execute_external_script @language =N'R', @script=N' OutputDataSet <- InputDataSet', @input_data_1 =N'SELECT 1 AS hello' WITH RESULT SETS (([hello] int not null)); GO
-
Execute the following SQL command to test Python execution in SQL Server.
EXEC sp_execute_external_script @language =N'Python', @script=N' OutputDataSet = InputDataSet; ', @input_data_1 =N'SELECT 1 AS hello' WITH RESULT SETS (([hello] int not null)); GO
Follow the Offline installation instructions for steps on installing the packages. Find your download site, and then download specific packages using the package list below.
Tip
Several of the package management tools provide commands that can help you determine package dependencies. For yum, use sudo yum deplist [package]. For Ubuntu, use sudo apt-get install --reinstall --download-only [package name] followed by dpkg -I [package name].deb.
Python developers can learn how to use Python with SQL Server by following these tutorials:
- Python tutorial: Predict ski rental with linear regression in SQL Server Machine Learning Services
- Python tutorial: Categorizing customers using k-means clustering with SQL Server Machine Learning Services
R developers can get started with some simple examples, and learn the basics of how R works with SQL Server. For your next step, see the following links: