You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/big-data-cluster/big-data-cluster-overview.md
+5-2Lines changed: 5 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -30,7 +30,9 @@ Use [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)
30
30
- Virtualize data with [PolyBase](../relational-databases/polybase/polybase-guide.md). Query data from external SQL Server, Oracle, Teradata, MongoDB, and ODBC data sources with external tables.
31
31
- Provide high availability for the SQL Server master instance and all databases by using Always On availability group technology.
32
32
33
-
For more information about new features and known issues for latest release, see the [release notes](release-notes-big-data-cluster.md).
33
+
For more information about new features and known issues for latest release, see the [release notes](release-notes-big-data-cluster.md).
34
+
35
+
For more information Big Data Clusters, see [Big Data Clusters FAQ](big-data-cluster-faq.yml).
34
36
35
37
## Scenarios
36
38
@@ -114,4 +116,5 @@ The storage pool consists of storage pool pods comprised of SQL Server on Linux,
114
116
115
117
## Next steps
116
118
117
-
For more information about deploying [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see [Get started with [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)]](deploy-get-started.md).
119
+
* For more information about deploying [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see [Get started with [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)]](deploy-get-started.md).
120
+
* Review the [Big Data Clusters FAQ](big-data-cluster-faq.yml).
Copy file name to clipboardExpand all lines: docs/big-data-cluster/concept-architecture-pods.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -183,5 +183,5 @@ For for specific open-source projects and versions, see [Open-source software re
183
183
To learn more about the [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see the following resources:
184
184
185
185
-[What are [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ver15.md)]?](big-data-cluster-overview.md)
186
-
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/Microsoft/sqlworkshops/tree/master/sqlserver2019bigdataclusters)
186
+
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/microsoft/sqlworkshops-bdc)
187
187
-[How to deploy [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] on Kubernetes](deployment-guidance.md#configfile)
Copy file name to clipboardExpand all lines: docs/big-data-cluster/concept-compute-pool.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -60,4 +60,4 @@ Scenarios where the compute pool is **not** used include:
60
60
To learn more about the [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see the following resources:
61
61
62
62
-[What are [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ver15.md)]?](big-data-cluster-overview.md)
63
-
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/Microsoft/sqlworkshops/tree/master/sqlserver2019bigdataclusters)
63
+
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/microsoft/sqlworkshops-bdc)
Copy file name to clipboardExpand all lines: docs/big-data-cluster/concept-data-pool.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -38,4 +38,4 @@ Access to the SQL server instances in the data pool is managed from the SQL Serv
38
38
To learn more about the [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see the following resources:
39
39
40
40
-[What are [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ver15.md)]?](big-data-cluster-overview.md)
41
-
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/Microsoft/sqlworkshops/tree/master/sqlserver2019bigdataclusters)
41
+
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/microsoft/sqlworkshops-bdc)
Copy file name to clipboardExpand all lines: docs/big-data-cluster/concept-master-instance.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,7 +15,7 @@ ms.technology: big-data-cluster
15
15
16
16
[!INCLUDE[SQL Server 2019](../includes/applies-to-version/sqlserver2019.md)]
17
17
18
-
This article describes the role of the *SQL Server master instance* in a SQL Server big data cluster. The master instance is a SQL Server instance running in a SQL Server big data cluster to manage connectivity, scale-out queries, metadata and user databases, and machine learning services.
18
+
This article describes the role of the *SQL Server master instance* in a SQL Server big data cluster. The master instance is a SQL Server instance running in a SQL Server big data cluster. The master instance manages connectivity, scale-out queries, metadata and user databases, and machine learning services.
19
19
20
20
The SQL Server master instance provides the following functionality:
21
21
@@ -25,11 +25,11 @@ The SQL Server master instance provides an externally accessible TDS endpoint fo
25
25
26
26
## Scale-out query management
27
27
28
-
The SQL Server master instance contains the scale-out query engine that is used to distribute queries across SQL Server instances on nodes in the [compute pool](concept-compute-pool.md). The scale-out query engine also provides access through Transact-SQL to all Hive tables in the cluster without any additional configuration.
28
+
The SQL Server master instance contains the scale-out query engine that is used to distribute queries across SQL Server instances on nodes in the [compute pool](concept-compute-pool.md). The scale-out query engine also provides access through Transact-SQL to all Hive tables in the cluster without any more configuration.
29
29
30
30
## Metadata and user databases
31
31
32
-
In addition to the standard SQL Server system databases, the SQL master instance also contains the following:
32
+
In addition to the standard SQL Server system databases, the SQL master instance also contains:
33
33
34
34
- A metadata database that holds HDFS-table metadata.
35
35
- A data plane shard map.
@@ -40,13 +40,13 @@ You can also choose to add your own user databases to the SQL Server master inst
40
40
41
41
## Machine learning services
42
42
43
-
SQL Server machine learning services is an add-on feature to the database engine, used for executing Java, R and Python code in SQL Server. This feature is based on the SQL Server extensibility framework, which isolates external processes from core engine processes, but fully integrates with the relational data as stored procedures, as T-SQL script containing R or Python statements, or as Java, R or Python code containing T-SQL.
43
+
The SQL Server machine learning services feature is an add-on feature to the database engine. The machine learning services feature used for executing Java, R and Python code in SQL Server. This feature is based on the SQL Server extensibility framework, which isolates external processes from core engine processes, but fully integrates with the relational data as stored procedures, as T-SQL script containing R or Python statements, or as Java, R or Python code containing T-SQL.
44
44
45
-
As part of a SQL Server big data cluster, machine learning services will be available on the SQL Server master instance by default. This means that once external script execution is enabled on the SQL Server master instance, it is going to be possible to execute Java, R and Python scripts using sp_execute_external_script.
45
+
As part of a SQL Server big data cluster, machine learning services will be available on the SQL Server master instance by default. Once external script execution is enabled on the SQL Server master instance, it is possible to execute Java, R and Python scripts using sp_execute_external_script.
46
46
47
47
### Advantages of machine learning services in a big data cluster
48
48
49
-
[!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] makes it easy for big data to be joined to the dimensional data typically stored in the enterprise database. The value of the big data greatly increases when it is not just in the hands of parts of an organization, but is also included in reports, dashboards, and applications. At the same time, data scientists can continue to use the Spark/HDFS ecosystem tools and have easy, realtime access to the data in the SQL Server master instance and in external data sources accessible _through_ the SQL Server master instance.
49
+
[!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] makes it easy for big data to be joined to the dimensional data typically stored in the enterprise database. The value of the big data greatly increases when it is not just in the hands of parts of an organization, but is also included in reports, dashboards, and applications. At the same time, data scientists can continue to use the Spark/HDFS ecosystem tools and have easy, real-time access to the data in the SQL Server master instance and in external data sources accessible _through_ the SQL Server master instance.
50
50
51
51
With [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], you can do more with your enterprise data lakes. SQL Server developers and analysts can:
52
52
@@ -65,4 +65,4 @@ With [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md
65
65
To learn more about the [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see the following resources:
66
66
67
67
-[What are [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ver15.md)]?](big-data-cluster-overview.md)
68
-
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/Microsoft/sqlworkshops/tree/master/sqlserver2019bigdataclusters)
68
+
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/microsoft/sqlworkshops-bdc)
Copy file name to clipboardExpand all lines: docs/big-data-cluster/concept-security.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -48,7 +48,7 @@ Currently, there is no option of opening up additional ports for accessing the c
48
48
49
49
## Authorization
50
50
51
-
Throughout the cluster, integrated security between different components allows the original user’s identity to be passed through when issuing queries from Spark and SQL Server, all the way to HDFS. As mentioned above, the various external cluster endpoints support AD authentication.
51
+
Throughout the cluster, integrated security between different components allows the original user's identity to be passed through when issuing queries from Spark and SQL Server, all the way to HDFS. As mentioned above, the various external cluster endpoints support AD authentication.
52
52
53
53
There are two levels of authorization checks in the cluster for managing data access. Authorization in the context of big data is done in SQL Server, using the traditional SQL Server permissions on objects and in HDFS with control lists (ACLs), which associate user identities with specific permissions.
54
54
@@ -101,6 +101,8 @@ Cluster admin password:
101
101
102
102
[What are [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ver15.md)]?](big-data-cluster-overview.md)
103
103
104
-
[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/Microsoft/sqlworkshops/tree/master/sqlserver2019bigdataclusters)
104
+
[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/microsoft/sqlworkshops-bdc)
Copy file name to clipboardExpand all lines: docs/big-data-cluster/concept-storage-pool.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -61,4 +61,4 @@ You can also interact with HDFS using:
61
61
To learn more about the [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)], see the following resources:
62
62
63
63
-[What are [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ver15.md)]?](big-data-cluster-overview.md)
64
-
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/Microsoft/sqlworkshops/tree/master/sqlserver2019bigdataclusters)
64
+
-[Workshop: Microsoft [!INCLUDE[big-data-clusters-2019](../includes/ssbigdataclusters-ss-nover.md)] Architecture](https://github.com/microsoft/sqlworkshops-bdc)
0 commit comments