You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/relational-databases/polybase/polybase-performance.md
+8-4Lines changed: 8 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ description: Performance considerations for PolyBase in your SQL Server instance
4
4
author: MikeRayMSFT
5
5
ms.author: mikeray
6
6
ms.reviewer: hudequei
7
-
ms.date: 01/17/2024
7
+
ms.date: 10/04/2024
8
8
ms.service: sql
9
9
ms.subservice: polybase
10
10
ms.custom: linux-related-content
@@ -66,11 +66,15 @@ WITH FULLSCAN, NORECOMPUTE;
66
66
The `WITH` options are mandatory, and for the sample size, the allowed options are `FULLSCAN` and `SAMPLE n PERCENT`.
67
67
68
68
- To create single-column statistics for multiple columns, execute `CREATE STATISTICS` for each of the columns.
69
-
- Multi-column statistics are not supported.
69
+
- Multi-column statistics are not supported.
70
70
71
71
## Query partitioned data
72
72
73
-
Data is often organized in subfolders also called partitions. You can instruct the SQL Server instance to query only particular folders and files. Doing so reduces the number of files and the amount of data the query needs to read and process, resulting in better performance. This type of query optimization is known as partition pruning or *partition elimination*. You can eliminate partitions from query execution by using metadata function `filepath()` in the `WHERE` clause of the query.
73
+
***Applies to***[!INCLUDE [ssazuremi-md](../../includes/ssazuremi-md.md)] and [!INCLUDE [ssazuresynapse-md](../../includes/ssazuresynapse-md.md)].
74
+
75
+
When data is organized into folders or files (also called partitions), use *partition elimination* to query only specific folders and files. Partition elimination reduces the number of files and the amount of data the query needs to read and process, resulting in better performance.
76
+
77
+
To eliminate partitions from query execution, use the metadata function `filepath()` in the `WHERE` clause of the query.
74
78
75
79
First, create an external data source:
76
80
@@ -111,7 +115,7 @@ ORDER BY filepath;
111
115
112
116
If your stored data isn't partitioned, consider partitioning it to improve query performance.
113
117
114
-
If you are using external tables, `filepath()` and `filename()` functions are supported but not in the `WHERE` clause. You can still filter by `filename` or `filepath` if you use them in computed columns. The following example demonstrates this:
118
+
If you are using external tables, `filepath()` and `filename()` functions are supported but not in the `WHERE` clause. You can still filter by `filename` or `filepath` if you use them in computed columns. The following example demonstrates this:
0 commit comments