Skip to content

Commit aaf4108

Browse files
Merge pull request #31982 from MikeRayMSFT/241003-polybase-partition-elimination
Polybase performance update.
2 parents eb27384 + 83802df commit aaf4108

1 file changed

Lines changed: 8 additions & 4 deletions

File tree

docs/relational-databases/polybase/polybase-performance.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Performance considerations for PolyBase in your SQL Server instance
44
author: MikeRayMSFT
55
ms.author: mikeray
66
ms.reviewer: hudequei
7-
ms.date: 01/17/2024
7+
ms.date: 10/04/2024
88
ms.service: sql
99
ms.subservice: polybase
1010
ms.custom: linux-related-content
@@ -66,11 +66,15 @@ WITH FULLSCAN, NORECOMPUTE;
6666
The `WITH` options are mandatory, and for the sample size, the allowed options are `FULLSCAN` and `SAMPLE n PERCENT`.
6767

6868
- To create single-column statistics for multiple columns, execute `CREATE STATISTICS` for each of the columns.
69-
- Multi-column statistics are not supported.
69+
- Multi-column statistics are not supported.
7070

7171
## Query partitioned data
7272

73-
Data is often organized in subfolders also called partitions. You can instruct the SQL Server instance to query only particular folders and files. Doing so reduces the number of files and the amount of data the query needs to read and process, resulting in better performance. This type of query optimization is known as partition pruning or *partition elimination*. You can eliminate partitions from query execution by using metadata function `filepath()` in the `WHERE` clause of the query.
73+
***Applies to*** [!INCLUDE [ssazuremi-md](../../includes/ssazuremi-md.md)] and [!INCLUDE [ssazuresynapse-md](../../includes/ssazuresynapse-md.md)].
74+
75+
When data is organized into folders or files (also called partitions), use *partition elimination* to query only specific folders and files. Partition elimination reduces the number of files and the amount of data the query needs to read and process, resulting in better performance.
76+
77+
To eliminate partitions from query execution, use the metadata function `filepath()` in the `WHERE` clause of the query.
7478

7579
First, create an external data source:
7680

@@ -111,7 +115,7 @@ ORDER BY filepath;
111115

112116
If your stored data isn't partitioned, consider partitioning it to improve query performance.
113117

114-
If you are using external tables, `filepath()` and `filename()` functions are supported but not in the `WHERE` clause. You can still filter by `filename` or `filepath` if you use them in computed columns. The following example demonstrates this:
118+
If you are using external tables, `filepath()` and `filename()` functions are supported but not in the `WHERE` clause. You can still filter by `filename` or `filepath` if you use them in computed columns. The following example demonstrates this:
115119

116120
```sql
117121
CREATE EXTERNAL TABLE tbl_TaxiRides (

0 commit comments

Comments
 (0)