• Home
  • SQL Server
    • Articles
    • T-SQL Puzzles
    • Output Puzzles
    • Interview Questions
    • Performance Tuning
    • SQL SERVER On Linux
    • Resources
  • SSRS
    • SSRS Articles
    • Interview Questions
  • SSAS
    • SSAS Articles
    • DAX
  • SQL Puzzles
  • Interview Questions
    • SQL Interview Questions
    • Data Interview Questions
  • Python Interview Puzzles
  • New Features(SQL SERVER)
    • SQL SERVER 2017
    • SQL SERVER 2016
    • SQL SERVER On Linux
  • Social
    • Expert Exchange
      • Top Expert in SQL
      • Yearly Award
      • Certifications
      • Achievement List
      • Top Expert of the Week
    • HackerRank (SQL)
    • StackOverflow
    • About Me
      • Contact Me
      • Blog Rules

Improving my SQL BI Skills

Improving my SQL BI Skills

Daily Archives: July 1, 2015

Why Order By is bad in Queries? / Execution/Query Plan Operator – The Sort Operator Basics

01 Wednesday Jul 2015

Posted by Pawan Kumar Khowal in SQL Performance Tuning

≈ Leave a comment

Tags

26 performance tuning questions and solutions, A block-sorting lossless data compression algorithm, Are "bad" statistics the reason my query is slow?, Avoiding Sort task in Execution Plan, Bad performance of SQL query due to ORDER BY clause, Blocked sort-based indexing, but the execution plan is still showing a distinct sort, Case in order clause causes bad page queries, Collapse, Estimated Execution Plan SQL Server Sort?, Execution plan - Sort Operator, How to tune SQL queries, How to understand SQL Server 2008 Execution Plan *sort*?, Identifying and Solving Sort Warnings Problems in SQL Server, Indexing for Sort Performance, Interview questions for SQL Server Performance Tuning, Looking for SQL Optimization Interview Questions, Operations in the SQL Sever execution plan, Optimizing SQL queries by removing Sort operator, Order By, performance sql server, Performance tips for faster SQL queries, Performance Tuning, Performance Tuning for SQL Server, Performance Tuning the Whole Query Plan, Query Optimization, Query Performance Tuning, Showplan Operator, Showplan Operator of the Week - SORT, ShowPlan Operator of the Week - Split, Sort, Sort - Is it Really a Blocking Iterator?, Sort operation in execution plan, SORT Operator Performance Improvement, Sort Showplan Operator, sorting - Blocking sort operators, SQL Complex Queries, SQL Optimization Interview Questions, sql performance, sql performance and tuning, sql performance explained pdf, sql performance tips, SQL Performance Tuning, sql performance tuning and optimization, sql performance tuning interview questions, sql performance tuning tips, SQL Query Optimizer, SQL Query Tuning or Query Optimization, sql server - Optimizing SQL queries by removing Sort, sql server - Why is there a sort showing up in my execution, SQL SERVER Interview questions, SQL server optimization interview questions and answers, sql server performance query, sql server performance slow, SQL Server Performance Tuning, SQL Server Performance Tuning Tips, SQL Server query execution plans, SQL SERVER Tips, SQL Tuning Overview, The blocking nature of aggregates, Tips for SQL Database Tuning and Performance, Top 10 performance tuning tips for relational databases, Understanding SQL Server's Query Execution Plans, Ways to minimize sort operations, Why is there a sort showing up in my execution plan?, Why Order By is bad in Queries?, Why order is bad in queries, with no ORDER BY clause


Why Order By is bad in Queries? / Execution/Query Plan Operator – The Sort Operator Basics

Download PDF – Execution Plan Operator – The Sort Operator – Basics

Order By is not always a bad thing in the query/execution plans.

Internally Order by is implemented by Sort operator or by an index.

For example, In the below examples, in first query we are having an clustered index scan and in the second one we are using a sort operator as we don’t have any index on TransactionDate column.

Pawan Kumar Khowal - The Sort Operator 1

The SORT operator sorts all rows received by the operator into order. Well in some cases, the SORT operation is performed in TempDB. This is because we don’t have proper indexes in place or we didn’t pass proper information to the query optimizer.

Also note that TempDB is used for all databases within the SQL Server Instance, this can lead to TempDB becoming a bottle-neck and thereby affecting performance.

Basically sort is required by many operators like Order By, Merge Join, Stream Aggregate, Distinct and Window functions. Let’s say you wanted the data in a particular order so its need to be sorted. This sort operator is not scaled well. It scales extra linearly. So Sorting 10000 rows is more than 10 time slower than sorting 1000 rows. Sort is very important and on the other hand it could degrade your queries also if you lots and lots of sort operator in your queries execution plan.

Always check you requirement whether you really need order by in your query or not. Lot of developer just put order by in their queries just to see the ordered output. Remember this is not free. There is cost involved for sort operator. This can really degrade your queries performance because sorting data at SQL Server is lot more expensive than at some other places like app tier.

Also note that doing smaller sorts is always better than a very large sort. Very large here refers to huge volume of data.

Algorithmic cost of sort is O (N*Log (N)) – N here is the number of input rows.

So mathematically we can prove that doing multiple mini sorts is much better than performing a sort on a large dataset. Example below-

Pawan Kumar Khowal - The Sort Operator 2

Example of Sort on smaller set of data

--

SELECT p.ProductID,tr.SalesOrderID
FROM [Production].[Product] p
CROSS APPLY
    (
      SELECT TOP 1 SalesOrderID
      FROM [Sales].[SalesOrderDetail] s
      WHERE s.ProductID = p.ProductID
	  ORDER BY SalesOrderID DESC
    ) tr

--

What we are doing in the above query is we are reading all product ids from products master table and then for each product id we are hitting orders table to find out the latest salesorderid which works perfectly in our case since we need only 1 salesorderid per product that too we are getting with the help of nonclustered index seek. Also check the cost each query took.

The quickest and easiest way to avoid a SORT operator is by creating an Index. As we know, indexes are ordered by the columns so if you create an index covering your query, the Query Optimizer identifies this index and uses it to avoid a SORT operation.

Summary

This is the small information about Sort Operator. I will write another port about sort operator where we will dig more into the topic like how internally sort functions, about methods like Quick sort and merge sort, how sort spills data into tempDB and disks, etc.

That’s all folks; I hope you’ve enjoyed learning about Sort Operator, and I’ll see you soon with more “Performance Tuning” articles.

Thanks!

Pawan Kumar Khowal

MSBISKills.com

Share this

  • LinkedIn
  • Facebook
  • Twitter
  • WhatsApp
  • Email

What is Selectivity? How we can use this in creating our indexes?

01 Wednesday Jul 2015

Posted by Pawan Kumar Khowal in SQL Performance Tuning

≈ Leave a comment

Tags

26 performance tuning questions and solutions, Cardinality vs. Selectivity, How to measure Index Selectivity, How to tune SQL queries, Index Selectivity and Column Order, Interview questions for SQL Server Performance Tuning, Looking for SQL Optimization Interview Questions, performance sql server, Performance tips for faster SQL queries, Performance Tuning, Performance Tuning for SQL Server, Query Optimization, Query Performance Tuning, selectivity, Selectivity in SQL Databases, SQL Complex Queries, SQL Optimization Interview Questions, sql performance, sql performance and tuning, sql performance explained pdf, sql performance tips, SQL Performance Tuning, sql performance tuning and optimization, sql performance tuning interview questions, sql performance tuning tips, SQL Query Optimizer, SQL Query Tuning or Query Optimization, SQL SERVER Interview questions, SQL server optimization interview questions and answers, sql server performance query, sql server performance slow, SQL Server Performance Tuning, SQL Server Performance Tuning Tips, SQL SERVER Tips, SQL Tuning Overview, Tips for SQL Database Tuning and Performance, Top 10 performance tuning tips for relational databases, What is index selectivity?


What is Selectivity? How we can use this in creating our indexes?

Download PDF – Selectivity?

Hi Guys Today I am going to talk about Selectivity in performance tuning and optimization. Selectivity refers to number of unique values in a column. Now if the column has high number of unique values then we should use that column as the first member while creating indexes. We will see this in a short span of time. If the column has high number of unique values then it means it has high selectivity. In layman terms we can use that column in where clause more as compared to the remaining columns.

Now most people say that if the column is highly selective then we should use it in the index as a first column in a multi column index. This is because histogram builds on the first column and the Meta data builds on the first column. It also helps when the B+ tree is internally created based on the unique values.

But there are scenarios where this is not the case. Let’s go through an example where this rule does not work.

First let’s create a table and insert some data.

--

CREATE TABLE TestIndexes
(
	 ID INT
	,NAME VARCHAR(5)
	,GENDER VARCHAR(1)
)
GO

INSERT INTO TestIndexes
SELECT	  Number ID
		, LEFT(NEWID(),5) NAME
		, CASE WHEN LEFT(RAND() * 10 + 1 ,1) * 5 % 2 = 0 THEN 'M' ELSE 'F' END GENDER 
FROM master..spt_values WHERE Number > 0 AND Number < 3000
UNION ALL
SELECT	  Number + 1 ID
		, LEFT(NEWID(),5) NAME
		, CASE WHEN LEFT(RAND() * 10 + 1 ,1) * 5 % 2 = 0 THEN 'F' ELSE 'M' END GENDER 
FROM master..spt_values WHERE Number > 0 AND Number < 3000


--

Ok so the above query will create a new table for us and will insert 4496 records for us. Now let’s say we write a query which will give us all males and ID is between 50 and 10000.

--

SELECT Gender,ID FROM TestIndexes
WHERE GENDER = 'M' AND ID > 50 AND ID < 10000


--

Now when you execute the above query you will get desired data with a table scan since we don’t have any index.

Ok now let’s create an index. For that we need to consider the selectivity. In this case since Id column has high selectivity as it has more unique values then the gender column which has only 2 values ‘M’ and ‘F’.

Let’s go ahead and create that index.

--

CREATE NONCLUSTERED INDEX Ix_ID_Gender ON TestIndexes(ID,GENDER)

--

Note – Please note that the above index is a multi-column NonClustered index and we are using ID as a first column. Now execute the query again and check the execution plan and Statistics. [Add actual execution plan and Statistics IO ON]

Pawan Kumar Khowal - Selectivity 1

So we are doing an index seek which is good and our stats are also good as estimated and actual number of rows is also same. In this case we are reading 15 logical pages.

Now let’s drop this newly created index and create the same index with Gender as the first column and ID as the second column.

Now let’s drop this newly created index and create the same index with Gender as the first column and ID as the second column.

--

DROP INDEX Ix_ID_Gender ON TestIndexes
GO

CREATE NONCLUSTERED INDEX Ix_ID_Gender ON TestIndexes(GENDER,ID)
GO


--

Okay so our new index is in place let’s executing our query again and check the execution plan and statistics.

Pawan Kumar Khowal - Selectivity 2

So it’s clearly visible that the new query and the old query’s execution plans are same. But their statistics are different. Query with the old index is having 15 logical reads and the new query is having only 9 logical reads.

Summary

So all in all we can say that if the column is highly selective then we should use it in the index as a first column but this is not the case always. So always create the index and try out different options against the number of queries and then finalize the index.

That’s all folks; I hope you’ve enjoyed learning about selectivity and how we can effectively use it while designing our indexes, and I’ll see you soon with more “Performance Tuning” articles.

Thanks!

Pawan Kumar Khowal

MSBISKills.com

Share this

  • LinkedIn
  • Facebook
  • Twitter
  • WhatsApp
  • Email

Blog Stats

  • 1,084,487 hits

Enter your email address to follow this blog and receive notifications of new posts by email.

Join 1,131 other subscribers

Pawan Khowal

502 SQL Puzzles with answers

Achievement - 500 PuzzlesJuly 18, 2018
The big day is here. Finally presented 500+ puzzles for SQL community.

200 SQL Server Puzzle with Answers

The Big DayAugust 19, 2016
The big day is here. Completed 200 SQL Puzzles today

Archives

July 2015
M T W T F S S
 12345
6789101112
13141516171819
20212223242526
2728293031  
« Jun   Aug »

Top Articles

  • pawankkmr.wordpress.com/2…
  • pawankkmr.wordpress.com/2…
  • pawankkmr.wordpress.com/2…
  • pawankkmr.wordpress.com/2…
  • pawankkmr.wordpress.com/2…

Archives

  • October 2020 (29)
  • September 2018 (2)
  • August 2018 (6)
  • July 2018 (25)
  • June 2018 (22)
  • May 2018 (24)
  • April 2018 (33)
  • March 2018 (35)
  • February 2018 (53)
  • January 2018 (48)
  • December 2017 (32)
  • November 2017 (2)
  • October 2017 (20)
  • August 2017 (8)
  • June 2017 (2)
  • March 2017 (1)
  • February 2017 (18)
  • January 2017 (2)
  • December 2016 (5)
  • November 2016 (23)
  • October 2016 (2)
  • September 2016 (14)
  • August 2016 (6)
  • July 2016 (22)
  • June 2016 (27)
  • May 2016 (15)
  • April 2016 (7)
  • March 2016 (5)
  • February 2016 (7)
  • December 2015 (4)
  • October 2015 (23)
  • September 2015 (31)
  • August 2015 (14)
  • July 2015 (16)
  • June 2015 (29)
  • May 2015 (25)
  • April 2015 (44)
  • March 2015 (47)
  • November 2012 (1)
  • July 2012 (8)
  • September 2010 (26)
  • August 2010 (125)
  • July 2010 (2)

Article Categories

  • Analysis Services (6)
    • DAX (6)
  • Data (2)
    • Data warehousing (2)
  • Integration Services (2)
  • Magazines (3)
  • Python (29)
  • Reporting Services (4)
  • SQL SERVER (820)
    • Download SQL Interview Q's (212)
    • SQL Concepts (323)
    • SQL Performance Tuning (155)
    • SQL Puzzles (331)
    • SQL SERVER 2017 Linux (6)
    • SQL Server Interview Questions (308)
    • SQL SERVER Puzzles (332)
    • T SQL Puzzles (547)
    • Tricky SQL Queries (439)
  • UI (30)
    • ASP.NET (5)
    • C# (13)
    • CSS (9)
    • OOPS (3)
  • Uncategorized (5)

Recent Posts

  • Python | The Print and Divide Puzzle October 30, 2020
  • Python | Count consecutive 1’s from a list of 0’s and 1’s October 30, 2020
  • Python | How to convert a number into a list of its digits October 26, 2020
  • Python | Validate an IP Address-IPV6(Internet Protocol version 6) October 26, 2020
  • Python | Print the first non-recurring element in a list October 26, 2020
  • Python | Print the most recurring element in a list October 26, 2020
  • Python | Find the cumulative sum of elements in a list October 26, 2020
  • Python | Check a character is present in a string or not October 26, 2020
  • Python | Check whether a string is palindrome or not October 26, 2020
  • Python | Find the missing number in the array of Ints October 26, 2020
  • Python | How would you delete duplicates in a list October 26, 2020
  • Python | Check whether an array is Monotonic or not October 26, 2020
  • Python | Check whether a number is prime or not October 26, 2020
  • Python | Print list of prime numbers up to a number October 26, 2020
  • Python | Print elements from odd positions in a list October 26, 2020
  • Python | Print positions of a string present in another string October 26, 2020
  • Python | How to sort an array in ascending order October 26, 2020
  • Python | How to reverse an array October 26, 2020
  • Python | Find un-common words from two strings October 26, 2020
  • Python | How to convert a string to a list October 26, 2020
  • Python | Find unique words from a string October 26, 2020
  • Python | Calculate average word length from a string October 26, 2020
  • Python | Find common words from two strings October 26, 2020
  • Python | Find the number of times a substring present in a string October 26, 2020
  • Python | Find maximum value from a list October 26, 2020
  • Python | How to find GCF of two numbers October 26, 2020
  • Python | How to find LCM of two numbers October 26, 2020
  • Python | How to convert a list to a string October 26, 2020
  • Python | Replace NONE by its previous NON None value October 26, 2020
  • Microsoft SQL Server 2019 | Features added to SQL Server on Linux September 26, 2018

Create a website or blog at WordPress.com

  • Follow Following
    • Improving my SQL BI Skills
    • Join 231 other followers
    • Already have a WordPress.com account? Log in now.
    • Improving my SQL BI Skills
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...
 

You must be logged in to post a comment.