OPTION (RECOMPILE) is Always Faster; Why?

Question

I encountered an odd situation where appending OPTION (RECOMPILE) to my query causes it to run in half a second, while omitting it causes the query to take well over five minutes.

This is the case when the query is executed from Query Analyzer or from my C# program via SqlCommand.ExecuteReader(). Calling (or not calling) DBCC FREEPROCCACHE or DBCC DROPCLEANBUFFERS makes no difference; query results are always returned instantaneously with OPTION (RECOMPILE) and take more than five minutes without it. The query is always called with the same parameters (for the sake of this test).

I'm using SQL Server 2008.

I'm fairly comfortable with writing SQL but have never used an OPTION command in a query before, and I was unfamiliar with the whole concept of plan caches until scanning the posts on this forum. My understanding from the posts is that OPTION (RECOMPILE) is an expensive operation. It apparently creates a new lookup strategy for the query. So why is it, then, that subsequent queries that omit the OPTION (RECOMPILE) are so slow? Shouldn't the subsequent queries be making use of the lookup strategy that was computed on the previous call, which included the recompilation hint?

Is it highly unusual to have a query that requires a recompilation hint on every single call?

Sorry for the entry-level question, but I can't really make heads or tails of this.

UPDATE: I've been asked to post the query...

    select acctNo, min(date) earliestDate
    from (
        select acctNo, tradeDate as date
        from datafeed_trans
        where feedid = @feedID and feedDate = @feedDate

        union

        select acctNo, feedDate as date
        from datafeed_money
        where feedid = @feedID and feedDate = @feedDate

        union

        select acctNo, feedDate as date
        from datafeed_jnl
        where feedid = @feedID and feedDate = @feedDate
    ) t1
    group by t1.acctNo
    OPTION (RECOMPILE)

When running the test from Query Analyzer, I prepend the following lines:

    declare @feedID int
    select @feedID = 20

    declare @feedDate datetime
    select @feedDate = '1/2/2009'

When calling it from my C# program, the parameters are passed in via the SqlCommand.Parameters property.

For the purposes of this discussion, you can assume that the parameters never change, so we can rule out sub-optimal parameter sniffing as the cause.

User · Answer

To add to the excellent list (given by @CodeCowboyOrg) of situations where OPTION (RECOMPILE) can be very helpful:

Table Variables. When you are using table variables, there will not be any pre-built statistics for the table variable, often leading to large differences between estimated and actual rows in the query plan. Using OPTION (RECOMPILE) on queries with table variables allows generation of a query plan that has a much better estimate of the row numbers involved.

I had a particularly critical use of a table variable that was unusable, and which I was going to abandon, until I added OPTION (RECOMPILE). The run time went from hours to just a few minutes. That is probably unusual, but in any case, if you are using table variables and working on optimizing, it's well worth seeing whether OPTION (RECOMPILE) makes a difference.
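A minimal sketch of the table-variable case, using only system catalog views so that it can be run on any SQL Server database. The table variable @ids and the query itself are illustrative assumptions, not taken from the answer. Comparing the estimated and actual row counts for @ids in the execution plan, with and without the hint, should show the difference being described.

```sql
-- Illustrative only: @ids is a hypothetical table variable filled from a catalog view.
DECLARE @ids TABLE (object_id int PRIMARY KEY);

INSERT INTO @ids (object_id)
SELECT object_id
FROM sys.all_objects;

-- Without the hint, the plan is typically compiled before @ids is populated,
-- so the optimizer tends to assume it holds very few rows.
-- With OPTION (RECOMPILE), the statement is compiled when it runs, and the
-- optimizer can see how many rows @ids actually contains.
SELECT o.name, COUNT(*) AS column_count
FROM @ids AS i
JOIN sys.all_objects AS o ON o.object_id = i.object_id
JOIN sys.all_columns AS c ON c.object_id = o.object_id
GROUP BY o.name
OPTION (RECOMPILE);
```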

User · Answer

The very first action before tuning queries is to defragment or rebuild the indexes and update the statistics; otherwise you're wasting your time.

You must check the execution plan to see if it's stable (that is, whether it stays the same when you change the parameters). If not, you might have to create a covering index (in this case, one for each table). Knowing the system, you can create one that is useful for other queries too.

As an example:

    create index idx01_datafeed_trans
    On datafeed_trans (feedid, feedDate)
    INCLUDE (acctNo, tradeDate)

If the plan is stable, or you can stabilize it, you can execute the statement through sp_executesql to save and reuse a fixed execution plan.

If the plan is unstable, you have to use an ad-hoc statement or EXEC('sql statement') to evaluate and create an execution plan each time, or a stored procedure created "with recompile".

Hope it helps.
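The two execution styles contrasted above could look roughly like this. It is a sketch that assumes the datafeed_trans table and the parameter values from the question; the variable names @stmt and @adhoc are made up for illustration.

```sql
-- Parameterized execution: the statement text stays constant, so the plan
-- compiled for it can be cached and reused across parameter values.
DECLARE @stmt nvarchar(max) = N'
    SELECT acctNo, tradeDate
    FROM datafeed_trans
    WHERE feedid = @feedID AND feedDate = @feedDate;';

EXEC sp_executesql
    @stmt,
    N'@feedID int, @feedDate datetime',
    @feedID = 20,
    @feedDate = '1/2/2009';

-- Ad-hoc execution: the values are embedded as literals, so each distinct
-- statement text is optimized for the specific values it contains.
DECLARE @adhoc nvarchar(max) = N'
    SELECT acctNo, tradeDate
    FROM datafeed_trans
    WHERE feedid = 20 AND feedDate = ''1/2/2009'';';

EXEC (@adhoc);
```

If the ad-hoc text is ever built from user input, the values must be validated or the parameterized form used instead, since concatenating input into SQL opens the door to injection.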

User · Answer

Often when there is a drastic difference from run to run of a query, I find that it is one of 5 issues.

1. STATISTICS - Statistics are out of date. A database stores statistics on the range and distribution of the values in various columns on tables and indexes. This helps the query engine to develop a "Plan" of attack for how it will do the query, for example the type of method it will use to match keys between tables: using a hash or looking through the entire set. You can call Update Statistics on the entire database or just certain tables or indexes. This slows down the query from one run to another because, when statistics are out of date, it's likely the query plan is not optimal for the newly inserted or changed data for the same query (explained more later below). It may not be proper to Update Statistics immediately on a Production database, as there will be some overhead, slowdown and lag depending on the amount of data to sample. You can also choose to use a Full Scan or Sampling to update Statistics. If you look at the Query Plan, you can then also view the statistics on the indexes in use with the command DBCC SHOW_STATISTICS (tablename, indexname). This will show you the distribution and ranges of the keys that the query plan is basing its approach on.

2. PARAMETER SNIFFING - The query plan that is cached is not optimal for the particular parameters you are passing in, even though the query itself has not changed. For example, if you pass in a parameter which only retrieves 10 out of 1,000,000 rows, then the query plan created may use a Hash Join; however, if the parameter you pass in will use 750,000 of the 1,000,000 rows, the plan created may be an index scan or table scan. In such a situation you can tell the SQL statement to use the option OPTION (RECOMPILE), or an SP to use WITH RECOMPILE, to tell the Engine this is a "Single Use Plan" and not to use a Cached Plan which likely does not apply. There is no rule on how to make this decision; it depends on knowing the way the query will be used by users.

3. INDEXES - It's possible that the query hasn't changed, but a change elsewhere, such as the removal of a very useful index, has slowed down the query.

4. ROWS CHANGED - The rows you are querying change drastically from call to call. Usually statistics are automatically updated in these cases. However, if you are building dynamic SQL or calling SQL within a tight loop, there is a possibility you are using an outdated Query Plan based on a drastically wrong number of rows or on stale statistics. Again, in this case OPTION (RECOMPILE) is useful.

5. THE LOGIC - It's the logic: your query is no longer efficient. It was fine for a small number of rows, but no longer scales. This usually involves more in-depth analysis of the Query Plan. For example, you can no longer do things in bulk, but have to chunk things and do smaller commits; or your Cross Product was fine for a smaller set but now takes up CPU and memory as it scales larger (this may also be true for using DISTINCT); you are calling a function for every row; your key matches don't use an index because of CASTING type conversion or NULLs or functions... Too many possibilities here.

In general, when you write a query, you should have some mental picture of roughly how certain data is distributed within your table. A column, for example, can have an evenly distributed number of different values, or it can be skewed, with a specific set of values 80% of the time; the distribution may vary frequently over time or be fairly static. This will give you a better idea of how to build an efficient query. It also gives you, when debugging query performance, a basis for building a hypothesis as to why the query is slow or inefficient.
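The commands mentioned under the first two items might be used along these lines. This is a sketch that assumes the datafeed_trans table from the question; the index name idx01_datafeed_trans and the procedure name usp_GetTransByFeed are hypothetical, and the 25 percent sample rate is an arbitrary illustration.

```sql
-- Item 1: refresh statistics on one table, either by reading every row...
UPDATE STATISTICS datafeed_trans WITH FULLSCAN;

-- ...or by sampling, which is cheaper on large tables but less precise.
UPDATE STATISTICS datafeed_trans WITH SAMPLE 25 PERCENT;

-- Inspect the histogram and density information the optimizer uses.
-- The second argument must name an existing index or statistics object.
DBCC SHOW_STATISTICS ('datafeed_trans', 'idx01_datafeed_trans');
GO

-- Item 2: a stored procedure whose plan is never cached, so each call is
-- compiled for the parameter values actually passed in.
CREATE PROCEDURE usp_GetTransByFeed
    @feedID int,
    @feedDate datetime
WITH RECOMPILE
AS
BEGIN
    SELECT acctNo, tradeDate
    FROM datafeed_trans
    WHERE feedid = @feedID AND feedDate = @feedDate;
END;
GO
```

GO is a batch separator understood by client tools such as Management Studio and sqlcmd, not a T-SQL statement; it is needed here because CREATE PROCEDURE must be the first statement in its batch.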

User · Answer

Necroing this question, but there's an explanation that no-one seems to have considered.

STATISTICS - Statistics are not available or misleading

If all of the following are true:

- The columns feedid and feedDate are likely to be highly correlated (e.g. a feed id is more specific than a feed date and the date parameter is redundant information).
- There is no index with both columns as sequential columns.
- There are no manually created statistics covering both these columns.

Then SQL Server may be incorrectly assuming that the columns are uncorrelated, leading to lower than expected cardinality estimates for applying both restrictions and a poor execution plan being selected.

The fix in this case would be to create a statistics object linking the two columns, which is not an expensive operation.
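A sketch of that fix for the three tables in the question. The statistics object names are invented for illustration, and the default sampling is assumed to be adequate.

```sql
-- Multi-column statistics give the optimizer density information for the
-- (feedid, feedDate) combination instead of treating the columns as independent.
CREATE STATISTICS st_datafeed_trans_feed
ON datafeed_trans (feedid, feedDate);

CREATE STATISTICS st_datafeed_money_feed
ON datafeed_money (feedid, feedDate);

CREATE STATISTICS st_datafeed_jnl_feed
ON datafeed_jnl (feedid, feedDate);
```

An index on (feedid, feedDate) would create equivalent statistics as a side effect, so separate statistics objects are only needed when no such index exists.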

User · Answer

There are times that using OPTION (RECOMPILE) makes sense. In my experience the only time this is a viable option is when you are using dynamic SQL. Before you explore whether this makes sense in your situation, I would recommend rebuilding your statistics. This can be done by running the following:

    EXEC sp_updatestats

And then recreating your execution plan. This will ensure that when your execution plan is created it will be using the latest information.

Adding OPTION (RECOMPILE) rebuilds the execution plan every time that your query executes. I have never heard that described as "creates a new lookup strategy", but maybe we are just using different terms for the same thing.

When a stored procedure is created (I suspect you are calling ad-hoc SQL from .NET, but if you are using a parameterized query then this ends up being a stored proc call), SQL Server attempts to determine the most effective execution plan for this query based on the data in your database and the parameters passed in (parameter sniffing), and then caches this plan. This means that if you create the query when there are 10 records in your database and then execute it when there are 100,000,000 records, the cached execution plan may no longer be the most effective.

In summary, I don't see any reason that OPTION (RECOMPILE) would be a benefit here. I suspect you just need to update your statistics and your execution plan. Rebuilding statistics can be an essential part of DBA work, depending on your situation. If you are still having problems after updating your stats, I would suggest posting both execution plans.

And to answer your question: yes, I would say it is highly unusual for your best option to be recompiling the execution plan every time you execute the query.
