How to speed up query on table with millions of rows
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ margin-bottom:0;
}
The Issue:
I'm working on a big table that consists about 37mln rows. Data include measurements of many devices made in certain time e.g. '2013-09-24 10:45:50'. Each day all of those devices are sending many measurements in different intervals on different times. I want to make a query which selects all the most actual ( 'actual' I mean the latest from all measurements made in each day) measurement of each day for 2 months e.g from 2013-01-01 to 2013-02-01.
The problem is that this query takes so much time to go, despite all of the indexes i've made on different columns. I've also created auxiliary table that contains max(MeterDate) and MeasurementsId when the measurement was given. I've noticed that index can't be made on MeterDate because it contains date and time which is not useful for making an index on it. So i converted the MeterDate -> CONVERT(date, MeterDate). I though that after joining The auxiliary table with [dbo].[Measurements] the query would be faster but still query takes more than 12s which is too long for me.
The structure of table:
Create table [dbo].[Measurements]
[Id] [int] IDENTITY(1,1) NOT NULL,
[ReadType_Id] [int] NOT NULL,
[Device_Id] [int] NULL,
[DeviceInterface] [tinyint] NULL,
[MeterDate] [datetime] NULL,
[MeasureValue] [decimal](18, 3) NULL
Every row of Measurements table include measurement value on direct MeterDate e.g. "2008-04-04 13:28:44.473"
Direct select structure:
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
SELECT *
FROM [dbo].[Measurements]
WHERE [MeterDate] BETWEEN @startdate and @enddate
Does anyone knows how to rebuilt table or add new or add indexes on which column that speed up query a bit ? Thanks in advance for any info.
Edit:
The table that I used was created by this query
with t1 as
(
Select [Device_Id], [DeviceInterface], CONVERT(date, MeterDate) as OnlyDate, Max(MeterDate) as MaxMeterDate
FROM [dbo].[Measurements]
GROUP BY [Device_Id], [DeviceInterface], CONVERT(date, MeterDate)
)
Select t1.[Device_Id], t1.[DeviceInterface],t1.[OnlyDate], r.Id
INTO [dbo].[MaxDatesMeasurements]
FROM t1
JOIN [dbo].[Measurements] as r ON r.Device_Id = t1.Device_Id AND r.DeviceInterface = t1.DeviceInterface AND r.MeterDate = t1.MaxMeterDate
Then I wanted to join the new created table [dbo].[MaxDatesMeasurements] with old [dbo].[Measurements] and select direct rows
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
Select *
From [dbo].[MaxDatesMeasurements] as t1
Join [dbo].[Measurements] as t2 on t1.[Id] = t2.[Id]
WHERE t1.[OnlyDate] BETWEEN @startdate AND @enddate
sql-server index query-performance clustered-index
bumped to the homepage by Community♦ 11 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
|
show 2 more comments
The Issue:
I'm working on a big table that consists about 37mln rows. Data include measurements of many devices made in certain time e.g. '2013-09-24 10:45:50'. Each day all of those devices are sending many measurements in different intervals on different times. I want to make a query which selects all the most actual ( 'actual' I mean the latest from all measurements made in each day) measurement of each day for 2 months e.g from 2013-01-01 to 2013-02-01.
The problem is that this query takes so much time to go, despite all of the indexes i've made on different columns. I've also created auxiliary table that contains max(MeterDate) and MeasurementsId when the measurement was given. I've noticed that index can't be made on MeterDate because it contains date and time which is not useful for making an index on it. So i converted the MeterDate -> CONVERT(date, MeterDate). I though that after joining The auxiliary table with [dbo].[Measurements] the query would be faster but still query takes more than 12s which is too long for me.
The structure of table:
Create table [dbo].[Measurements]
[Id] [int] IDENTITY(1,1) NOT NULL,
[ReadType_Id] [int] NOT NULL,
[Device_Id] [int] NULL,
[DeviceInterface] [tinyint] NULL,
[MeterDate] [datetime] NULL,
[MeasureValue] [decimal](18, 3) NULL
Every row of Measurements table include measurement value on direct MeterDate e.g. "2008-04-04 13:28:44.473"
Direct select structure:
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
SELECT *
FROM [dbo].[Measurements]
WHERE [MeterDate] BETWEEN @startdate and @enddate
Does anyone knows how to rebuilt table or add new or add indexes on which column that speed up query a bit ? Thanks in advance for any info.
Edit:
The table that I used was created by this query
with t1 as
(
Select [Device_Id], [DeviceInterface], CONVERT(date, MeterDate) as OnlyDate, Max(MeterDate) as MaxMeterDate
FROM [dbo].[Measurements]
GROUP BY [Device_Id], [DeviceInterface], CONVERT(date, MeterDate)
)
Select t1.[Device_Id], t1.[DeviceInterface],t1.[OnlyDate], r.Id
INTO [dbo].[MaxDatesMeasurements]
FROM t1
JOIN [dbo].[Measurements] as r ON r.Device_Id = t1.Device_Id AND r.DeviceInterface = t1.DeviceInterface AND r.MeterDate = t1.MaxMeterDate
Then I wanted to join the new created table [dbo].[MaxDatesMeasurements] with old [dbo].[Measurements] and select direct rows
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
Select *
From [dbo].[MaxDatesMeasurements] as t1
Join [dbo].[Measurements] as t2 on t1.[Id] = t2.[Id]
WHERE t1.[OnlyDate] BETWEEN @startdate AND @enddate
sql-server index query-performance clustered-index
bumped to the homepage by Community♦ 11 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
"despite all of the indexes i've made on different columns" SQL antipattern Lets index all columns because select query will be faster that many will hurt you with insert update en delete querys and it also means you could have an bigger index fragmentation... can you post the created table statement andthe querys you run on that table.. if you combinate MAX with GROUP MySQL really needs an multiple index. for example SELECT * FROM phonebook WHERE firstname LIKE 's%' ORDER BY number the index should be KEY(firstname, number)
– Raymond Nijland
Sep 25 '13 at 14:53
I posted created table statements in edit post. Thanks for response !
– KatarzynaMoss
Sep 26 '13 at 9:54
You have amysql
tag but all these[dbo]
smell like SQL-Server. Which of the two DBMS do you use?
– ypercubeᵀᴹ
Sep 26 '13 at 18:58
I edited the tags. Correct back in the unexpected case that you do use mysql.
– ypercubeᵀᴹ
Sep 26 '13 at 19:02
This article may be useful: simple-talk.com/sql/database-administration/…
– A-K
Sep 26 '13 at 20:01
|
show 2 more comments
The Issue:
I'm working on a big table that consists about 37mln rows. Data include measurements of many devices made in certain time e.g. '2013-09-24 10:45:50'. Each day all of those devices are sending many measurements in different intervals on different times. I want to make a query which selects all the most actual ( 'actual' I mean the latest from all measurements made in each day) measurement of each day for 2 months e.g from 2013-01-01 to 2013-02-01.
The problem is that this query takes so much time to go, despite all of the indexes i've made on different columns. I've also created auxiliary table that contains max(MeterDate) and MeasurementsId when the measurement was given. I've noticed that index can't be made on MeterDate because it contains date and time which is not useful for making an index on it. So i converted the MeterDate -> CONVERT(date, MeterDate). I though that after joining The auxiliary table with [dbo].[Measurements] the query would be faster but still query takes more than 12s which is too long for me.
The structure of table:
Create table [dbo].[Measurements]
[Id] [int] IDENTITY(1,1) NOT NULL,
[ReadType_Id] [int] NOT NULL,
[Device_Id] [int] NULL,
[DeviceInterface] [tinyint] NULL,
[MeterDate] [datetime] NULL,
[MeasureValue] [decimal](18, 3) NULL
Every row of Measurements table include measurement value on direct MeterDate e.g. "2008-04-04 13:28:44.473"
Direct select structure:
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
SELECT *
FROM [dbo].[Measurements]
WHERE [MeterDate] BETWEEN @startdate and @enddate
Does anyone knows how to rebuilt table or add new or add indexes on which column that speed up query a bit ? Thanks in advance for any info.
Edit:
The table that I used was created by this query
with t1 as
(
Select [Device_Id], [DeviceInterface], CONVERT(date, MeterDate) as OnlyDate, Max(MeterDate) as MaxMeterDate
FROM [dbo].[Measurements]
GROUP BY [Device_Id], [DeviceInterface], CONVERT(date, MeterDate)
)
Select t1.[Device_Id], t1.[DeviceInterface],t1.[OnlyDate], r.Id
INTO [dbo].[MaxDatesMeasurements]
FROM t1
JOIN [dbo].[Measurements] as r ON r.Device_Id = t1.Device_Id AND r.DeviceInterface = t1.DeviceInterface AND r.MeterDate = t1.MaxMeterDate
Then I wanted to join the new created table [dbo].[MaxDatesMeasurements] with old [dbo].[Measurements] and select direct rows
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
Select *
From [dbo].[MaxDatesMeasurements] as t1
Join [dbo].[Measurements] as t2 on t1.[Id] = t2.[Id]
WHERE t1.[OnlyDate] BETWEEN @startdate AND @enddate
sql-server index query-performance clustered-index
The Issue:
I'm working on a big table that consists about 37mln rows. Data include measurements of many devices made in certain time e.g. '2013-09-24 10:45:50'. Each day all of those devices are sending many measurements in different intervals on different times. I want to make a query which selects all the most actual ( 'actual' I mean the latest from all measurements made in each day) measurement of each day for 2 months e.g from 2013-01-01 to 2013-02-01.
The problem is that this query takes so much time to go, despite all of the indexes i've made on different columns. I've also created auxiliary table that contains max(MeterDate) and MeasurementsId when the measurement was given. I've noticed that index can't be made on MeterDate because it contains date and time which is not useful for making an index on it. So i converted the MeterDate -> CONVERT(date, MeterDate). I though that after joining The auxiliary table with [dbo].[Measurements] the query would be faster but still query takes more than 12s which is too long for me.
The structure of table:
Create table [dbo].[Measurements]
[Id] [int] IDENTITY(1,1) NOT NULL,
[ReadType_Id] [int] NOT NULL,
[Device_Id] [int] NULL,
[DeviceInterface] [tinyint] NULL,
[MeterDate] [datetime] NULL,
[MeasureValue] [decimal](18, 3) NULL
Every row of Measurements table include measurement value on direct MeterDate e.g. "2008-04-04 13:28:44.473"
Direct select structure:
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
SELECT *
FROM [dbo].[Measurements]
WHERE [MeterDate] BETWEEN @startdate and @enddate
Does anyone knows how to rebuilt table or add new or add indexes on which column that speed up query a bit ? Thanks in advance for any info.
Edit:
The table that I used was created by this query
with t1 as
(
Select [Device_Id], [DeviceInterface], CONVERT(date, MeterDate) as OnlyDate, Max(MeterDate) as MaxMeterDate
FROM [dbo].[Measurements]
GROUP BY [Device_Id], [DeviceInterface], CONVERT(date, MeterDate)
)
Select t1.[Device_Id], t1.[DeviceInterface],t1.[OnlyDate], r.Id
INTO [dbo].[MaxDatesMeasurements]
FROM t1
JOIN [dbo].[Measurements] as r ON r.Device_Id = t1.Device_Id AND r.DeviceInterface = t1.DeviceInterface AND r.MeterDate = t1.MaxMeterDate
Then I wanted to join the new created table [dbo].[MaxDatesMeasurements] with old [dbo].[Measurements] and select direct rows
DECLARE @startdate datetime= '2013-07-01';
DECLARE @enddate datetime = '2013-08-01';
Select *
From [dbo].[MaxDatesMeasurements] as t1
Join [dbo].[Measurements] as t2 on t1.[Id] = t2.[Id]
WHERE t1.[OnlyDate] BETWEEN @startdate AND @enddate
sql-server index query-performance clustered-index
sql-server index query-performance clustered-index
edited Sep 26 '13 at 19:02
ypercubeᵀᴹ
78.5k11137221
78.5k11137221
asked Sep 25 '13 at 9:20
KatarzynaMossKatarzynaMoss
1113
1113
bumped to the homepage by Community♦ 11 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
bumped to the homepage by Community♦ 11 mins ago
This question has answers that may be good or bad; the system has marked it active so that they can be reviewed.
"despite all of the indexes i've made on different columns" SQL antipattern Lets index all columns because select query will be faster that many will hurt you with insert update en delete querys and it also means you could have an bigger index fragmentation... can you post the created table statement andthe querys you run on that table.. if you combinate MAX with GROUP MySQL really needs an multiple index. for example SELECT * FROM phonebook WHERE firstname LIKE 's%' ORDER BY number the index should be KEY(firstname, number)
– Raymond Nijland
Sep 25 '13 at 14:53
I posted created table statements in edit post. Thanks for response !
– KatarzynaMoss
Sep 26 '13 at 9:54
You have amysql
tag but all these[dbo]
smell like SQL-Server. Which of the two DBMS do you use?
– ypercubeᵀᴹ
Sep 26 '13 at 18:58
I edited the tags. Correct back in the unexpected case that you do use mysql.
– ypercubeᵀᴹ
Sep 26 '13 at 19:02
This article may be useful: simple-talk.com/sql/database-administration/…
– A-K
Sep 26 '13 at 20:01
|
show 2 more comments
"despite all of the indexes i've made on different columns" SQL antipattern Lets index all columns because select query will be faster that many will hurt you with insert update en delete querys and it also means you could have an bigger index fragmentation... can you post the created table statement andthe querys you run on that table.. if you combinate MAX with GROUP MySQL really needs an multiple index. for example SELECT * FROM phonebook WHERE firstname LIKE 's%' ORDER BY number the index should be KEY(firstname, number)
– Raymond Nijland
Sep 25 '13 at 14:53
I posted created table statements in edit post. Thanks for response !
– KatarzynaMoss
Sep 26 '13 at 9:54
You have amysql
tag but all these[dbo]
smell like SQL-Server. Which of the two DBMS do you use?
– ypercubeᵀᴹ
Sep 26 '13 at 18:58
I edited the tags. Correct back in the unexpected case that you do use mysql.
– ypercubeᵀᴹ
Sep 26 '13 at 19:02
This article may be useful: simple-talk.com/sql/database-administration/…
– A-K
Sep 26 '13 at 20:01
"despite all of the indexes i've made on different columns" SQL antipattern Lets index all columns because select query will be faster that many will hurt you with insert update en delete querys and it also means you could have an bigger index fragmentation... can you post the created table statement andthe querys you run on that table.. if you combinate MAX with GROUP MySQL really needs an multiple index. for example SELECT * FROM phonebook WHERE firstname LIKE 's%' ORDER BY number the index should be KEY(firstname, number)
– Raymond Nijland
Sep 25 '13 at 14:53
"despite all of the indexes i've made on different columns" SQL antipattern Lets index all columns because select query will be faster that many will hurt you with insert update en delete querys and it also means you could have an bigger index fragmentation... can you post the created table statement andthe querys you run on that table.. if you combinate MAX with GROUP MySQL really needs an multiple index. for example SELECT * FROM phonebook WHERE firstname LIKE 's%' ORDER BY number the index should be KEY(firstname, number)
– Raymond Nijland
Sep 25 '13 at 14:53
I posted created table statements in edit post. Thanks for response !
– KatarzynaMoss
Sep 26 '13 at 9:54
I posted created table statements in edit post. Thanks for response !
– KatarzynaMoss
Sep 26 '13 at 9:54
You have a
mysql
tag but all these [dbo]
smell like SQL-Server. Which of the two DBMS do you use?– ypercubeᵀᴹ
Sep 26 '13 at 18:58
You have a
mysql
tag but all these [dbo]
smell like SQL-Server. Which of the two DBMS do you use?– ypercubeᵀᴹ
Sep 26 '13 at 18:58
I edited the tags. Correct back in the unexpected case that you do use mysql.
– ypercubeᵀᴹ
Sep 26 '13 at 19:02
I edited the tags. Correct back in the unexpected case that you do use mysql.
– ypercubeᵀᴹ
Sep 26 '13 at 19:02
This article may be useful: simple-talk.com/sql/database-administration/…
– A-K
Sep 26 '13 at 20:01
This article may be useful: simple-talk.com/sql/database-administration/…
– A-K
Sep 26 '13 at 20:01
|
show 2 more comments
1 Answer
1
active
oldest
votes
On reason this can happen is that you're using local variables.
The problem is that this query takes so much time to go, despite all
of the indexes i've made on different columns.
Here's an example using a similar setup. In the Stack Overflow schema there's a narrow-ish table called Votes that looks like this.
With no index on CreationDate
, our only option would be to scan the Clustered Index. But if we create one only on CreationDate
, the optimizer can choose to use that index if it thinks doing a Key Lookup for the rest of the columns is cheaper than scanning the Clustered Index and applying a predicate.
CREATE INDEX ix_yourmom ON dbo.Votes(CreationDate)
For this query:
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate;
GO
The cardinality estimate for unknown variables using between is 16.4317%. That leads to a clustered index scan and a missing index request for an index that covers the entire query.
If you run the query with RECOMPILE, you allow for the parameter embedding optimization.
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate
OPTION ( RECOMPILE );
Which gives us a different query plan, and a more accurate estimate.
Hope this helps!
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "182"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f50469%2fhow-to-speed-up-query-on-table-with-millions-of-rows%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
On reason this can happen is that you're using local variables.
The problem is that this query takes so much time to go, despite all
of the indexes i've made on different columns.
Here's an example using a similar setup. In the Stack Overflow schema there's a narrow-ish table called Votes that looks like this.
With no index on CreationDate
, our only option would be to scan the Clustered Index. But if we create one only on CreationDate
, the optimizer can choose to use that index if it thinks doing a Key Lookup for the rest of the columns is cheaper than scanning the Clustered Index and applying a predicate.
CREATE INDEX ix_yourmom ON dbo.Votes(CreationDate)
For this query:
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate;
GO
The cardinality estimate for unknown variables using between is 16.4317%. That leads to a clustered index scan and a missing index request for an index that covers the entire query.
If you run the query with RECOMPILE, you allow for the parameter embedding optimization.
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate
OPTION ( RECOMPILE );
Which gives us a different query plan, and a more accurate estimate.
Hope this helps!
add a comment |
On reason this can happen is that you're using local variables.
The problem is that this query takes so much time to go, despite all
of the indexes i've made on different columns.
Here's an example using a similar setup. In the Stack Overflow schema there's a narrow-ish table called Votes that looks like this.
With no index on CreationDate
, our only option would be to scan the Clustered Index. But if we create one only on CreationDate
, the optimizer can choose to use that index if it thinks doing a Key Lookup for the rest of the columns is cheaper than scanning the Clustered Index and applying a predicate.
CREATE INDEX ix_yourmom ON dbo.Votes(CreationDate)
For this query:
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate;
GO
The cardinality estimate for unknown variables using between is 16.4317%. That leads to a clustered index scan and a missing index request for an index that covers the entire query.
If you run the query with RECOMPILE, you allow for the parameter embedding optimization.
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate
OPTION ( RECOMPILE );
Which gives us a different query plan, and a more accurate estimate.
Hope this helps!
add a comment |
On reason this can happen is that you're using local variables.
The problem is that this query takes so much time to go, despite all
of the indexes i've made on different columns.
Here's an example using a similar setup. In the Stack Overflow schema there's a narrow-ish table called Votes that looks like this.
With no index on CreationDate
, our only option would be to scan the Clustered Index. But if we create one only on CreationDate
, the optimizer can choose to use that index if it thinks doing a Key Lookup for the rest of the columns is cheaper than scanning the Clustered Index and applying a predicate.
CREATE INDEX ix_yourmom ON dbo.Votes(CreationDate)
For this query:
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate;
GO
The cardinality estimate for unknown variables using between is 16.4317%. That leads to a clustered index scan and a missing index request for an index that covers the entire query.
If you run the query with RECOMPILE, you allow for the parameter embedding optimization.
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate
OPTION ( RECOMPILE );
Which gives us a different query plan, and a more accurate estimate.
Hope this helps!
On reason this can happen is that you're using local variables.
The problem is that this query takes so much time to go, despite all
of the indexes i've made on different columns.
Here's an example using a similar setup. In the Stack Overflow schema there's a narrow-ish table called Votes that looks like this.
With no index on CreationDate
, our only option would be to scan the Clustered Index. But if we create one only on CreationDate
, the optimizer can choose to use that index if it thinks doing a Key Lookup for the rest of the columns is cheaper than scanning the Clustered Index and applying a predicate.
CREATE INDEX ix_yourmom ON dbo.Votes(CreationDate)
For this query:
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate;
GO
The cardinality estimate for unknown variables using between is 16.4317%. That leads to a clustered index scan and a missing index request for an index that covers the entire query.
If you run the query with RECOMPILE, you allow for the parameter embedding optimization.
DECLARE @StartDate DATETIME = '2010-07-01';
DECLARE @EndDate DATETIME = '2010-07-02';
SELECT *
FROM dbo.Votes AS v
WHERE v.CreationDate BETWEEN @StartDate AND @EndDate
OPTION ( RECOMPILE );
Which gives us a different query plan, and a more accurate estimate.
Hope this helps!
answered Apr 11 '18 at 16:35
Erik DarlingErik Darling
22.8k1269113
22.8k1269113
add a comment |
add a comment |
Thanks for contributing an answer to Database Administrators Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdba.stackexchange.com%2fquestions%2f50469%2fhow-to-speed-up-query-on-table-with-millions-of-rows%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
"despite all of the indexes i've made on different columns" SQL antipattern Lets index all columns because select query will be faster that many will hurt you with insert update en delete querys and it also means you could have an bigger index fragmentation... can you post the created table statement andthe querys you run on that table.. if you combinate MAX with GROUP MySQL really needs an multiple index. for example SELECT * FROM phonebook WHERE firstname LIKE 's%' ORDER BY number the index should be KEY(firstname, number)
– Raymond Nijland
Sep 25 '13 at 14:53
I posted created table statements in edit post. Thanks for response !
– KatarzynaMoss
Sep 26 '13 at 9:54
You have a
mysql
tag but all these[dbo]
smell like SQL-Server. Which of the two DBMS do you use?– ypercubeᵀᴹ
Sep 26 '13 at 18:58
I edited the tags. Correct back in the unexpected case that you do use mysql.
– ypercubeᵀᴹ
Sep 26 '13 at 19:02
This article may be useful: simple-talk.com/sql/database-administration/…
– A-K
Sep 26 '13 at 20:01