MySQL :: Optimize sub query with large group by

New Topic

Optimize sub query with large group by

Posted by: Alex K
Date: March 10, 2011 07:44PM

I am having trouble working out a way to improve this query.
The problem is the group by object_id as there are about half as many object_ids as rows in the table.

The aim of the query is to count the number of objects that were accessed per person, where only the most recent person to access the object gets counted.

select person_id, count(id)
from (select id, person_id from events group by object_id) as subquery
group by person_id;

The query takes just over 2 minutes with about 1 million events.

Below is the create statement for the event table.

create table events(
    id int not null auto_increment,
    object_id int not null,
    person_id int not null,
    primary key(id),
    index(object_id),
    index(person_id)
) ENGINE=InnoDB;

To recreate this I used the following:

create table numbers(i int unsigned not null);
insert into numbers(i) values (0), (1), (2), (3), (4), (5), (6), (7), (8), (9);
insert into numbers(i) select a.i * b.i * c.i from numbers as a cross join numbers as b cross join numbers as c;

insert into events(object_id, person_id)        
    select
        floor((select count(i) from numbers) / 2 * rand()),
        floor(10 * rand())
    from numbers as a
        cross join numbers as b;
Query OK, 1020100 rows affected (4 min 2.54 sec)

select person_id, count(id)
from (select id, person_id from events group by object_id) as subquery
group by person_id;
+-----------+-----------+
| person_id | count(id) |
+-----------+-----------+
|         0 |        52 |
|         1 |        54 |
|         2 |        44 |
|         3 |        44 |
|         4 |        47 |
|         5 |        43 |
|         6 |        48 |
|         7 |        56 |
|         8 |        61 |
|         9 |        56 |
+-----------+-----------+
10 rows in set (2 min 15.38 sec)

Any help improving this would be much appreciated.

Navigate: Previous Message• Next Message

Options: Reply• Quote

Subject

Views

Written By

Posted

Optimize sub query with large group by

4080

Alex K

March 10, 2011 07:44PM

Re: Optimize sub query with large group by

2265

Rick James

March 13, 2011 03:52PM

Re: Optimize sub query with large group by

1983

Alex K

March 13, 2011 04:59PM

Re: Optimize sub query with large group by

2962

Rick James

March 13, 2011 11:32PM

Re: Optimize sub query with large group by

2262

Alex K

March 14, 2011 02:39AM

Re: Optimize sub query with large group by

1717

Rick James

March 14, 2011 06:31PM

Re: Optimize sub query with large group by

1616

Alex K

March 14, 2011 07:15PM

Re: Optimize sub query with large group by

1918

Øystein Grøvlen

March 15, 2011 04:32AM

Re: Optimize sub query with large group by

1675

Alex K

March 15, 2011 08:46PM

Re: Optimize sub query with large group by

2433

Øystein Grøvlen

March 16, 2011 02:02AM

Re: Optimize sub query with large group by

2340

Rick James

March 16, 2011 08:57AM

Re: Optimize sub query with large group by

1699

Alex K

March 17, 2011 12:40AM

Re: Optimize sub query with large group by

3691

Øystein Grøvlen

March 18, 2011 10:58AM

Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.