MySQL :: Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

New Topic

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

Posted by: Ted Wennmark
Date: May 25, 2021 12:17AM

Hello,

Something that is really bad for NDB internal batching is INSERTS with dependencies, like "INSERT IGNORE" or "INSERT ON DUPLICATE KEY".

Using "INSERT IGNORE" or "INSERT ON DUPLICATE KEY" you break batching in NDB Cluster, all ROWS are inserted one-by-one and not in bathes this is why NDB is slow in this case.
The solution is not handle this in the application code, fetch all the rows you want to insert then run a set of UPDATE/INSERT statements.

Simple test with "INSERT IGNORE":
1) Create simple test table
CREATE TABLE subscribers (
email VARCHAR(50) NOT NULL PRIMARY KEY
) engine=ndbcluster;

2) Generate some test data, only values part:
for i in {1..1000}; do echo "('kalle$i@gmail.com'),"; done > 1000-VALUES.sql
for i in {1..500}; do echo "('kalle$i@gmail.com'),"; done > 500-VALUES.sql

3) Run some tests:

Insert 500 values in one INSERT:
[opc@student1-server1 ~]$ time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted < INSERT-500.sql
real 0m0.024s
user 0m0.006s
sys 0m0.002s

Insert 1000 values using INSERT IGNORE (500 duplicates)
[opc@student1-server1 ~]$ time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted < INSERT-IGNORE-1000.sql
real 0m0.695s
user 0m0.005s
sys 0m0.003s

Truncate table and run some more test:
time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted -se "truncate table subscribers"

Insert 1000 values:
[opc@student1-server1 ~]$ time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted < INSERT-1000.sql
real 0m0.025s
user 0m0.007s
sys 0m0.001s

So, to summarize:
- Insert of 500 rows 0.024s
- Insert of 1000 rows 0.025s
- Insert ignore (with 500 duplicates) 0.695s ...

As you can see INSERT IGNORE is not optimal, it's much better (quicker) to first read all rows, remove duplicates from insert statement and then run the insert.

Last test, insert ignore with all duplicates:

First insert 1000 rows:
[opc@student1-server1 ~]$ time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted < INSERT-1000.sql
real 0m0.025s
user 0m0.007s
sys 0m0.001s

Then insert 1000 rows with INSERT IGNORE (all duplicates)
[opc@student1-server1 ~]$ time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted < INSERT-IGNORE-1000.sql
real 0m0.402s
user 0m0.003s
sys 0m0.004s

And again ....
[opc@student1-server1 ~]$ time mysql -uroot -S /tmp/mysql.mycluster.50.sock ted < INSERT-IGNORE-1000.sql
real 0m0.419s
user 0m0.007s
sys 0m0.001s

Navigate: Previous Message• Next Message

Options: Reply• Quote

Subject

Views

Written By

Posted

Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

1004

Walter Trapa

May 11, 2021 02:51AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

447

Peter Brawley

May 11, 2021 02:34PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

432

Walter Trapa

May 12, 2021 01:24AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

407

Peter Brawley

May 13, 2021 08:30AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

348

Walter Trapa

May 24, 2021 01:45AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

400

Walter Trapa

May 24, 2021 01:53AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

439

Peter Brawley

May 24, 2021 11:50AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

382

Walter Trapa

May 24, 2021 01:55PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

372

Peter Brawley

May 24, 2021 03:07PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

359

Walter Trapa

May 24, 2021 03:52PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

431

Peter Brawley

May 24, 2021 04:13PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

352

Walter Trapa

May 25, 2021 08:23AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

447

Ted Wennmark

May 25, 2021 12:17AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

421

Ted Wennmark

May 25, 2021 12:23AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

353

Walter Trapa

May 25, 2021 08:22AM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

442

Ted Wennmark

May 25, 2021 01:42PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

450

Walter Trapa

May 25, 2021 02:04PM

Re: Analysis: INSERT ON DUPLICATE UPDATE VS UPDATE

424

Ted Wennmark

May 27, 2021 01:20AM

Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.