MySQL Forums
Forum List  »  InnoDB

Re: utf8 and utf8mb4 performance difference
Posted by: Rick James
Date: September 30, 2016 08:04PM

In 'unicode_520', for example, the two characters "ae" are treated as equal to the single character "æ". Ditto for the 'german2' collation. This is an example of where the the 'general' collation is a little faster -- because it takes less effort.

In most collations "lj" = "lj", but not in 'general'.

A longer list of such differences can be found here: http://mysql.rjweb.org/utf8_collations.html .

There is a third category of collation, 'bin', as in 'utf8_bin' and 'utf8mb4_bin'. These are the fastest, since they blindly compare bits; no case folding; no accent stripping. That is, "A" != "a" != "á", etc.; in all(?) other collations, those are treated as equal. (MySQL 8.0 promises to have some collations that will invalidate this last statement.)

Options: ReplyQuote


Subject
Views
Written By
Posted
3325
September 09, 2016 07:25AM
1548
September 12, 2016 08:55PM
1384
September 18, 2016 12:20AM
1067
September 19, 2016 02:14AM
1677
September 25, 2016 01:20AM
Re: utf8 and utf8mb4 performance difference
1847
September 30, 2016 08:04PM


Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.