MySQL :: Problem with unicode character comparison


Problem with unicode character comparison

Posted by: Brooks Brown
Date: April 12, 2005 12:02PM

I was going to reply to Dimitry Libertas's post, as I believe my problem is similar to his, but I thought it would get more attention as a separate post.

I am using the utf8_unicode_ci collation and am very frustrated with its support for "expansions" (see http://dev.mysql.com/doc/mysql/en/charset-unicode-sets.html). Evidently 'a-acute' or 'a-umlaut' is treated as equal to 'a', which is generally NOT what is desired. For example, a customer in Sweden is complaining that searching for 'man' yields matches for 'Människan'.
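
To make the complaint concrete, here is a rough Python sketch of the kind of comparison I am seeing. It is only an approximation: it strips combining accents and folds case, which is an assumption about how the collation behaves for these characters, not MySQL's actual weighting algorithm. The helper names fold_accents and loose_equal are made up for illustration.

```python
import unicodedata


def fold_accents(text: str) -> str:
    """Build an accent- and case-insensitive key (an approximation only)."""
    # NFD splits an accented letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks, keeping only the base letters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def loose_equal(a: str, b: str) -> bool:
    """Compare two strings while ignoring accents and case."""
    return fold_accents(a) == fold_accents(b)


if __name__ == "__main__":
    word = "M\u00e4nniskan"  # "Människan"
    print(fold_accents(word))
    print(loose_equal("m\u00e4n", "man"))
    print("man" in fold_accents(word))
```

Under this kind of folding, a substring search for 'man' finds 'Människan', which is exactly the false match the customer reported.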

Using the binary collation (utf8_bin) is not a good option, as the sorting that this would produce is not desirable.
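
As a small illustration of why binary ordering is unattractive, this Python sketch sorts a few words by their UTF-8 bytes, which is my assumption of roughly what a binary collation does. The word list is invented for the example.

```python
# Sample words; "\u00e4rlig" is "ärlig" written with an escape for clarity.
words = ["zebra", "Apple", "\u00e4rlig", "banana"]

# Sort by the raw UTF-8 byte sequence rather than by any linguistic rule.
binary_order = sorted(words, key=lambda w: w.encode("utf-8"))

if __name__ == "__main__":
    print(binary_order)
```

Uppercase letters sort ahead of all lowercase ones, and the accented word ends up after 'zebra', which is not what a user browsing an alphabetical list expects.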

We could programmatically weed out false matches, but this would be disruptive and would unnecessarily complicate our application, which supports multiple database servers.
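
For reference, the client-side filtering would look something like the following Python sketch. It assumes the database has already returned candidate strings and that we still want case-insensitive matching; the function names are hypothetical and not part of our application.

```python
import unicodedata


def accent_sensitive_key(text: str) -> str:
    """Fold case but keep accents, after normalizing to composed form."""
    return unicodedata.normalize("NFC", text).casefold()


def drop_false_matches(candidates: list[str], term: str) -> list[str]:
    """Keep only candidates that contain the term with accents respected."""
    needle = accent_sensitive_key(term)
    return [c for c in candidates if needle in accent_sensitive_key(c)]


if __name__ == "__main__":
    rows = ["M\u00e4nniskan", "Mannen", "en man"]
    print(drop_false_matches(rows, "man"))
```

Every query path that does text matching would need a pass like this, which is the complication we would rather avoid.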

Also, as Mr. Libertas observes, the LIKE operator behaves differently from '='.


Subject | Views | Written By | Posted
Problem with unicode character comparison | 5547 | Brooks Brown | April 12, 2005 12:02PM
Re: Problem with unicode character comparison | 2583 | Alexander Barkov | April 14, 2005 05:16AM
Re: Problem with unicode character comparison | 2550 | Brooks Brown | April 18, 2005 03:54PM
Re: Problem with unicode character comparison | 2172 | Alexander Barkov | May 07, 2005 07:47AM

