Re: Best collation to use for ancient languages
Posted by: Peter Gulutzan
Date: May 26, 2011 01:25PM

Rick James wrote:
> The collation may as well be utf8_general_ci;
> utf8_unicode_ci probably does nothing different.

The differences won't matter for this questioner.
But in case anyone sees this and concludes that "general" is always
just as good, I'll add a warning: "unicode" is better for the more obscure cases.

For example, here are two MySQL-collation-sequence charts:
http://www.collation-charts.org/mysql60/mysql604.utf8_unicode_ci.european.html
http://www.collation-charts.org/mysql60/mysql604.utf8_general_ci.european.html
Scroll down till you see the Greek characters alpha beta gamma
delta epsilon. Notice the next characters after epsilon.
In the "general" collation the next character is zeta.
In the "unicode" collation the next character is digamma.
One never sees digammas in the Bible,
but the "unicode" collation is correct: digamma traditionally
belongs between epsilon and zeta.
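To make the difference concrete, here is a rough Python sketch. It is not MySQL's implementation of either collation. It assumes that plain code-point comparison approximates what the "general" chart shows for these few letters, and it uses a hand-written seven-letter table (the name ALPHABET is made up for this illustration) as a stand-in for the "unicode" ordering.

```python
import unicodedata

EPSILON = "\u03b5"   # GREEK SMALL LETTER EPSILON
ZETA = "\u03b6"      # GREEK SMALL LETTER ZETA
DIGAMMA = "\u03dd"   # GREEK SMALL LETTER DIGAMMA (archaic)

# Alpha, beta, gamma, delta, plus the three letters of interest, shuffled.
letters = [ZETA, DIGAMMA, "\u03b1", EPSILON, "\u03b4", "\u03b2", "\u03b3"]

# Assumption: code-point order stands in for the "general" chart.
# Digamma's code point lies outside the main alphabet block, so it sorts last.
by_code_point = sorted(letters)

# Assumption: a hand-written table stands in for the "unicode" chart,
# with digamma placed right after epsilon. It covers only these letters.
ALPHABET = "\u03b1\u03b2\u03b3\u03b4" + EPSILON + DIGAMMA + ZETA
by_alphabet = sorted(letters, key=ALPHABET.index)

def names(chars):
    """Return the last word of each character's Unicode name."""
    return [unicodedata.name(ch).split()[-1] for ch in chars]

print("code point:", names(by_code_point))
print("alphabet:  ", names(by_alphabet))
```

In the first list zeta follows epsilon and digamma trails at the end; in the second, digamma follows epsilon. To see what a real server does, sort a column with ORDER BY ... COLLATE under each collation and compare the results.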

Peter Gulutzan
Oracle / MySQL

