Re: Vietnamese character matching with %LIKE%
Posted by: Rick James
Date: July 04, 2011 06:53PM

Here are some equivalences for some charsets/collations:
latin1 : latin1_german1_ci            B=b  C=c=Ç=ç  ch  cz       D=d      dz
latin1 : latin1_german2_ci            B=b  C=c=Ç=ç  ch  cz       D=d=Ð=ð  dz
latin1 : latin1_general_ci            B=b  C=c      ch  cz  Ç=ç  D=d      dz  Ð=ð
cp850 : cp850_general_ci              B=b  C=c      ch  cz  Ç=ç  D=d      dz  ð=Ð
utf8 : utf8_general_ci                B=b  C=c=Ç=ç  ch  cz       D=d      dz
utf8 : utf8_unicode_ci                B=b  C=c=Ç=ç  ch  cz       D=d      dz  Ð=ð
cp1250 : cp1250_general_ci            B=b  C=c=Ç=ç  ch  cz       D=d      dz
That is, I would expect `%D%` to match`Đ` in latin1_german2_ci, but not the others listed above.

For full Vietnamese collation, upgrade to 5.6:
Quote

Vietnamese collations were added for the Unicode character sets. Those based on Unicode Collation Algorithm 5.2.0 have names of the form xxx_vietnamese_520_ci (for example, utf8_vietnamese_520_ci). Those based on Unicode Collation Algorithm 4.0.0 have names of the form xxx_vietnamese_ci (for example, utf8_vietnamese_ci). These collations are the same as the corresponding xxx_unicode_520_ci and xxx_unicode_ci collations except for precomposed characters which are accented versions of “A”, “D”, “E”, “O”, and “U”.

Options: ReplyQuote


Subject
Views
Written By
Posted
Re: Vietnamese character matching with %LIKE%
2663
July 04, 2011 06:53PM


Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.