<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel>
        <title>MySQL Forums - Character Sets, Collation, Unicode</title>
        <description>Forum for MySQL Character Sets, Collation, Unicode</description>
        <link>https://forums.mysql.com/list.php?103</link>
        <lastBuildDate>Tue, 10 Mar 2026 18:52:48 +0000</lastBuildDate>
        <generator>Phorum 5.2.23</generator>
        <item>
            <guid>https://forums.mysql.com/read.php?103,726503,726503#msg-726503</guid>
            <title>Collation and illegal mix (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,726503,726503#msg-726503</link>
            <description><![CDATA[ i have a 8.0.37 mysql server running on azure. The server, the database, all tables and all columns are set to character set latin1 and collate latin1_general_ci.<br />
<br />
when i create a view with unions from a user where the session variable collation_connection is utf8mb4_0900_ai_ci the view works as it should for every other user.<br />
<br />
if i create the view from a user where that session variable is &#039;latin1_general_ci&#039; every user gets the Illegal mix of collations for operation &#039;UNION&#039; error message.<br />
<br />
Is that a bug or do i have a fundamental misunderstanding of what&#039;s going on?]]></description>
            <dc:creator>Nigel Gomm</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Tue, 24 Sep 2024 13:56:01 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,725434,725434#msg-725434</guid>
            <title>Collations break on MySQL Update (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,725434,725434#msg-725434</link>
            <description><![CDATA[ We use a non-standard collation in order that customers can search for products using 14.4v or P-ABCDE. The period and hyphen in the voltage or part number would normally be treated by FullText as word breaks and by changing the collation this doesn&#039;t happen.<br />
<br />
Our hosting partner keeps &#039;updating&#039; MySQL and restoring the default XML files (Index.xml and latin1.xml) over our modified files and then the collation (1002) goes missing and the website fails.<br />
<br />
My question is how do we stop this? Is the Oracle MySQL update overwriting the files (unlikely) or are the hosting company doing it? They of course say it&#039;s the update so I&#039;d like to know if that&#039;s true before proceeding.<br />
<br />
Having a website down for a day each month is getting seriously tedious.<br />
<br />
Thanks.]]></description>
            <dc:creator>Simon Anthony</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Fri, 09 Aug 2024 09:44:52 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,723360,723360#msg-723360</guid>
            <title>Question marks instead of emoji when exporting from the mysql database to a .sql file (5 replies)</title>
            <link>https://forums.mysql.com/read.php?103,723360,723360#msg-723360</link>
            <description><![CDATA[ I would like to export my emoji database, but I&#039;m having a problem with the export. When I export my table to .sql some characters are replaced with &quot;?&quot;. more precisely, the symbols of the flags. For example, such data (USA 🇺🇸 ☺) became such data after export (USA ?? ☺). I use mysql workbench. utf8mb4 is everywhere in the database. After exporting, I viewed the file through Notepad, there were signs &quot;?&quot; imported the same file into another database, the same &quot;?&quot; signs. Tell me, maybe someone knows how to be in such a situation)<br />
<br />
I tried to put different encodings in the database, nothing works. looked through a bunch of different articles on this problem, could not solve anything, the problem remained]]></description>
            <dc:creator>Микола Расік</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Wed, 27 Mar 2024 08:31:37 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,709786,709786#msg-709786</guid>
            <title>Not able to insert arbitrary binary data / invalid UTF8 characters into a VARCHAR column (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,709786,709786#msg-709786</link>
            <description><![CDATA[ We have a VARCHAR(255) column using collation utf8_unicode_ci in a table.<br />
<br />
We can write arbitrary byte sequences (data that contains invalid UTF8 character sequences) using INSERT or UPDATE statements in MySQL 5.7.43. However, we get errors while performing the same actions with the same configurations (character set  utf8mb3 collate utf8mb3_unicode_ci) in MySQL 8.0.33.<br />
<br />
For example, I’ve tried the following<br />
<br />
INSERT INTO data_tests (data) VALUES (0xED\xA0\xBC\xED\xB7\xA9\xED\xA0\xBC\xED\xB7\xAA);<br />
<br />
In MySQL 5.7.43, the arbitrary byte sequences is written into the table successfully:<br />
<br />
Query OK, 1 row affected (0.01 sec)<br />
<br />
In MySQL 8.0.33, I get the following error:<br />
<br />
ERROR 1366 (HY000): Incorrect string value: &#039;\xED\xA0\xBC\xED\xB7\xA9...&#039; for column &#039;data&#039; at row 1<br />
<br />
I also tried CONVERT( … USING UTF8) or BINARY( … ), but neither of them are working in MySQL 8.0.33.<br />
<br />
How can I write an INSERT or UPDATE statement that bypasses the check/validation, allowing me to write arbitrary byte sequences in MySQL 8.0.33?]]></description>
            <dc:creator>Claude Pan</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 15 Jan 2024 09:40:06 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,706532,706532#msg-706532</guid>
            <title>utf8mb4 introducer added to generated column expression (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,706532,706532#msg-706532</link>
            <description><![CDATA[ Stock MySQL 8.0.31, Linux<br />
<br />
CREATE TABLE `test` (<br />
  `id` int NOT NULL AUTO_INCREMENT,<br />
  `test_col` VARCHAR(45) GENERATED ALWAYS AS (&quot;test value&quot;),<br />
  PRIMARY KEY (`id`)<br />
) ENGINE=InnoDB;<br />
SELECT GENERATION_EXPRESSION FROM information_schema.columns WHERE TABLE_NAME=&quot;test&quot; AND COLUMN_NAME=&quot;test_col&quot;;<br />
<br />
GENERATION_EXPRESSION = _utf8mb4\&#039;test value\&#039;<br />
<br />
Why does test_col&#039;s GENERATION_EXPRESSION gain the _utf8mb4 introducer? Is there something I can do to prevent this? My app has a technical need for this to not happen. Plus, this seems unnecessary with everything in MySQL 8 defaulting to utf8mb4. This began when we migrated from MySQL 5.7 to MySQL 8.0.31. <br />
<br />
Thanks.]]></description>
            <dc:creator>Jason Brunette</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Wed, 19 Oct 2022 15:44:59 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,705572,705572#msg-705572</guid>
            <title>The number of attributes is larger than the number of attribute values provided (500) (10 replies)</title>
            <link>https://forums.mysql.com/read.php?103,705572,705572#msg-705572</link>
            <description><![CDATA[ Hi All,<br />
<br />
For the longest time I have been using Visual FoxPro with MySQL as my database backend. Also, I have been using SELECT COUNT(*) to get the total number of rows for any SELECT statement.<br />
<br />
But this one is weird:<br />
<br />
select count(*) as totrecs from pihdr a left join supplier b on a.supplierid=b.id left join ewtax c on a.ewtaxid=c.id left join jthdr d on a.jthdrid=d.id<br />
<br />
<br />
In my ODBC trace log, this is what I found:<br />
<br />
DIAG [01000] [MySQL][ODBC 8.0(w)Driver][mysqld-8.0.30]The number of attributes is larger than the number of attribute values provided (500)<br />
<br />
The offending statement does not even show in the ODBC Trace log.<br />
<br />
That is why I tried substituting it with (which is already deprecated):<br />
<br />
select SQL_CALC_FOUND_ROWS * from pihdr a left join supplier b on<br />
a.supplierid=b.id left join ewtax c on a.ewtaxid=c.id left join jthdr d<br />
on a.jthdrid=d.id<br />
<br />
select FOUND_ROWS() as totrecs <br />
<br />
<br />
Still yields same error.<br />
<br />
My code works with MySQL ODBC 8.0.30 and MySQL Server 5.7.37 perfectly... when I run it with MySQL ODBC 8.0.30 and MySQL Server 8.0.30.. that is where the error appear...<br />
<br />
I was going to move up to MySQL 8.0.x. So I guess I have to wait for some resolution to this.]]></description>
            <dc:creator>Dennis Lim</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Wed, 03 Aug 2022 22:36:58 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,703665,703665#msg-703665</guid>
            <title>MySQL 8.0:  Migrating to utf8mb4: Things to Consider (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,703665,703665#msg-703665</link>
            <description><![CDATA[ <a href="https://www.percona.com/blog/migrating-to-utf8mb4-things-to-consider/"  rel="nofollow">https://www.percona.com/blog/migrating-to-utf8mb4-things-to-consider/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Tue, 29 Mar 2022 14:11:36 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,700601,700601#msg-700601</guid>
            <title>MySQL 8.0:  utf8mb4 (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,700601,700601#msg-700601</link>
            <description><![CDATA[ <a href="https://twitter.com/isotopp/status/1481383731562795009"  rel="nofollow">https://twitter.com/isotopp/status/1481383731562795009</a><br />
&quot;Why the #### did MySQL not upgrade the utf8 charset to 4 bytes, but created utf8mb4 instead?&quot;<br />
&quot;Because indexes matter and they can be large, so changing collations and charsets is impossible.&quot;<br />
<br />
Deep dive:<br />
<a href="https://blog.koehntopp.info/2022/01/12/utf8mb4.html"  rel="nofollow">https://blog.koehntopp.info/2022/01/12/utf8mb4.html</a><br />
<br />
<br />
<a href="https://www.percona.com/blog/migrating-to-utf8mb4-things-to-consider/"  rel="nofollow">https://www.percona.com/blog/migrating-to-utf8mb4-things-to-consider/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Thu, 13 Jan 2022 05:57:22 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,700414,700414#msg-700414</guid>
            <title>I can&#039;t insert arabic characters (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,700414,700414#msg-700414</link>
            <description><![CDATA[ I have to insert text in various languages and when I insert arabic it inserts this &quot;????????&quot; Help? Thank you]]></description>
            <dc:creator>Paola Sigurtà</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Thu, 30 Dec 2021 21:56:39 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,698902,698902#msg-698902</guid>
            <title>Update latin1/utf8 to utf8mb4 (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,698902,698902#msg-698902</link>
            <description><![CDATA[ Hi Team,<br />
Planning to update my database&#039;s from latin1/utf8 to utf8mb4 character set. Please clarify below things.<br />
1. Will there be any dataloss during this conversion?<br />
2. I have column with varchar(32) in utf8. But now maximum length of values inside this column is 31. while converting to utf8mb4 do I need to increase the value of varchar(?) ?]]></description>
            <dc:creator>Sivaranjani P</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 20 Sep 2021 19:48:00 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,697769,697769#msg-697769</guid>
            <title>MySQL: Character Sets, Unicode, and UCA compliant collations (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,697769,697769#msg-697769</link>
            <description><![CDATA[ MySQL: Character Sets, Unicode, and UCA compliant collations<br />
- <a href="https://blogs.oracle.com/mysql/mysql%3a-character-sets%2c-unicode%2c-and-uca-compliant-collations"  rel="nofollow">https://blogs.oracle.com/mysql/mysql%3a-character-sets%2c-unicode%2c-and-uca-compliant-collations</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Thu, 22 Jul 2021 17:19:51 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,697148,697148#msg-697148</guid>
            <title>Accent sensitive sorting by slovak alphabet (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,697148,697148#msg-697148</link>
            <description><![CDATA[ I have a problem with collation of sorting results.<br />
I would want to order results by slovak alphabet using utf8_slovak ci, but accent sensitive.<br />
Utf8_slovak_ci works, but isn&#039;t accent sensitive as i understand.<br />
<br />
It sorts results as this:<br />
Sack, John<br />
Sácká, Gabriela<br />
Sačková, Eva<br />
Sacková, Michelle<br />
Sacks, Oliver<br />
but I would like it to be like this:<br />
Sack, John<br />
Sacková, Michelle<br />
Sacks, Oliver<br />
Sačková, Eva<br />
Sácká, Gabriela<br />
<br />
How can I order results by slovak alphabet accent sensitively?<br />
Thanks]]></description>
            <dc:creator>Filip Aufricht</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 30 Aug 2021 20:08:24 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,696275,696275#msg-696275</guid>
            <title>Japanese voiced and unvoiced characters (5 replies)</title>
            <link>https://forums.mysql.com/read.php?103,696275,696275#msg-696275</link>
            <description><![CDATA[ MySQL doesn&#039;t seem to differentiate Japanese Dakuon (voiced) characters and Seion (unvoiced) characters. For instance SELECT query won&#039;t differentiate &quot;きず (kizu = wound in English)&quot; and &quot;きす&quot; (kisu = kiss in English), so when I run a query like &quot;SELECT * FROM mytable WHERE mytable.pronunciation = &#039;きず&#039;, it returns the entries with both &quot;きず = kizu&quot; and &quot;きす = kisu&quot;.<br />
I&#039;ve tried changing the character sets and collation of the table to all the combinations I can find, but it hasn&#039;t made any difference.<br />
<br />
Am I doing something wrong, or is this the normal behavior of MySQL? <br />
<br />
Thank you,]]></description>
            <dc:creator>Ken Guiche</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Wed, 05 May 2021 07:24:38 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,696186,696186#msg-696186</guid>
            <title>Case insensitive collation identical to Javascript (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,696186,696186#msg-696186</link>
            <description><![CDATA[ Can one please confirm or reject if<br />
- SELECT (a = b) with utf8mb4_0900_as_ci collation<br />
results always the same result as<br />
- (a.toLowerCase() == b.toLowerCase()) in JavaScript?<br />
a and b are both non-null Unicode strings.<br />
<br />
If not, what would be the best collation (or solution) to make case-insensitive comparison to work the same on MySQL and JS?<br />
<br />
Thanks.]]></description>
            <dc:creator>Rene Prillop</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Thu, 29 Apr 2021 09:07:07 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,695968,695968#msg-695968</guid>
            <title>behaviour of latin1 in mysql (3 replies)</title>
            <link>https://forums.mysql.com/read.php?103,695968,695968#msg-695968</link>
            <description><![CDATA[ I have some confusion on latin1 behaviour<br />
<br />
As per my understanding latin1 supports 256 characters and use 1 byte per character.<br />
<br />
I created 2 tables with latin1 and utf8 charset<br />
I am using mysql 5.7<br />
<br />
set names latin1;<br />
<br />
CREATE TABLE `foo` (<br />
      `i` int(11) DEFAULT NULL,<br />
       `v` varchar(10) DEFAULT NULL<br />
    ) ENGINE=InnoDB DEFAULT CHARSET=utf8;<br />
<br />
insert into foo values (&#039;Ũ&#039;);<br />
insert into foo(v) values (&#039;ϧ&#039;);<br />
insert into foo(v) values (&#039;þ&#039;);<br />
mysql&gt; select v, hex(v)  from foo;<br />
+------+--------+<br />
| v    | hex(v) |<br />
+------+--------+<br />
| Ũ    | C5A8   |<br />
| ϧ    | CFA7   |<br />
| þ    | C3BE   |<br />
+------+--------+<br />
<br />
--------------------------------------------<br />
<br />
CREATE TABLE foo_utf8 (   `i` int(11) DEFAULT NULL,   `v` varchar(10) DEFAULT NULL ) ENGINE=InnoDB DEFAULT CHARSET=utf8;<br />
<br />
insert into foo_utf8(v) values (&#039;þ&#039;);<br />
insert into foo_utf8(v) values (&#039;ϧ&#039;);<br />
insert into foo_utf8(v) values (&#039;Ũ&#039;);<br />
<br />
mysql&gt; select v, hex(v)  from foo_utf8;<br />
+------+----------+<br />
| v    | hex(v)   |<br />
+------+----------+<br />
| þ    | C383C2BE |<br />
| ϧ    | C38FC2A7 |<br />
| Ũ    | C385C2A8 |<br />
+------+----------+<br />
<br />
-----------------------------------<br />
<br />
In both the cases client encoding was latin1<br />
<br />
I am not able to understand<br />
1. How come I am able to insert characters with unicode greater then 256 in latin table<br />
2. how come I am able to fetch such data from latin1<br />
3. If I am able to insert/ access all characters in Latin1, then why I need ant other encoding like utf8<br />
4. why hex values are different for both the tables with different charset, although client encoding is same<br />
5. why I am able to fetch correct data even though hex values are different and client encoding is same<br />
<br />
please help me in understand or share any blogs so that I can go deeper to understand how charset and encoding works<br />
<br />
thanks <br />
Madhur]]></description>
            <dc:creator>madhur garg</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Tue, 20 Apr 2021 01:15:08 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,695837,695837#msg-695837</guid>
            <title>Issue with UTF-8 in MYSQL 5.7 (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,695837,695837#msg-695837</link>
            <description><![CDATA[ Hi everyone,<br />
<br />
Greetings. My name is Shanu. I am a developer in this company called Typeset (www.typeset.io). <br />
<br />
We use MySQL 5.7 internally in our company, and I am constantly running into UTF-8 troubles. Would want to describe the instance. <br />
<br />
On the frontend, we have real-time Editor on which the user can enter the data. Once the user enters this data, it should be stored properly in the database. However, here are the results I can see in the database:<br />
<br />
(a.) Question marks (????) instead of Chinese/ Japanese characters.<br />
(b.) Black diamonds coming up, such as Mi�uel.<br />
(c.) Random text (eg. æ–°æµ for 新浪)<br />
<br />
What should I do? I have done research on the internet, but not able to clearly understand. Any help in the right direction would be appreciated.]]></description>
            <dc:creator>Shanu Kumar</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Sun, 11 Apr 2021 16:30:18 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,694073,694073#msg-694073</guid>
            <title>Unable to query with Chinese character (4 replies)</title>
            <link>https://forums.mysql.com/read.php?103,694073,694073#msg-694073</link>
            <description><![CDATA[ Hello everyone,<br />
I&#039;m new to MySQL so I was practicing with a MySQL tutorial<br />
<br />
There was a question in this tutorial requiring us to query a record using LIKE operator so I did it as:<br />
<br />
select<br />
	Tname<br />
from Teacher<br />
where Tname like &#039;李%&#039;;<br />
<br />
and the result was Tname:  (nothing showed up except the column name).<br />
(All databases and tables in this tutorial was set to utf8 before any query was performed)<br />
<br />
<br />
<br />
Some suggested me to set the encoding in the configuration file (my.cnf) into utf8 and so I add:<br />
<br />
[client]<br />
default-character-set=utf8<br />
<br />
[mysql]<br />
default-character-set=utf8<br />
<br />
[mysqld]<br />
character-set-server=utf8<br />
<br />
in the my.cnf file and apply it in the system preference panel.<br />
<br />
<br />
<br />
<br />
However, it does&#039;t work at all. <br />
Could somebody please show me a way out of this?<br />
Thank you so much.]]></description>
            <dc:creator>Lamarche Lam</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 05 Jul 2021 19:11:07 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,690742,690742#msg-690742</guid>
            <title>Blackhole engine and utf8 character set (1 reply)</title>
            <link>https://forums.mysql.com/read.php?103,690742,690742#msg-690742</link>
            <description><![CDATA[ I&#039;m replicating a 5.5 DB to 5.6. The default storage engine on 5.5 is InnoDB, and on 5.6 is Blackhole. The character set system variables are all the same. Creating the tables, all work except one.<br />
<br />
create table t1 (<br />
`uuid` varchar(255) NOT NULL,<br />
`device_uuid` varchar(255) NOT NULL,<br />
`created_at` timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,<br />
`updated_at` timestamp NULL DEFAULT NULL,<br />
PRIMARY KEY (`uuid`,`device_uuid`,`created_at`)<br />
) DEFAULT CHARSET=utf8;<br />
<br />
ERROR 1071 (42000): Specified key was too long; max key length is 1000 bytes<br />
<br />
I can create the table as InnoDB, and I know utf8 in MySQL isn&#039;t &quot;actually&quot; utf8. But I wondered if anyone knew the reason why InnoDB and Blackhole don&#039;t produce the same error?]]></description>
            <dc:creator>Craig Healey</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Fri, 23 Oct 2020 12:58:41 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,689545,689545#msg-689545</guid>
            <title>MySQL: Some Character Set Basics (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,689545,689545#msg-689545</link>
            <description><![CDATA[ MySQL: Some Character Set Basics<br />
<a href="https://blog.koehntopp.info/2020/08/18/mysql-character-sets.html"  rel="nofollow">https://blog.koehntopp.info/2020/08/18/mysql-character-sets.html</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 17 Aug 2020 21:07:17 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,689012,689012#msg-689012</guid>
            <title>An in depth DBA&#039;s guide to migrating a MySQL database from the `utf8` to the `utf8mb4` charset (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,689012,689012#msg-689012</link>
            <description><![CDATA[ <a href="https://saveriomiroddi.github.io/An-in-depth-dbas-guide-to-migrating-a-mysql-database-from-the-utf8-to-the-utf8mb4-charset/"  rel="nofollow">https://saveriomiroddi.github.io/An-in-depth-dbas-guide-to-migrating-a-mysql-database-from-the-utf8-to-the-utf8mb4-charset/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 20 Jul 2020 14:38:28 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,686578,686578#msg-686578</guid>
            <title>Changing collation type slows down queries (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,686578,686578#msg-686578</link>
            <description><![CDATA[ I don&#039;t understand the issue with this. I have some tables that come to me as <br />
latin1_swedish_ci i need to do case sensitive joins so i change all the tables involved in both databases to utf8mb4_cs_0900_as_cs now my queries are very slow ! the &#039;explain&#039; shows that the indexes are being used. What can be the issue? i am using community version on Microsoft desktop.]]></description>
            <dc:creator>Doug Barger</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Thu, 23 Apr 2020 18:10:29 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,686408,686408#msg-686408</guid>
            <title>Config or Installation Issue (10 replies)</title>
            <link>https://forums.mysql.com/read.php?103,686408,686408#msg-686408</link>
            <description><![CDATA[ Data contained in a varchar type field, produces an error that avoid saving a record because de data contains characters (scape sequences) that prints Spanish language accents since the fields of any table shall contain information in Spanish and some English words or short expression(s) in the same piece of data. I think it it a config issue, because the same application, but using MSSQL as data base engine, accepted the string data as it came from the comma delimited source as is in the sequential (text) file.  This is the SQL instruction and error message:<br />
<br />
Insert INTO prodfarm (Codigo_Prod, Nombre, Prin_Activo, Forma_Farma, Activ_Terap, No_Reg_SSA, Fecha_Revision, Presentaciones,  Caducidades ) Values ( <br />
 &#039;REG-0135&#039;, &#039;Evastel D&#039;, &#039;Ebastina 10 mg / Pseudoefedrina 120 mg&#039;, &#039;Cápsulas liberación prolongada&#039;, &#039;Anti-histamínico, Descongestivo&#039;, &#039;622M98 SSA II&#039;, &#039;2005-10-03&#039;, &#039;Caja con 10 y 5 cápsulas en envase de burbuja&#039;, &#039;36 MESES&#039; ); <br />
<br />
Error Code: 1366. Incorrect string value: &#039;\xC3\xBApsul...&#039; for column &#039;Presentaciones&#039; at row 1<br />
<br />
The weird thing is that other fields as Activ_Terap accepts accented words(histamínico) and allows the record to be saved, but the field Presentaciones does not.<br />
Can anyone help me to solve this issue? I will appreciate a lot!<br />
<br />
As a work around, I replaced the accented vowel character with a non accented one, i.e. cápsula is replaced with capsula. The record could be stored but it&#039;s a miss spelling.<br />
Thank you.]]></description>
            <dc:creator>Luis Benitez-Martell</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Tue, 21 Apr 2020 05:39:17 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,686360,686360#msg-686360</guid>
            <title>UTF-8 Everywhere (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,686360,686360#msg-686360</link>
            <description><![CDATA[ About the authors<br />
<br />
This manifesto was written by Pavel Radzivilovsky, Yakov Galka and Slava Novgorodov. It is a result of our experience and research of real-world Unicode issues and mistakes done by real-world programmers. Our goal here is to improve awareness of text issues and to inspire industry-wide changes to make Unicode-aware programming easier, ultimately improving the experience of users of those programs written by human engineers. Neither of us is involved in the Unicode consortium.<br />
<br />
Special thanks to Glenn Linderman for providing information about Python, and to Markus Künne, Jelle Geerts, Lazy Rui and Jan Rüegg for reporting bugs and typos in this document.<br />
<br />
Much of the text was inspired by discussions on StackOverflow initiated by Artyom Beilis, the author of Boost.Locale. Additional inspiration came from the development conventions at VisionMap and Michael Hartl’s tauday.org.<br />
<br />
UTF-8 Everywhere<br />
- <a href="http://utf8everywhere.org/"  rel="nofollow">http://utf8everywhere.org/</a><br />
- <a href="https://news.ycombinator.com/item?id=22867503"  rel="nofollow">https://news.ycombinator.com/item?id=22867503</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Tue, 14 Apr 2020 16:16:49 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,684803,684803#msg-684803</guid>
            <title>TicketSolve:  Upgrading from MySQL 5.7 to 8.0 (Character Sets and Collations) (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,684803,684803#msg-684803</link>
            <description><![CDATA[ Upgrade and UTF8:<br />
- <a href="https://saveriomiroddi.github.io/An-in-depth-dbas-guide-to-migrating-a-mysql-database-from-the-utf8-to-the-utf8mb4-charset/"  rel="nofollow">https://saveriomiroddi.github.io/An-in-depth-dbas-guide-to-migrating-a-mysql-database-from-the-utf8-to-the-utf8mb4-charset/</a><br />
- <a href="https://saveriomiroddi.github.io/Pre-fosdem-talk-upgrading-from-mysql-5.7-to-8.0/"  rel="nofollow">https://saveriomiroddi.github.io/Pre-fosdem-talk-upgrading-from-mysql-5.7-to-8.0/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Sun, 23 Feb 2020 21:45:41 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,683856,683856#msg-683856</guid>
            <title>Adding custom french EBCDIC collation for unicode (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,683856,683856#msg-683856</link>
            <description><![CDATA[ Hi everyone,<br />
<br />
I have a task to add custom EBCDIC collation with french symbols.<br />
This collation based on <a href="https://en.wikipedia.org/wiki/EBCDIC_1047"  rel="nofollow">https://en.wikipedia.org/wiki/EBCDIC_1047</a><br />
<br />
1)<br />
I&#039;ve already tried to add custom collation to latin1<br />
<br />
add weights map to latin1.xml<br />
  &lt;collation name=&quot;latin1_ebcdic&quot;&gt;<br />
   &lt;map&gt;<br />
<br />
    00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F<br />
    10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F<br />
    20 3A 5F 5B 3B 4C 30 5D 2D 3D 3C 2E 4B 40 2B 41<br />
    F0 F1 F2 F3 F4 F5 F6 F7 F8 F9 5A 3E 2C 5E 4E 4F<br />
    5C C1 C2 C3 C4 C5 C6 C7 C8 C9 D1 D2 D3 D4 D5 D6<br />
    D7 D8 D9 E2 E3 E4 E5 E6 E7 E8 E9 AD E0 BD 3F 4D<br />
    59 61 62 63 64 65 66 67 68 69 71 72 73 74 75 76<br />
    77 78 79 A2 A3 A4 A5 A6 A7 A8 A9 C0 2F D0 A1 FF<br />
    80 81 82 83 84 85 86 87 88 89 8A 8B 8C 8D 8E 8F<br />
    90 91 92 93 94 95 96 97 98 99 9A 9B 9C 9D 9E 9F<br />
    21 AA 2A B1 7F B2 4A B5 BB B4 7A 6A B0 CA AF BC<br />
    70 6F EA FA BE A0 B6 B3 7D DA 7B 6B B7 B8 B9 AB<br />
    44 44 42 46 43 47 7E 48 54 51 52 53 58 55 56 57<br />
    AC 49 ED EE EB EF EC BF 60 FD FE FB FC BA AE 39<br />
    24 25 22 26 23 27 7C 28 34 31 32 33 38 35 36 37<br />
    6C 29 CD CE CB CF CC E1 50 DD DE DB DC 6D 6E DF<br />
   &lt;/map&gt;<br />
  &lt;/collation&gt;<br />
<br />
In this approach everything works fine except unicode symbols.<br />
Using this collation they convert to &quot;?&quot; have same weight.<br />
Unicode symbols during sorting put into middle of the list (after &quot;?&quot;) instead of end of it.<br />
<br />
2)<br />
Also have tried solution (Defining a UCA Collation Using LDML Syntax) described here <a href="https://dev.mysql.com/doc/refman/5.7/en/ldml-collation-example.html"  rel="nofollow">https://dev.mysql.com/doc/refman/5.7/en/ldml-collation-example.html</a><br />
Add rules to utf8 charset.<br />
But it seems that &lt;rules&gt; dont support order that I set. It just put symbols in the beginning but not support order that I mentioned in &lt;rules&gt; block:<br />
&lt;rules&gt;<br />
 &lt;reset&gt;\u0000&lt;/reset&gt;<br />
   &lt;i&gt;\u0020&lt;/i&gt;<br />
   &lt;i&gt;\u00A0&lt;/i&gt;<br />
   &lt;i&gt;\u00E2&lt;/i&gt;<br />
   &lt;i&gt;\u00E4&lt;/i&gt;<br />
   &lt;i&gt;\u00E0&lt;/i&gt;<br />
   &lt;i&gt;\u00E1&lt;/i&gt;<br />
   &lt;i&gt;\u00E3&lt;/i&gt;<br />
   &lt;i&gt;\u00E5&lt;/i&gt;<br />
...<br />
&lt;/rules&gt;<br />
<br />
<br />
Maybe someone can give me a hint how to add custom collation based on EBCDIC 1047 with support of unicode symbols (symbols other that EBCDIC 1047 must be put into end with maximum weight). <br />
<br />
Thank you in advance!]]></description>
            <dc:creator>Sergey Vovnenko</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Wed, 22 Jan 2020 15:03:09 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,683785,683785#msg-683785</guid>
            <title>MySQL 8.0:  A Tale of UDFs with Character Sets (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,683785,683785#msg-683785</link>
            <description><![CDATA[ MySQL 8.0:  A Tale of UDFs with Character Sets<br />
<a href="https://mysqlserverteam.com/a-tale-of-udfs-with-character-sets/"  rel="nofollow">https://mysqlserverteam.com/a-tale-of-udfs-with-character-sets/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Fri, 17 Jan 2020 23:43:08 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,683663,683663#msg-683663</guid>
            <title>MYSQL DB not showing uft8 on php page (8 replies)</title>
            <link>https://forums.mysql.com/read.php?103,683663,683663#msg-683663</link>
            <description><![CDATA[ Hello everyone,<br />
I created a database and a table. I created a php page. When I visit the php page, the foreign characters change to question marks. This is what I did to troubleshoot:<br />
<br />
1- php page contains the tag: &lt;meta http-equiv=&quot;Content-Type&quot; content=&quot;text/html; charset=utf-8&quot; /&gt;<br />
2- I tested that the problem is with the mysql db by adding text directly on the php page, and it was showing correctly. So it&#039;s not the php page not uft enabled.<br />
3- mysql db is using MyISAM utf8_general_ci<br />
4- When I go to the table inside the db, I see the text displayed correctly.<br />
5- When I test using the my localhost, everything works perfectly. When I add it to Bluehost server, I get the ???? instead of the actual characters.<br />
<br />
What am I missing? Thanks!]]></description>
            <dc:creator>Ben Je</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Thu, 16 Jan 2020 06:22:28 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,680730,680730#msg-680730</guid>
            <title>Summary of trailing spaces handling in MySQL, with version 8.0 upgrade considerations (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,680730,680730#msg-680730</link>
            <description><![CDATA[ <a href="https://saveriomiroddi.github.io/Summary-of-trailing-spaces-handling-in-MySQL-with-version-8.0-upgrade-considerations/"  rel="nofollow">https://saveriomiroddi.github.io/Summary-of-trailing-spaces-handling-in-MySQL-with-version-8.0-upgrade-considerations/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 02 Dec 2019 00:31:04 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,680729,680729#msg-680729</guid>
            <title>An in depth DBA&#039;s guide to migrating a MySQL database from the `utf8` to the `utf8mb4` charset (no replies)</title>
            <link>https://forums.mysql.com/read.php?103,680729,680729#msg-680729</link>
            <description><![CDATA[ <a href="https://saveriomiroddi.github.io/An-in-depth-dbas-guide-to-migrating-a-mysql-database-from-the-utf8-to-the-utf8mb4-charset/"  rel="nofollow">https://saveriomiroddi.github.io/An-in-depth-dbas-guide-to-migrating-a-mysql-database-from-the-utf8-to-the-utf8mb4-charset/</a>]]></description>
            <dc:creator>Edwin Desouza</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 02 Dec 2019 00:29:20 +0000</pubDate>
        </item>
        <item>
            <guid>https://forums.mysql.com/read.php?103,680580,680580#msg-680580</guid>
            <title>Case insensitive search in utf8 db (14 replies)</title>
            <link>https://forums.mysql.com/read.php?103,680580,680580#msg-680580</link>
            <description><![CDATA[ I have a db with uft8 values in greek like:<br />
ΓΙΑΝΝΗΣ<br />
Γιάννης<br />
Γιαννης<br />
(it is the word John in greek) and I want to write a query to be able to find all these 3 instances. In other words, I have words in upper case, words in lower case, words with or without tonos (= the mark &#039; on letter α, second row in my example). Also, another word may be before or after that word. I tried adding &quot; COLLATE NOCASE&quot; at the end of the query and didn&#039;t help. Tried: SELECT * FROM Table WHERE UPPER(item) LIKE &#039;UPPER(%text%&#039;)<br />
and now I can find ΓΙΑΝΝΗΣ when searching for &quot;γιαννησ&quot;. The last letter &quot;ς&quot; is handled as different than &quot;Σ&quot; as the lower case of &quot;Σ&quot; is &quot;σ&quot; (when a greek word ends with &quot;σ&quot;, we use &quot;ς&quot; instead). What can I do to deal with it?]]></description>
            <dc:creator>John Stergiou</dc:creator>
            <category>Character Sets, Collation, Unicode</category>
            <pubDate>Mon, 02 Dec 2019 20:47:32 +0000</pubDate>
        </item>
    </channel>
</rss>
