Difference between revisions of "MySQL collation for Kyrgyz"

From FireSpeakerWiki
Jump to navigationJump to search
m
 
(One intermediate revision by the same user not shown)
Line 7: Line 7:
 
   <collation name="utf8_cyr_kir" id="244">
 
   <collation name="utf8_cyr_kir" id="244">
 
     <rules>
 
     <rules>
       <reset before="primary">\u043E</reset>
+
       <reset before="primary">\u04A3</reset>
       <p>\u04A3</p>
+
       <p>\u043E</p>
 
       <reset before="primary">\u043F</reset>
 
       <reset before="primary">\u043F</reset>
 
       <p>\u04E9</p>
 
       <p>\u04E9</p>
Line 21: Line 21:
  
 
The id can be anything from 0 to 255, but it should be unique.  The mysql process will have to be restarted after modifying the file.  The collation can then be applied to a table.
 
The id can be anything from 0 to 255, but it should be unique.  The mysql process will have to be restarted after modifying the file.  The collation can then be applied to a table.
 +
 +
For reference, the characters translate as follows:
 +
 +
{|class="wikitable"
 +
|-
 +
! reference || = || →before || =
 +
|-
 +
| u04A3 || ң || u043E || о
 +
|-
 +
| u043F || п || u04E9 || ө
 +
|-
 +
| u0444 || ф || u04AF|| ү
 +
|-
 +
| u0451 || ё || u0435 || е
 +
|}
  
 
[[Category:Kyrgyz]]
 
[[Category:Kyrgyz]]

Latest revision as of 12:35, 12 May 2010

This page provides a UCA collation for Kyrgyz in mysql. The UCA itself was built using guidelines found on mysql's webiste, links from there, and some bugging of people on IRC (thanks), and a certain amount of experimenting.

The collation should go in the unicode character set definition in /usr/share/mysql/charsets/Index.xml:

<charset name="utf8">
  ...
  <collation name="utf8_cyr_kir" id="244">
    <rules>
      <reset before="primary">\u04A3</reset>
      <p>\u043E</p>
      <reset before="primary">\u043F</reset>
      <p>\u04E9</p>
      <reset before="primary">\u0444</reset>
      <p>\u04AF</p>
      <reset before="primary">\u0451</reset>
      <p>\u0435</p>
    </rules>
  </collation>
</charset>

The id can be anything from 0 to 255, but it should be unique. The mysql process will have to be restarted after modifying the file. The collation can then be applied to a table.

For reference, the characters translate as follows:

reference = →before =
u04A3 ң u043E о
u043F п u04E9 ө
u0444 ф u04AF ү
u0451 ё u0435 е