Step 1: Determine the UTF-8 encoding bit layout
The character ၞ has the Unicode code point U+105E. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+105E to binary:
00010000 01011110
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10000001 10011110
MYANMAR CONSONANT SIGN MON MEDIAL NA·U+105E
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 81 9E | 11100001 10000001 10011110 |
UTF16 (big Endian) | 10 5E | 00010000 01011110 |
UTF16 (little Endian) | 5E 10 | 01011110 00010000 |
UTF32 (big Endian) | 00 00 10 5E | 00000000 00000000 00010000 01011110 |
UTF32 (little Endian) | 5E 10 00 00 | 01011110 00010000 00000000 00000000 |
Description
The Unicode character U+105E, known as MYANMAR CONSONANT SIGN MON MEDIAL NA, holds a significant role in digital text representation within the Myanmar script. This character is a crucial element in the Myanmar language's typography and phonetics, serving as a consonant sign in the middle of words. It demonstrates the typical usage and implementation of the Myanmar script in digital text systems and applications. In its linguistic context, U+105E is primarily used to transcribe the sound "ŋ" or "m̩" in the Myanmar language, which is a significant part of the language's phonetic inventory. This character, along with other Unicode characters within the Myanmar script block, facilitates accurate and efficient communication for speakers of the Myanmar language across digital platforms. The introduction and usage of this character and others like it in the Unicode standard reflect a commitment to preserving and promoting linguistic diversity worldwide. From a technical standpoint, U+105E's presence in the Unicode character set allows for the accurate representation of Myanmar text within digital environments, enabling proper text rendering, search functionality, and data exchange for applications, websites, and documents that utilize the Myanmar script. In summary, the MYANMAR CONSONANT SIGN MON MEDIAL NA character (U+105E) plays a vital role in the digital representation of the Myanmar language by accurately transcribing specific phonetic sounds, contributing to the preservation and promotion of linguistic diversity within the Unicode standard.
How to type the ၞ symbol on Windows
Hold Alt and type 4190 on the numpad. Or use Character Map.