Step 1: Determine the UTF-8 encoding bit layout
The character ဇ has the Unicode code point U+1007. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+1007 to binary:
00010000 00000111
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10000000 10000111
MYANMAR LETTER JA·U+1007
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 80 87 | 11100001 10000000 10000111 |
UTF16 (big Endian) | 10 07 | 00010000 00000111 |
UTF16 (little Endian) | 07 10 | 00000111 00010000 |
UTF32 (big Endian) | 00 00 10 07 | 00000000 00000000 00010000 00000111 |
UTF32 (little Endian) | 07 10 00 00 | 00000111 00010000 00000000 00000000 |
Description
Unicode character U+1007 is designated as 'MYANMAR LETTER JA'. In digital text, this character represents a specific letter used in the Myanmar language, which belongs to the larger group of Southeast Asian languages. This particular letter, 'Ja', plays an essential role in written Myanmar language, contributing to its orthography and proper syntax structure. Myanmar script is a unique alphabet that uses both vowels and consonants, where each consonant has its distinct shape and is combined with various vowel signs to form words. The character U+1007 'MYANMAR LETTER JA' is part of this script system. The use of U+1007 reflects the cultural and linguistic diversity, as Myanmar has its own unique writing system different from those used in other parts of the world. The character holds significant technical context as well, particularly with regard to computer encoding standards, where it represents a specific value in Unicode representation, ensuring its accurate digital transmission and display across various platforms and devices. In summary, U+1007 'MYANMAR LETTER JA' is an integral part of the Myanmar language system, contributing to the rich linguistic heritage of the region while being a vital component in modern digital text encoding and communication.
How to type the ဇ symbol on Windows
Hold Alt and type 4103 on the numpad. Or use Character Map.