Step 1: Determine the UTF-8 encoding bit layout
The character ᆝ has the Unicode code point U+119D. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+119D to binary:
00010001 10011101
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10000110 10011101
HANGUL JUNGSEONG I-ARAEA·U+119D
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 86 9D | 11100001 10000110 10011101 |
UTF16 (big Endian) | 11 9D | 00010001 10011101 |
UTF16 (little Endian) | 9D 11 | 10011101 00010001 |
UTF32 (big Endian) | 00 00 11 9D | 00000000 00000000 00010001 10011101 |
UTF32 (little Endian) | 9D 11 00 00 | 10011101 00010001 00000000 00000000 |
Description
The Hangul Jungseong I-Araea (U+119D) is a unique character within the Unicode Standard that plays an important role in digital text encoding for the Korean language. It primarily serves as one of the essential components of the Hangul writing system, which was developed during the 15th century under the ruling of King Sejong the Great. The Hangul script is used by millions of people across South Korea and North Korea as their primary written form of communication. In its cultural context, this character contributes to the rich linguistic heritage of Korean-speaking communities worldwide. Technically, U+119D is part of the Hangul Syllables block in Unicode, which comprises 430 characters. These characters are designed to represent the basic building blocks of the Korean language - consonants (U+1161 to U+117A), vowels (U+117B to U+119E), and various jungseong (U+119F to U+11FD) such as I-Araea, which are used to modify the initial consonant's sound. The character U+119D specifically represents a modification of the consonant I in Hangul, enhancing the expressive power and nuance of the script. Its utilization is essential for accurate digital representation and communication of Korean text across different platforms and devices, showcasing the importance of Unicode in preserving linguistic diversity on the global stage.
How to type the ᆝ symbol on Windows
Hold Alt and type 4509 on the numpad. Or use Character Map.