Step 1: Determine the UTF-8 encoding bit layout
The character ᇠ has the Unicode code point U+11E0. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+11E0 to binary:
00010001 11100000
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10000111 10100000
HANGUL JONGSEONG MIEUM-CHIEUCH·U+11E0
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 87 A0 | 11100001 10000111 10100000 |
UTF16 (big Endian) | 11 E0 | 00010001 11100000 |
UTF16 (little Endian) | E0 11 | 11100000 00010001 |
UTF32 (big Endian) | 00 00 11 E0 | 00000000 00000000 00010001 11100000 |
UTF32 (little Endian) | E0 11 00 00 | 11100000 00010001 00000000 00000000 |
Description
The Unicode character U+11E0, commonly known as Hangul Jongseong Mieum-Chieuch, plays a significant role in digital text communication, particularly within the Korean language. This specific character belongs to the Korean hangul script, which is comprised of a total of 36 consonants, 21 vowels, and 9 jongseong (consonant modifiers) like U+11E0. The term "Mieum-Chieuch" itself denotes the combination of two separate components: Mieum, which is one of the initial consonants in the hangul script, and Chieuch, which signifies a particular jongseong. In digital communication, U+11E0 helps to form words by attaching itself to vowel characters. This character doesn't possess any standalone meaning; instead, it significantly contributes to the formation of various Korean words and phrases. The utilization of this Unicode character is limited to the Korean language due to its specific phonetic and structural role in forming syllable blocks within the Hangul script. The use of U+11E0 demonstrates the rich cultural history and linguistic uniqueness of the Korean language, which has adopted an innovative writing system that effectively combines both consonants and vowels to facilitate fluid communication. The Unicode character U+11E0 helps uphold this uniqueness in digital text form, thus contributing significantly to the preservation and propagation of Korean linguistic and cultural heritage.
How to type the ᇠ symbol on Windows
Hold Alt and type 4576 on the numpad. Or use Character Map.