Step 1: Determine the UTF-8 encoding bit layout
The character ㉡ has the Unicode code point U+3261. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+3261 to binary:
00110010 01100001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100011 10001001 10100001
CIRCLED HANGUL NIEUN·U+3261
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E3 89 A1 | 11100011 10001001 10100001 |
UTF16 (big Endian) | 32 61 | 00110010 01100001 |
UTF16 (little Endian) | 61 32 | 01100001 00110010 |
UTF32 (big Endian) | 00 00 32 61 | 00000000 00000000 00110010 01100001 |
UTF32 (little Endian) | 61 32 00 00 | 01100001 00110010 00000000 00000000 |
Description
U+3261, also known as the Circled Hangul Nieun (ㅥ), is a character in the Unicode Standard, which encompasses characters from a wide range of scripts and languages for digital text representation. In the Korean language, this character represents the consonant 'nieun' (니) and is part of the Hangul script. It was added to the Unicode standard in 1997 as part of the third major version update, expanding the capabilities of digital communication across various platforms and devices. The Circled Hangul Nieun character is used to denote a syllable block in Korean text input systems, where it serves as an input method for learners or non-native speakers who are unfamiliar with the standard Hangul forms. This character's role highlights its cultural significance, as the Hangul script is integral to the Korean language and is widely used by millions of people worldwide. As digital communication continues to expand globally, characters like U+3261 play a crucial role in maintaining linguistic diversity and facilitating accurate information exchange across different cultures and languages.
How to type the ㉡ symbol on Windows
Hold Alt and type 12897 on the numpad. Or use Character Map.