Step 1: Determine the UTF-8 encoding bit layout
The character ㊔ has the Unicode code point U+3294. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+3294 to binary:
00110010 10010100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100011 10001010 10010100
CIRCLED IDEOGRAPH NAME·U+3294
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E3 8A 94 | 11100011 10001010 10010100 |
UTF16 (big Endian) | 32 94 | 00110010 10010100 |
UTF16 (little Endian) | 94 32 | 10010100 00110010 |
UTF32 (big Endian) | 00 00 32 94 | 00000000 00000000 00110010 10010100 |
UTF32 (little Endian) | 94 32 00 00 | 10010100 00110010 00000000 00000000 |
Description
The Unicode character U+3294 is known as the "Circled Ideograph Name." It is a typographical symbol that represents a circle containing an ideograph from the Chinese script. This character holds significance in digital text, particularly in East Asian languages such as Chinese, Japanese, and Korean. Its primary usage lies in the context of naming or labeling specific ideographs, serving as a marker to identify particular characters within texts, especially in reference materials like dictionaries and language guides. The U+3294 symbol is often employed to differentiate similar-looking characters with distinct meanings or pronunciations, aiding in the comprehension of written content in these languages. In certain technical contexts, the Circled Ideograph Name may also be utilized in software applications, such as character encoding systems or font design tools, to denote specific ideographs and ensure accurate representation and interpretation across different platforms. Overall, U+3294 serves a practical purpose in digital text for East Asian languages, ensuring clarity and precision in communication through the use of this distinctive typographical symbol.
How to type the ㊔ symbol on Windows
Hold Alt and type 12948 on the numpad. Or use Character Map.