Step 1: Determine the UTF-8 encoding bit layout
The character ㏄ has the Unicode code point U+33C4. In UTF-8, it is encoded using 3 bytes because its code point falls in the range 0x0800 to 0xFFFF.
Therefore the UTF-8 encoding carries 16 payload bits within 24 bits in total, in the format:
1110xxxx 10xxxxxx 10xxxxxx
Where the x are the payload bits.
UTF-8 Encoding bit layout by codepoint range
Codepoint Range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |
Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+33C4 to binary:
00110011 11000100
Those are the payload bits.
Step 3: Fill in the bits to match the bit pattern:
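Split the 16 payload bits into groups of 4, 6, and 6 bits (0011, 001111, 000100) and place each group after the fixed prefix of its byte (1110, 10, 10) to obtain the final bytes: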
11100011 10001111 10000100
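The three steps above can be sketched in Python. This is a minimal illustration and not a replacement for the built-in codec; the function name `utf8_encode` is a hypothetical helper chosen for this example.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode a single code point by hand, following the bit-layout table."""
    if code_point < 0 or code_point > 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        # Out-of-range values and surrogates have no valid UTF-8 form.
        raise ValueError(f"not an encodable code point: {code_point:#x}")
    if code_point < 0x80:
        # 1 byte: 0xxxxxxx (7 payload bits)
        return bytes([code_point])
    if code_point < 0x800:
        # 2 bytes: 110xxxxx 10xxxxxx (11 payload bits)
        return bytes([
            0b11000000 | (code_point >> 6),
            0b10000000 | (code_point & 0b111111),
        ])
    if code_point < 0x10000:
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx (16 payload bits)
        return bytes([
            0b11100000 | (code_point >> 12),
            0b10000000 | ((code_point >> 6) & 0b111111),
            0b10000000 | (code_point & 0b111111),
        ])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 payload bits)
    return bytes([
        0b11110000 | (code_point >> 18),
        0b10000000 | ((code_point >> 12) & 0b111111),
        0b10000000 | ((code_point >> 6) & 0b111111),
        0b10000000 | (code_point & 0b111111),
    ])


encoded = utf8_encode(0x33C4)
binary_form = " ".join(f"{byte:08b}" for byte in encoded)
print(binary_form)  # 11100011 10001111 10000100
```

The shifts pick out the 4-, 6-, and 6-bit groups from Step 3, and the OR operations add the fixed prefix of each byte.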
SQUARE CC·U+33C4
Character Information
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF8 | E3 8F 84 | 11100011 10001111 10000100 |
UTF16 (big Endian) | 33 C4 | 00110011 11000100 |
UTF16 (little Endian) | C4 33 | 11000100 00110011 |
UTF32 (big Endian) | 00 00 33 C4 | 00000000 00000000 00110011 11000100 |
UTF32 (little Endian) | C4 33 00 00 | 11000100 00110011 00000000 00000000 |
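The hex column of this table can be reproduced with Python's built-in codecs. The sketch below is only a cross-check; the `hex_bytes` helper and the dictionary labels are names chosen for this example.

```python
char = "\u33c4"  # SQUARE CC

# Label -> Python codec name (all are standard-library codecs).
codecs_by_label = {
    "UTF-8": "utf-8",
    "UTF-16 (big endian)": "utf-16-be",
    "UTF-16 (little endian)": "utf-16-le",
    "UTF-32 (big endian)": "utf-32-be",
    "UTF-32 (little endian)": "utf-32-le",
}


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated uppercase hex pairs."""
    return " ".join(f"{byte:02X}" for byte in data)


rows = {label: hex_bytes(char.encode(codec))
        for label, codec in codecs_by_label.items()}

for label, hex_form in rows.items():
    print(f"{label}: {hex_form}")
```

The explicit `-be` and `-le` codec names are used so that no byte order mark is prepended to the output.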
Description
The Unicode character U+33C4, named "SQUARE CC," belongs to the CJK Compatibility block (U+3300 to U+33FF). It fits the abbreviation "cc" (cubic centimeter) into a single full-width character cell, in the style of the square unit symbols used in East Asian typesetting. It is a compatibility character with a compatibility decomposition to the two letters "cc" (U+0063 U+0063), and it exists mainly for interoperability with legacy East Asian character sets. In new text, the plain letters "cc" are usually sufficient unless the single-cell form is specifically required.
How to type the ㏄ symbol on Windows
Hold Alt and type 13252 on the numpad. Or use Character Map.
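The number 13252 is the decimal form of the hexadecimal code point 33C4. A short Python check, using only the standard library, shows the conversion:

```python
import unicodedata

code_point = int("33C4", 16)  # hexadecimal code point -> decimal
char_name = unicodedata.name(chr(code_point))

print(code_point)  # 13252
print(char_name)   # SQUARE CC
```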