Step 1: Determine the UTF-8 encoding bit layout
The character ④ has the Unicode code point U+2463. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2463 to binary:
00100100 01100011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10010001 10100011
CIRCLED DIGIT FOUR·U+2463
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 91 A3 | 11100010 10010001 10100011 |
UTF16 (big Endian) | 24 63 | 00100100 01100011 |
UTF16 (little Endian) | 63 24 | 01100011 00100100 |
UTF32 (big Endian) | 00 00 24 63 | 00000000 00000000 00100100 01100011 |
UTF32 (little Endian) | 63 24 00 00 | 01100011 00100100 00000000 00000000 |
Description
The Unicode character U+2463, known as CIRCLED DIGIT FOUR, serves a significant purpose in digital typography. This symbol is part of the Miscellaneous Technical (U+2400-U+245F) block within the Unicode standard, which is devoted to technical symbols and special characters often utilized in computer programming, mathematics, and scientific notation. The CIRCLED DIGIT FOUR is commonly employed as an alternative representation for the number four (4) when clarity or emphasis is needed in digital text. Although it may be used infrequently compared to other digits, this character can prove valuable in situations requiring distinction from its adjacent numerical symbols or for enhancing visual appearance within a document or application. The CIRCLED DIGIT FOUR, like other characters in the Miscellaneous Technical block, bridges linguistic and cultural barriers by employing universally recognized symbols to convey specific information or concepts across various languages and platforms.
How to type the ④ symbol on Windows
Hold Alt and type 9315 on the numpad. Or use Character Map.