Step 1: Determine the UTF-8 encoding bit layout
The character ➃ has the Unicode code point U+2783. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2783 to binary:
00100111 10000011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10011110 10000011
DINGBAT CIRCLED SANS-SERIF DIGIT FOUR·U+2783
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 9E 83 | 11100010 10011110 10000011 |
UTF16 (big Endian) | 27 83 | 00100111 10000011 |
UTF16 (little Endian) | 83 27 | 10000011 00100111 |
UTF32 (big Endian) | 00 00 27 83 | 00000000 00000000 00100111 10000011 |
UTF32 (little Endian) | 83 27 00 00 | 10000011 00100111 00000000 00000000 |
Description
U+2783 is a dingbat character in Unicode that represents a circled sans-serif digit four. In digital typography and text, this character is often used for visual purposes such as creating diagrams, charts, or other graphical representations where the number four needs to be emphasized or distinguished from its standard appearance. The "circled" aspect of the character adds a unique element to the digit, making it more visible and easily identifiable within the context of text or design. There is no specific cultural, linguistic, or technical context surrounding U+2783 as it serves primarily as an aesthetic choice for designers and writers when they need a distinct representation of the number four in their work. As with other dingbat characters, the use of U+2783 depends on the preferences of the designer, and its utility is limited to cases where the visual presentation of the digit four holds importance over its numerical value.
How to type the ➃ symbol on Windows
Hold Alt and type 10115 on the numpad. Or use Character Map.