Step 1: Determine the UTF-8 encoding bit layout
The character ➇ has the Unicode code point U+2787. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2787 to binary:
00100111 10000111
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10011110 10000111
DINGBAT CIRCLED SANS-SERIF DIGIT EIGHT·U+2787
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 9E 87 | 11100010 10011110 10000111 |
UTF16 (big Endian) | 27 87 | 00100111 10000111 |
UTF16 (little Endian) | 87 27 | 10000111 00100111 |
UTF32 (big Endian) | 00 00 27 87 | 00000000 00000000 00100111 10000111 |
UTF32 (little Endian) | 87 27 00 00 | 10000111 00100111 00000000 00000000 |
Description
U+2787 is a character from the Unicode standard, specifically known as the DINGBAT CIRCLED SANS-SERIF DIGIT EIGHT. This symbol falls under the category of dingbat characters, which are primarily used for decorative or artistic purposes in digital text. Unlike regular digits or letters, dingbats do not have any numerical or alphabetical value and are often incorporated into designs to create visual interest or emphasis. The DINGBAT CIRCLED SANS-SERIF DIGIT EIGHT is part of a larger family of dingbat characters that include various shapes, styles, and sizes. Its circular form, combined with the sans-serif style, sets it apart from the more commonly used linear digit eight. This character can be utilized in a variety of design applications, such as branding materials, logos, or other visual communications where a unique or stylized representation of the number eight is desired. Although the DINGBAT CIRCLED SANS-SERIF DIGIT EIGHT does not have any specific cultural, linguistic, or technical context, it demonstrates the versatility and richness of the Unicode standard, which provides a comprehensive set of characters for a wide range of applications in digital text. By avoiding fluff and focusing on accuracy, this character serves as an excellent example of how Unicode enables designers and typographers to create engaging and effective visual communication.
How to type the ➇ symbol on Windows
Hold Alt and type 10119 on the numpad. Or use Character Map.