Step 1: Determine the UTF-8 encoding bit layout
The character ✐ has the Unicode code point U+2710. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2710 to binary:
00100111 00010000
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10011100 10010000
UPPER RIGHT PENCIL·U+2710
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 9C 90 | 11100010 10011100 10010000 |
UTF16 (big Endian) | 27 10 | 00100111 00010000 |
UTF16 (little Endian) | 10 27 | 00010000 00100111 |
UTF32 (big Endian) | 00 00 27 10 | 00000000 00000000 00100111 00010000 |
UTF32 (little Endian) | 10 27 00 00 | 00010000 00100111 00000000 00000000 |
Description
The Unicode character U+2710, known as the UPPER RIGHT PENCIL, is a typographical symbol used primarily in digital text for representing a rightward-pointing pencil. While not universally recognized or widely utilized, this symbol serves a specific role in contexts where a clear distinction between left and right orientations of objects or actions is required. In certain fields like mathematics, engineering, or technical documentation, it may be employed to indicate the directional flow of a process or system. However, its usage is relatively limited, likely due to the lack of a well-established convention for incorporating directional pencils in these disciplines. Despite its limited scope, U+2710 is an example of how Unicode strives to encompass diverse symbols and characters from various cultures and languages, ensuring that digital text can accurately represent the full range of human communication.
How to type the ✐ symbol on Windows
Hold Alt and type 10000 on the numpad. Or use Character Map.