Step 1: Determine the UTF-8 encoding bit layout
The character ▜ has the Unicode code point U+259C. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+259C to binary:
00100101 10011100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10010110 10011100
QUADRANT UPPER LEFT AND UPPER RIGHT AND LOWER RIGHT·U+259C
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 96 9C | 11100010 10010110 10011100 |
UTF16 (big Endian) | 25 9C | 00100101 10011100 |
UTF16 (little Endian) | 9C 25 | 10011100 00100101 |
UTF32 (big Endian) | 00 00 25 9C | 00000000 00000000 00100101 10011100 |
UTF32 (little Endian) | 9C 25 00 00 | 10011100 00100101 00000000 00000000 |
Description
The Unicode character U+259C is known as the "QUADRANT UPPER LEFT AND UPPER RIGHT AND LOWER RIGHT" character. This typographical symbol serves a specific role in digital text, particularly within technical documents and software applications. It is often used to indicate a quadrant or section of an area in diagrams, charts, and other visual representations where four distinct sections need to be clearly delineated. In the context of digital text, U+259C finds its usage predominantly within technical documentation, such as user manuals, programming languages, and engineering drawings. Its distinct shape, featuring two upward-pointing lines to form an upper left quadrant and two downward-pointing lines for the lower right quadrant, visually delineates the four quadrants of a circle or square. The character is part of a larger group of Unicode block characters known as "Box Drawing" (U+2500 to U+257F), which consist of various symbols used for creating simple box-like shapes and lines in text mode. The specific function and usage of U+259C highlight the versatility of Unicode in representing a wide range of characters, symbols, and glyphs across different cultures, languages, and industries.
How to type the ▜ symbol on Windows
Hold Alt and type 9628 on the numpad. Or use Character Map.