Step 1: Determine the UTF-8 encoding bit layout
The character ▏ has the Unicode code point U+258F. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+258F to binary:
00100101 10001111
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10010110 10001111
LEFT ONE EIGHTH BLOCK·U+258F
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 96 8F | 11100010 10010110 10001111 |
UTF16 (big Endian) | 25 8F | 00100101 10001111 |
UTF16 (little Endian) | 8F 25 | 10001111 00100101 |
UTF32 (big Endian) | 00 00 25 8F | 00000000 00000000 00100101 10001111 |
UTF32 (little Endian) | 8F 25 00 00 | 10001111 00100101 00000000 00000000 |
Description
The Unicode character U+258F, known as the Left One Eighth Block, primarily serves as a formatting control in digital text. It is part of a set of box-drawing characters used to create simple graphical elements such as borders and dividers within text content. Typically, these characters are used in monospaced fonts, where each character occupies the same amount of horizontal space, enabling precise alignment of the graphic elements. The Left One Eighth Block, along with other box-drawing characters, has found significant usage in creating simple tables or diagrams within plain text documents, especially prior to the widespread use of graphical interfaces and HTML for web content. While it is not commonly used in contemporary digital communication, it retains a niche role in specialized contexts such as programming, markdown syntax, and ASCII art.
How to type the ▏ symbol on Windows
Hold Alt and type 9615 on the numpad. Or use Character Map.