Step 1: Determine the UTF-8 encoding bit layout
The character ⅁ has the Unicode code point U+2141. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2141 to binary:
00100001 01000001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10000101 10000001
TURNED SANS-SERIF CAPITAL G·U+2141
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 85 81 | 11100010 10000101 10000001 |
UTF16 (big Endian) | 21 41 | 00100001 01000001 |
UTF16 (little Endian) | 41 21 | 01000001 00100001 |
UTF32 (big Endian) | 00 00 21 41 | 00000000 00000000 00100001 01000001 |
UTF32 (little Endian) | 41 21 00 00 | 01000001 00100001 00000000 00000000 |
Description
U+2141, known as the "TURNED SANS-SERIF CAPITAL G," is a character within the Unicode standard designed for digital text representation. The TURNED SANS-SERIF CAPITAL G holds significance due to its unique formatting in typography. This character takes on the shape of an uppercase 'G' and exhibits a distinctive sans-serif style, which is devoid of the small lines or decorative strokes at the ends of letters typically found in serif fonts. The primary usage of U+2141 lies within digital typography, where it can be employed to create distinct visual contrast when incorporated into design elements, such as logos and headings. While its use is not widespread, it holds value for designers seeking a specific aesthetic or wishing to display text in an unconventional manner. As part of the Unicode standard, U+2141 ensures consistency across various platforms and devices when representing this particular sans-serif capital G character. No notable cultural, linguistic, or technical context is associated with the TURNED SANS-SERIF CAPITAL G. Its primary role lies in visual representation and typographic design, catering to the creative needs of designers working with digital text.
How to type the ⅁ symbol on Windows
Hold Alt and type 8513 on the numpad. Or use Character Map.