Step 1: Determine the UTF-8 encoding bit layout
The character ⦃ has the Unicode code point U+2983. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2983 to binary:
00101001 10000011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10100110 10000011
LEFT WHITE CURLY BRACKET·U+2983
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 A6 83 | 11100010 10100110 10000011 |
UTF16 (big Endian) | 29 83 | 00101001 10000011 |
UTF16 (little Endian) | 83 29 | 10000011 00101001 |
UTF32 (big Endian) | 00 00 29 83 | 00000000 00000000 00101001 10000011 |
UTF32 (little Endian) | 83 29 00 00 | 10000011 00101001 00000000 00000000 |
Description
The Unicode character U+2983, also known as the Left White Curly Bracket, plays a significant role in digital text formatting, particularly within programming languages and mathematical expressions. This symbol is commonly used to denote the beginning of an array or sequence in many programming languages such as JavaScript, Python, and C++. It often appears alongside its right-facing counterpart (U+2984) to enclose and group together related elements, aiding in code readability and clarity. In the context of mathematical notation, it is employed to signify an open fraction or a subset within a set. The Left White Curly Bracket has no specific cultural or linguistic significance but is universally understood as a symbol for grouping elements. As with any other Unicode character, its usage depends on the programming language or application, ensuring efficient and accurate communication of ideas in digital text.
How to type the ⦃ symbol on Windows
Hold Alt and type 10627 on the numpad. Or use Character Map.