Step 1: Determine the UTF-8 encoding bit layout
The character ⷋ has the Unicode code point U+2DCB. In UTF-8 it is encoded using 3 bytes, because its code point falls in the range 0x0800 to 0xFFFF.
The encoding therefore carries 16 payload bits within its 24 bits, in the format:
1110xxxx 10xxxxxx 10xxxxxx
where the x's are the payload bits.
UTF-8 encoding bit layout by code point range
Codepoint Range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |
Step 2: Obtain the payload bits
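Convert the hexadecimal code point U+2DCB to binary:
00101101 11001011
Those are the 16 payload bits.
Step 3: Fill in the bits to match the bit pattern
Split the payload bits into groups of 4, 6, and 6 bits (0010, 110111, 001011) and place each group after the fixed prefix of its byte (1110, 10, 10) to obtain the final bytes:
11100010 10110111 10001011
In hexadecimal, these bytes are E2 B7 8B.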
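The three steps above can be sketched in Python. This is a minimal illustration, not a replacement for the built-in codec; the function name `utf8_encode` is a hypothetical helper introduced here, and the code assumes the input is a single code point given as an integer.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode one Unicode code point to UTF-8 by hand (illustrative helper)."""
    # Reject values outside Unicode and the surrogate range, which UTF-8 cannot encode.
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not an encodable Unicode scalar value")

    # Step 1: pick the bit layout from the code point range.
    if code_point <= 0x7F:
        return bytes([code_point])            # 0xxxxxxx
    if code_point <= 0x7FF:
        lead, n_cont = 0b11000000, 1          # 110xxxxx + 1 continuation byte
    elif code_point <= 0xFFFF:
        lead, n_cont = 0b11100000, 2          # 1110xxxx + 2 continuation bytes
    else:
        lead, n_cont = 0b11110000, 3          # 11110xxx + 3 continuation bytes

    # Steps 2 and 3: the code point itself is the payload; the top bits go
    # into the lead byte, then 6 bits go into each 10xxxxxx continuation byte.
    out = [lead | (code_point >> (6 * n_cont))]
    for i in range(n_cont - 1, -1, -1):
        out.append(0b10000000 | ((code_point >> (6 * i)) & 0b00111111))
    return bytes(out)


encoded = utf8_encode(0x2DCB)
print(" ".join(f"{b:02X}" for b in encoded))   # E2 B7 8B
print(" ".join(f"{b:08b}" for b in encoded))   # 11100010 10110111 10001011
```

In practice `"\u2dcb".encode("utf-8")` gives the same bytes; the hand-written version only makes the bit layout visible.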
ETHIOPIC SYLLABLE KYAA·U+2DCB
ⷋ
Character Information
Code Point
U+2DCB
HEX
2DCB
Unicode Plane
Basic Multilingual Plane
Category
Other Letter
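The fields above can be looked up programmatically. The sketch below uses Python's standard `unicodedata` module; the dictionary keys are illustrative names chosen here, and the plane is derived by assuming each plane spans 0x10000 code points, so plane 0 is the Basic Multilingual Plane.

```python
import unicodedata

ch = "\u2dcb"
code_point = ord(ch)

info = {
    "code_point": f"U+{code_point:04X}",
    "name": unicodedata.name(ch),          # official Unicode character name
    "category": unicodedata.category(ch),  # "Lo" is the short code for Other Letter
    "plane": code_point >> 16,             # 0 is the Basic Multilingual Plane
}
print(info)
```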
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF-8 | E2 B7 8B | 11100010 10110111 10001011 |
UTF-16 (big endian) | 2D CB | 00101101 11001011 |
UTF-16 (little endian) | CB 2D | 11001011 00101101 |
UTF-32 (big endian) | 00 00 2D CB | 00000000 00000000 00101101 11001011 |
UTF-32 (little endian) | CB 2D 00 00 | 11001011 00101101 00000000 00000000 |
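The rows of this table can be reproduced with Python's built-in codecs. This is a small sketch; the `rows` dictionary is a name introduced here for illustration, and the explicit `-be` and `-le` codec names are used so that no byte order mark is prepended.

```python
ch = "\u2dcb"
encodings = ["utf-8", "utf-16-be", "utf-16-le", "utf-32-be", "utf-32-le"]

rows = {}
for enc in encodings:
    data = ch.encode(enc)
    hex_form = " ".join(f"{b:02X}" for b in data)
    bin_form = " ".join(f"{b:08b}" for b in data)
    rows[enc] = (hex_form, bin_form)
    print(f"{enc} | {hex_form} | {bin_form} |")
```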
HTML Entity
&#11723;
URI Encoded
%E2%B7%8B
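Both forms can be derived from the code point. The sketch below assumes the HTML form is a decimal numeric character reference (11723 is the decimal value of 0x2DCB) and that the URI form percent-encodes the UTF-8 bytes, which is the default behaviour of `urllib.parse.quote`. The variable names are illustrative.

```python
import html
from urllib.parse import quote, unquote

ch = "\u2dcb"

decimal_ref = f"&#{ord(ch)};"    # decimal numeric character reference
hex_ref = f"&#x{ord(ch):X};"     # equivalent hexadecimal form
uri_encoded = quote(ch)          # percent-encodes the UTF-8 bytes

print(decimal_ref, hex_ref, uri_encoded)

# Round trip: both representations decode back to the original character.
assert html.unescape(decimal_ref) == ch
assert unquote(uri_encoded) == ch
```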
Description
The Unicode character U+2DCB, ETHIOPIC SYLLABLE KYAA, is a syllable of the Ethiopic (Ge'ez) script. In digital text, it serves as a building block for writing words in Ethiopian Semitic languages. The Ge'ez script has been in use since the 4th century AD, a history of more than 1,600 years, and encoding its syllables in Unicode supports digital communication and literacy in these languages.
How to type the ⷋ symbol on Windows
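Hold Alt and type 11723 (the decimal value of 0x2DCB) on the numeric keypad. This works only in applications that accept decimal Unicode Alt codes, such as Microsoft Word; elsewhere, use Character Map.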