Step 1: Determine the UTF-8 encoding bit layout
The character ዊ has the Unicode code point U+12CA. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+12CA to binary:
00010010 11001010
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10001011 10001010
ETHIOPIC SYLLABLE WI·U+12CA
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 8B 8A | 11100001 10001011 10001010 |
UTF16 (big Endian) | 12 CA | 00010010 11001010 |
UTF16 (little Endian) | CA 12 | 11001010 00010010 |
UTF32 (big Endian) | 00 00 12 CA | 00000000 00000000 00010010 11001010 |
UTF32 (little Endian) | CA 12 00 00 | 11001010 00010010 00000000 00000000 |
Description
The Unicode character U+12CA is known as "ETHIOPIC SYLLABLE WI" and holds a significant position in the Ethiopian language system. It is an integral component of the Ge'ez script, which has been in use since ancient times, dating back to around the 4th century AD. In digital text, U+12CA serves as a building block for forming words and phrases within the Ethiopian linguistic context. This character, along with other Ethiopic syllables, enables accurate representation of the Ethiopian languages in the digital realm. As a result, its usage is crucial for preserving cultural heritage and facilitating communication among Ethiopian speakers worldwide. The Ge'ez script itself is considered a part of the world's oldest written tradition, and its continued use today reflects the rich history and cultural identity of Ethiopia and neighboring countries where these languages are spoken.
How to type the ዊ symbol on Windows
Hold Alt and type 4810 on the numpad. Or use Character Map.