Step 1: Determine the UTF-8 encoding bit layout
The character ፄ has the Unicode code point U+1344. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+1344 to binary:
00010011 01000100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10001101 10000100
ETHIOPIC SYLLABLE TZEE·U+1344
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 8D 84 | 11100001 10001101 10000100 |
UTF16 (big Endian) | 13 44 | 00010011 01000100 |
UTF16 (little Endian) | 44 13 | 01000100 00010011 |
UTF32 (big Endian) | 00 00 13 44 | 00000000 00000000 00010011 01000100 |
UTF32 (little Endian) | 44 13 00 00 | 01000100 00010011 00000000 00000000 |
Description
The Unicode character U+1344, known as ETHIOPIC SYLLABLE TZEE, holds significant value within the Ethiopic script system. This syllabary is primarily used to represent the Amharic language, which is spoken by over 20 million people in Ethiopia and Eritrea. The character U+1344 represents the "tzee" sound, which is an essential phoneme in the Amharic language. In digital text, U+1344 follows the standard Unicode rules for its usage and representation. It can be used independently or in combination with other Ethiopic script characters to form words or sentences. The Ethiopic script is a unique abugida system where each character represents a consonant sound along with an inherent vowel "a". A diacritic mark can be added to change the inherent vowel, as seen in U+1344. The Ethiopic script has a long history that dates back to the 4th century AD, making it one of the oldest writing systems in the world. U+1344 and other characters in the Ethiopic script are crucial for preserving the rich linguistic and cultural heritage of the Amharic language and its speakers. The inclusion of these characters in the Unicode standard allows for a more accurate representation of digital texts in Amharic, facilitating communication and understanding across various platforms and devices worldwide.
How to type the ፄ symbol on Windows
Hold Alt and type 4932 on the numpad. Or use Character Map.