Step 1: Determine the UTF-8 encoding bit layout
The character ጿ has the Unicode code point U+133F. In UTF-8, it is encoded using 3 bytes because its code point is in the range 0x0800 to 0xFFFF.
The encoding therefore carries 16 payload bits inside 3 bytes (24 bits) and has the format: 1110xxxx 10xxxxxx 10xxxxxx
Where the x are the payload bits.

UTF-8 Encoding bit layout by codepoint range

Codepoint Range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |

Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+133F to binary: 00010011 00111111. Those are the payload bits.
Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the payload bits to match the bit layout:
11100001 10001100 10111111
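The three steps above can be sketched in Python. This is a minimal illustration, not a replacement for the built-in codec: `utf8_encode` is a hypothetical helper name introduced here, and the check against `str.encode` is only there to confirm the hand-built bytes.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode a single Unicode code point to UTF-8 by hand (illustrative)."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not an encodable Unicode code point")

    # Step 1: choose the bit layout from the code point range.
    if code_point <= 0x7F:
        return bytes([code_point])                     # 0xxxxxxx
    if code_point <= 0x7FF:
        lead, continuation_count = 0b11000000, 1       # 110xxxxx
    elif code_point <= 0xFFFF:
        lead, continuation_count = 0b11100000, 2       # 1110xxxx
    else:
        lead, continuation_count = 0b11110000, 3       # 11110xxx

    # Steps 2 and 3: peel off 6 payload bits at a time, starting from the
    # low end, and place each group in a 10xxxxxx continuation byte.
    payload = code_point
    tail = []
    for _ in range(continuation_count):
        tail.append(0b10000000 | (payload & 0b00111111))
        payload >>= 6

    # The remaining high bits go into the lead byte.
    return bytes([lead | payload] + tail[::-1])


encoded = utf8_encode(0x133F)
print(" ".join(f"{b:02X}" for b in encoded))   # E1 8C BF
print(" ".join(f"{b:08b}" for b in encoded))   # 11100001 10001100 10111111
print(encoded == "\u133f".encode("utf-8"))     # True
```

For U+133F the loop produces the continuation bytes 10111111 and 10001100, and the leftover bits 0001 land in the lead byte 1110xxxx.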
ETHIOPIC SYLLABLE TSWA·U+133F
Character Information
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF-8 | E1 8C BF | 11100001 10001100 10111111 |
UTF-16 (big-endian) | 13 3F | 00010011 00111111 |
UTF-16 (little-endian) | 3F 13 | 00111111 00010011 |
UTF-32 (big-endian) | 00 00 13 3F | 00000000 00000000 00010011 00111111 |
UTF-32 (little-endian) | 3F 13 00 00 | 00111111 00010011 00000000 00000000 |
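The rows of this table can be reproduced with Python's standard codecs. The sketch below assumes the codec names `utf-16-be`, `utf-16-le`, `utf-32-be` and `utf-32-le`, which encode without a byte order mark; the variable names are illustrative.

```python
char = "\u133f"  # ETHIOPIC SYLLABLE TSWA

# Standard-library codec names; the explicit -be/-le variants emit no BOM.
codec_names = ["utf-8", "utf-16-be", "utf-16-le", "utf-32-be", "utf-32-le"]

representations = {name: char.encode(name) for name in codec_names}

for name, raw in representations.items():
    hex_form = " ".join(f"{b:02X}" for b in raw)
    bin_form = " ".join(f"{b:08b}" for b in raw)
    print(f"{name} | {hex_form} | {bin_form} |")
```

Because U+133F lies in the Basic Multilingual Plane, its UTF-16 form is a single 16-bit unit equal to the code point itself, and the little-endian variants simply reverse the byte order.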
Description
The Unicode character U+133F, ETHIOPIC SYLLABLE TSWA, belongs to the Ethiopic script, which is used primarily to write the Amharic language. In digital text it serves as a building block for words and phrases in that script. TSWA is one of a series of 306 Ethiopic syllables, each represented by its own Unicode character. The script is based on Ge'ez, an ancient Semitic language, and has evolved over centuries to cover various languages and dialects of the region. The character therefore carries cultural significance as a component of written Amharic and Ethiopian literature, and linguistic significance as the representation of a specific sound.
How to type the ጿ symbol on Windows
Hold Alt and type 4927 on the numeric keypad, or use Character Map.
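The number 4927 is the decimal form of the hexadecimal code point 133F. A short Python check of that conversion, with illustrative variable names:

```python
code_point = int("133F", 16)     # hexadecimal code point as an integer
alt_code = str(code_point)       # decimal digits to type on the keypad

print(alt_code)                  # 4927
print(chr(code_point))           # ጿ
```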