ETHIOPIC SYLLABLE SE·U+1235

Character Information

Code Point
U+1235
HEX
1235
Unicode Plane
Basic Multilingual Plane
Category
Other Letter

Character Representations

Click elements to copy
EncodingHexBinary
UTF8
E1 88 B5
11100001 10001000 10110101
UTF16 (big Endian)
12 35
00010010 00110101
UTF16 (little Endian)
35 12
00110101 00010010
UTF32 (big Endian)
00 00 12 35
00000000 00000000 00010010 00110101
UTF32 (little Endian)
35 12 00 00
00110101 00010010 00000000 00000000
HTML Entity
ስ
URI Encoded
%E1%88%B5

Description

U+1235, known as ETHIOPIC SYLLABLE SE, is a unique character within the Unicode Standard that holds significant importance in Ethiopian orthography. This syllable is part of the Ethiopic script, which has been used for centuries to write the Amharic language - the official and most widely spoken language in Ethiopia. The ETHIOPIC SYLLABLE SE character serves as a building block for constructing words within the Ethiopian writing system, where it usually follows the vowel "a" and can be combined with other consonants or syllables to form complex words. In digital text, U+1235 helps preserve the authenticity and accuracy of Ethiopian texts, enabling readers worldwide to access and comprehend works written in Amharic or other Ethiopian languages. This character also aids in preserving linguistic heritage and facilitating communication among Ethiopian communities that rely on written forms of their languages for cultural, religious, and educational purposes. The Unicode Consortium introduced the U+1235 character to ensure consistent encoding of Ethiopic texts across different digital platforms, such as websites, documents, and software applications. This has been instrumental in supporting the Ethiopian language's rich linguistic history and cultural identity, while promoting internationalization and multilingual support within the realm of computing and technology.

How to type the symbol on Windows

Hold Alt and type 4661 on the numpad. Or use Character Map.

  1. Step 1: Determine the UTF-8 encoding bit layout

    The character has the Unicode code point U+1235. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of 0x0800 to 0xffff.

    Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format: 1110xxxx 10xxxxxx 10xxxxxx
    Where the x are the payload bits.

    UTF-8 Encoding bit layout by codepoint range
    Codepoint RangeBytesBit patternPayload length
    U+0000 - U+007F10xxxxxxx7 bits
    U+0080 - U+07FF2110xxxxx 10xxxxxx11 bits
    U+0800 - U+FFFF31110xxxx 10xxxxxx 10xxxxxx16 bits
    U+10000 - U+10FFFF411110xxx 10xxxxxx 10xxxxxx 10xxxxxx21 bits
  2. Step 2: Obtain the payload bits:

    Convert the hexadecimal code point U+1235 to binary: 00010010 00110101. Those are the payload bits.

  3. Step 3: Fill in the bits to match the bit pattern:

    Obtain the final bytes by arranging the paylod bits to match the bit layout:
    11100001 10001000 10110101