Step 1: Determine the UTF-8 encoding bit layout
The character ඒ has the Unicode code point U+0D92. In UTF-8, it is encoded using 3 bytes because its code point falls in the range 0x0800 to 0xFFFF.
The 16 payload bits are therefore spread across 24 bits (3 bytes) in the following format:
1110xxxx 10xxxxxx 10xxxxxx
where each x is a payload bit.
UTF-8 encoding bit layout by code point range
Code point range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |
Step 2: Obtain the payload bits
Convert the hexadecimal code point U+0D92 to binary:
00001101 10010010
Those are the payload bits.
Step 3: Fill in the bits to match the bit pattern
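Obtain the final bytes by arranging the payload bits to match the bit layout. Split the 16 payload bits into groups of 4, 6, and 6 bits (0000, 110110, 010010) and place each group after its prefix (1110, 10, 10):
11100000 10110110 10010010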
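The three steps above can be sketched in Python as follows. This is a minimal illustration that handles only the 3-byte range; the function name `encode_utf8_3byte` is a hypothetical helper, not part of any library, and the result is compared with Python's built-in encoder as a sanity check.

```python
def encode_utf8_3byte(code_point: int) -> bytes:
    """Encode a code point in U+0800..U+FFFF as three UTF-8 bytes.

    Sketch only: other ranges are rejected, and the surrogate range
    U+D800..U+DFFF (not valid in UTF-8) is not checked.
    """
    if not 0x0800 <= code_point <= 0xFFFF:
        raise ValueError("this sketch only handles the 3-byte range")
    byte1 = 0b11100000 | (code_point >> 12)           # 1110xxxx: top 4 payload bits
    byte2 = 0b10000000 | ((code_point >> 6) & 0x3F)   # 10xxxxxx: middle 6 payload bits
    byte3 = 0b10000000 | (code_point & 0x3F)          # 10xxxxxx: low 6 payload bits
    return bytes([byte1, byte2, byte3])


encoded = encode_utf8_3byte(0x0D92)
print(" ".join(f"{b:08b}" for b in encoded))  # 11100000 10110110 10010010
print(encoded.hex(" ").upper())               # E0 B6 92
print(encoded == "\u0D92".encode("utf-8"))    # True
```

The shifts by 12 and 6 select the 4-bit and 6-bit groups from Step 3, and the mask `0x3F` keeps exactly six bits for each continuation byte.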
SINHALA LETTER EEYANNA·U+0D92
Character Information
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF-8 | E0 B6 92 | 11100000 10110110 10010010 |
UTF-16 (big-endian) | 0D 92 | 00001101 10010010 |
UTF-16 (little-endian) | 92 0D | 10010010 00001101 |
UTF-32 (big-endian) | 00 00 0D 92 | 00000000 00000000 00001101 10010010 |
UTF-32 (little-endian) | 92 0D 00 00 | 10010010 00001101 00000000 00000000 |
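The hex column of this table can be reproduced with Python's standard codecs. The sketch below is illustrative only; the `ENCODINGS` mapping and the `representations` helper are names introduced here, while the codec names (`utf-8`, `utf-16-be`, and so on) are Python's own.

```python
# Labels from the table mapped to Python's standard codec names.
ENCODINGS = {
    "UTF-8": "utf-8",
    "UTF-16 (big-endian)": "utf-16-be",
    "UTF-16 (little-endian)": "utf-16-le",
    "UTF-32 (big-endian)": "utf-32-be",
    "UTF-32 (little-endian)": "utf-32-le",
}


def representations(char: str) -> dict[str, str]:
    """Return the space-separated hex bytes of `char` in each encoding."""
    return {
        label: char.encode(codec).hex(" ").upper()
        for label, codec in ENCODINGS.items()
    }


for label, hex_bytes in representations("\u0D92").items():
    print(f"{label}: {hex_bytes}")
```

The explicit `-be` and `-le` codecs are used so that no byte order mark is prepended, which keeps the output comparable with the table.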
Description
The Unicode character U+0D92 represents the Sinhala letter Eeyanna (එයාන). It belongs to the Sinhala script, which is used primarily to write Sinhala (Sinhalese), the language spoken by the majority of the population of Sri Lanka. Together with the Tamil script, the Sinhala script is one of the main writing systems used on the island.
The Sinhala script dates back to at least the 3rd century BCE, so the letter is also part of Sri Lanka's long literary tradition.
On the technical side, the Unicode Standard assigns the letter a fixed code point, which allows it to be encoded, displayed, and processed consistently across devices and systems.
How to type the ඒ symbol on Windows
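Hold Alt and type 3474 (the decimal value of U+0D92) on the numeric keypad; this works only in applications that accept decimal Unicode Alt codes. Alternatively, use Character Map.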
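The number typed in the Alt code is the code point written in decimal. As a quick check, the conversion can be done in Python; the variable name `code_point` is chosen here purely for illustration.

```python
code_point = int("0D92", 16)  # hexadecimal code point to decimal
print(code_point)             # 3474
print(chr(code_point))        # the character itself, U+0D92
```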