Step 1: Determine the UTF-8 encoding bit layout
The character ދ has the Unicode code point U+078B. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+078B to binary:
00000111 10001011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011110 10001011
THAANA LETTER DHAALU·U+078B
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | DE 8B | 11011110 10001011 |
UTF16 (big Endian) | 07 8B | 00000111 10001011 |
UTF16 (little Endian) | 8B 07 | 10001011 00000111 |
UTF32 (big Endian) | 00 00 07 8B | 00000000 00000000 00000111 10001011 |
UTF32 (little Endian) | 8B 07 00 00 | 10001011 00000111 00000000 00000000 |
Description
U+078B is the Unicode character code for Thaana Letter Dhaalu, a symbol used predominantly within the Thaana script. Thaana, also known as Sesotho or Adlam, is the writing system of the Fula people from Senegal and Gambia, who have historically relied on this script to communicate their rich cultural heritage and literature. The Thaana script is composed of 28 letters, and U+078B specifically represents the Dhaalu character. In digital text, U+078B serves a vital role in maintaining and preserving linguistic identity, enabling seamless communication for those who utilize the Thaana script in their daily lives. As with other Unicode characters, U+078B promotes cultural diversity and linguistic expression within the global digital landscape.
How to type the ދ symbol on Windows
Hold Alt and type 1931 on the numpad. Or use Character Map.