Step 1: Determine the UTF-8 encoding bit layout
The character ڶ has the Unicode code point U+06B6. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+06B6 to binary:
00000110 10110110
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011010 10110110
ARABIC LETTER LAM WITH DOT ABOVE·U+06B6
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | DA B6 | 11011010 10110110 |
UTF16 (big Endian) | 06 B6 | 00000110 10110110 |
UTF16 (little Endian) | B6 06 | 10110110 00000110 |
UTF32 (big Endian) | 00 00 06 B6 | 00000000 00000000 00000110 10110110 |
UTF32 (little Endian) | B6 06 00 00 | 10110110 00000110 00000000 00000000 |
Description
U+06B6 Arabic Letter Lam with Dot Above is a Unicode character that plays a crucial role in the Arabic language. As an essential component of digital text, it enables accurate representation of the Arabic script in computers and software applications. The character is derived from the Arabic letter Lām (ل) and is characterized by a distinctive dot above its vertical stroke. This typographical feature differentiates it from other similar letters in the Arabic alphabet. U+06B6 is widely used in written communication across various domains, such as literature, religious texts, legal documents, and scientific publications. Its usage adheres to strict rules of Arabic writing systems, ensuring correct pronunciation and comprehension for native speakers. Overall, U+06B6 contributes significantly to the rich linguistic heritage of the Arabic language and its digital representation worldwide.
How to type the ڶ symbol on Windows
Hold Alt and type 1718 on the numpad. Or use Character Map.