Step 1: Determine the UTF-8 encoding bit layout
The character ُ has the Unicode code point U+064F. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+064F to binary:
00000110 01001111
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011001 10001111
ARABIC DAMMA·U+064F
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | D9 8F | 11011001 10001111 |
UTF16 (big Endian) | 06 4F | 00000110 01001111 |
UTF16 (little Endian) | 4F 06 | 01001111 00000110 |
UTF32 (big Endian) | 00 00 06 4F | 00000000 00000000 00000110 01001111 |
UTF32 (little Endian) | 4F 06 00 00 | 01001111 00000110 00000000 00000000 |
Description
The Unicode character U+064F, known as the Arabic Damma, is a crucial component of the Arabic script used for writing in multiple languages across North Africa and West Asia. In digital text, it serves as one of the four vertical marks employed to connect or separate words within a sentence or paragraph, demonstrating its essential role in maintaining text readability and coherence. This particular character is specifically utilized after specific Arabic letters (such as "ف", "ق", "ص" and "ک") to either create a new word or indicate vowel patterns. The usage of U+064F reflects the importance of diacritical marks in the Arabic language, which are essential for conveying meaning, grammatical structure, and phonetic pronunciation.
How to type the ُ symbol on Windows
Hold Alt and type 1615 on the numpad. Or use Character Map.