Step 1: Determine the UTF-8 encoding bit layout
The character ٮ has the Unicode code point U+066E. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+066E to binary:
00000110 01101110
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011001 10101110
ARABIC LETTER DOTLESS BEH·U+066E
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | D9 AE | 11011001 10101110 |
UTF16 (big Endian) | 06 6E | 00000110 01101110 |
UTF16 (little Endian) | 6E 06 | 01101110 00000110 |
UTF32 (big Endian) | 00 00 06 6E | 00000000 00000000 00000110 01101110 |
UTF32 (little Endian) | 6E 06 00 00 | 01101110 00000110 00000000 00000000 |
Description
The Unicode character U+066E, also known as ARABIC LETTER DOTLESS BEH, is a crucial component in the Arabic script. In digital text, this character plays a pivotal role by representing a specific sound or phoneme in various regional dialects of the Arabic language. While its primary usage is in the written Arabic language, it has also found application in typography, where it contributes to the accuracy and authenticity of Arabic texts, particularly in digital media and software applications. ARABIC LETTER DOTLESS BEH holds a significant place in the history and cultural context of Arabic-speaking regions. It is part of the Arabic script, which dates back over 1,400 years to the advent of Islam. This script has been continually refined over time, with variations emerging from different regions and historical periods. The dotless aspect of this character refers to its distinct feature of not having a central dot that is commonly present in other Arabic letters, which helps distinguish it visually. Technically, the ARABIC LETTER DOTLESS BEH is encoded in the Unicode Standard, which facilitates the reliable and consistent representation of text across various platforms, devices, and software applications. This character is essential for maintaining accurate translations and ensuring proper communication in the Arabic language when using digital technologies. In summary, U+066E (ARABIC LETTER DOTLESS BEH) serves as an integral element in Arabic script, representing a phoneme in multiple dialects while preserving cultural heritage and enabling effective communication in the Arabic-speaking world through digital text.
How to type the ٮ symbol on Windows
Hold Alt and type 1646 on the numpad. Or use Character Map.