Step 1: Determine the UTF-8 encoding bit layout
The character ؏ has the Unicode code point U+060F. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+060F to binary:
00000110 00001111
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011000 10001111
ARABIC SIGN MISRA·U+060F
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | D8 8F | 11011000 10001111 |
UTF16 (big Endian) | 06 0F | 00000110 00001111 |
UTF16 (little Endian) | 0F 06 | 00001111 00000110 |
UTF32 (big Endian) | 00 00 06 0F | 00000000 00000000 00000110 00001111 |
UTF32 (little Endian) | 0F 06 00 00 | 00001111 00000110 00000000 00000000 |
Description
The Unicode character U+060F, Arabic Sign Misra, is a unique symbol used primarily in the Arabic language. In its typical usage, this glyph serves as an interpunct in digital text, a punctuation mark that visually separates words or clauses without creating a pause like a comma does. This character plays a crucial role in the readability and legibility of Arabic text. As part of the rich typographic heritage of the Arabic script, U+060F is a vital component of the language's grammar and syntax. The Misra mark can be used in both traditional and modern forms of the Arabic alphabet. It has no equivalent in the Latin alphabet, making it an essential feature for accurate digital representation of Arabic texts. Therefore, U+060F holds a significant position in ensuring linguistic accuracy and cultural authenticity within the realm of digital text processing and presentation.
How to type the ؏ symbol on Windows
Hold Alt and type 1551 on the numpad. Or use Character Map.