Step 1: Determine the UTF-8 encoding bit layout
The character ۔ has the Unicode code point U+06D4. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+06D4 to binary:
00000110 11010100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011011 10010100
ARABIC FULL STOP·U+06D4
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | DB 94 | 11011011 10010100 |
UTF16 (big Endian) | 06 D4 | 00000110 11010100 |
UTF16 (little Endian) | D4 06 | 11010100 00000110 |
UTF32 (big Endian) | 00 00 06 D4 | 00000000 00000000 00000110 11010100 |
UTF32 (little Endian) | D4 06 00 00 | 11010100 00000110 00000000 00000000 |
Description
The Unicode character U+06D4 is designated as the "ARABIC FULL STOP." In the realm of digital text, it serves a crucial function in Arabic typography by acting as a punctuation mark to demarcate sentences and aid readability. Its presence is integral to maintaining coherence and clarity within written documents composed in Arabic script. Unlike its English counterpart, which typically signifies the end of a sentence with a period or ".", the ARABIC FULL STOP (U+06D4) maintains cultural and linguistic significance due to the unique structure of Arabic script. The character is vital for proper text rendering in various digital environments, such as websites, mobile applications, and software programs that support Unicode, which allows for the accurate representation of more than 100,000 characters from diverse writing systems across the globe. The ARABIC FULL STOP (U+06D4) is a crucial yet often overlooked element in the world of digital typography and communication.
How to type the ۔ symbol on Windows
Hold Alt and type 1748 on the numpad. Or use Character Map.