Step 1: Determine the UTF-8 encoding bit layout
The character ܃ has the Unicode code point U+0703. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+0703 to binary:
00000111 00000011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011100 10000011
SYRIAC SUPRALINEAR COLON·U+0703
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | DC 83 | 11011100 10000011 |
UTF16 (big Endian) | 07 03 | 00000111 00000011 |
UTF16 (little Endian) | 03 07 | 00000011 00000111 |
UTF32 (big Endian) | 00 00 07 03 | 00000000 00000000 00000111 00000011 |
UTF32 (little Endian) | 03 07 00 00 | 00000011 00000111 00000000 00000000 |
Description
The Unicode character U+0703 represents the Syriac Supralinear Colon (ܘ). This character plays a significant role in digital text, particularly within the Syriac script, which is used for writing the liturgical language of several Eastern Christian churches. The Syriac Supralinear Colon serves as a punctuation mark to separate clauses or ideas, similar to a semicolon in English. Its supra linear property means it sits above the line of text, adding a distinctive visual element that is characteristic of the Syriac script. This character holds cultural and linguistic importance as it aids in the accurate representation and transmission of religious texts and prayers in the Syriac language. It is essential for digital typography to accurately encode and display the unique aspects of the Syriac script, including its punctuation marks like the Supralinear Colon.
How to type the ܃ symbol on Windows
Hold Alt and type 1795 on the numpad. Or use Character Map.