Step 1: Determine the UTF-8 encoding bit layout
The character ܭ has the Unicode code point U+072D. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+072D to binary:
00000111 00101101
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011100 10101101
SYRIAC LETTER PERSIAN BHETH·U+072D
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | DC AD | 11011100 10101101 |
UTF16 (big Endian) | 07 2D | 00000111 00101101 |
UTF16 (little Endian) | 2D 07 | 00101101 00000111 |
UTF32 (big Endian) | 00 00 07 2D | 00000000 00000000 00000111 00101101 |
UTF32 (little Endian) | 2D 07 00 00 | 00101101 00000111 00000000 00000000 |
Description
The Unicode character U+072D, known as the Syriac Letter Persian Bheth, is a typographic symbol significant in digital text representation, particularly for those working with languages that utilize the Syriac script. This character is of great importance within the field of linguistics and cultural studies, as it serves to represent the sound /p/ or /b/, depending on its position within a word or sentence. The Syriac Letter Persian Bheth finds its roots in ancient civilizations, where it was primarily used within the context of the Aramaic language, spoken widely in the Middle East and Asia Minor during the time of Jesus Christ and the Apostles. While it may not be as prevalent today, this character's continued presence in modern digital text demonstrates the rich cultural and linguistic history that Unicode strives to preserve and represent accurately. In terms of its technical context, the Syriac Letter Persian Bheth adheres to the rules and regulations outlined by the Unicode Standard, which aims to ensure consistent encoding and representation across various platforms and devices. This allows for greater interoperability between different digital systems, fostering a more seamless user experience when engaging with languages that utilize unique or lesser-known characters such as U+072D.
How to type the ܭ symbol on Windows
Hold Alt and type 1837 on the numpad. Or use Character Map.