Step 1: Determine the UTF-8 encoding bit layout
The character ݹ has the Unicode code point U+0779. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+0779 to binary:
00000111 01111001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011101 10111001
ARABIC LETTER WAW WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE·U+0779
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | DD B9 | 11011101 10111001 |
UTF16 (big Endian) | 07 79 | 00000111 01111001 |
UTF16 (little Endian) | 79 07 | 01111001 00000111 |
UTF32 (big Endian) | 00 00 07 79 | 00000000 00000000 00000111 01111001 |
UTF32 (little Endian) | 79 07 00 00 | 01111001 00000111 00000000 00000000 |
Description
U+0779 is a unique character in the Arabic typography system, specifically known as "ARABIC LETTER WAW WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE." It plays a pivotal role in digital text by enabling linguistic and cultural nuances to be represented accurately within the Arabic language. This character is predominantly used in digital platforms, where it helps to depict words and phrases with enhanced clarity and precision. Its application is primarily observed in written forms of the Arabic script. In its typographic representation, ARABIC LETTER WAW WITH EXTENDED ARABIC-INDIC DIGIT THREE ABOVE is a combination of two components. The first part, "ARABIC LETTER WAW," is one of the 28 basic Arabic letters. Waw in Arabic stands for 'w' or 'v' sound, depending on its place within a word and the context it appears. The second component, "EXTENDED ARABIC-INDIC DIGIT THREE ABOVE," denotes a numerical representation of '3,' which is placed above the letter 'Waw.' The cultural significance of this character lies in the Arabic numeral system that uses an Indic numeral form as its base. This indicates a historical connection between the Arabic and Indian cultures, showcasing how they have influenced each other over time. The technical context of U+0779 is crucial in digital text, as it helps maintain consistency and accuracy while representing Arabic words and phrases that require additional numerical information.
How to type the ݹ symbol on Windows
Hold Alt and type 1913 on the numpad. Or use Character Map.