Step 1: Determine the UTF-8 encoding bit layout
The character ؾ has the Unicode code point U+063E. In UTF-8 it is encoded using 2 bytes because its code point falls in the range 0x0080 to 0x07FF.
The encoding therefore carries 11 payload bits within 2 bytes (16 bits) and has the format: 110xxxxx 10xxxxxx
Where the x are the payload bits.
UTF-8 encoding bit layout by code point range
Code point range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |
Step 2: Obtain the payload bits:
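Convert the hexadecimal code point U+063E to binary: 00000110 00111110. The payload bits are the low 11 bits of this value: 110 00111110.
Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the payload bits to match the bit layout. The high 5 payload bits (11000) follow the 110 prefix of the first byte, and the low 6 payload bits (111110) follow the 10 prefix of the second byte: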
11011000 10111110
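The three steps above can be sketched in Python. This is a minimal illustration rather than a production encoder; the function name `utf8_encode` is a hypothetical helper introduced here, and the ranges and prefixes come from the bit layout table.

```python
def utf8_encode(code_point: int) -> bytes:
    """Encode one Unicode code point to UTF-8 by hand (illustrative helper)."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")

    # Step 1: pick the layout from the code point range.
    if code_point <= 0x7F:
        return bytes([code_point])          # 0xxxxxxx
    if code_point <= 0x7FF:
        n_cont, lead = 1, 0b11000000        # 110xxxxx 10xxxxxx
    elif code_point <= 0xFFFF:
        n_cont, lead = 2, 0b11100000        # 1110xxxx + 2 continuation bytes
    else:
        n_cont, lead = 3, 0b11110000        # 11110xxx + 3 continuation bytes

    # Steps 2 and 3: peel off 6 payload bits at a time, lowest bits first,
    # and put each group behind the 10 continuation prefix.
    out = []
    for _ in range(n_cont):
        out.append(0b10000000 | (code_point & 0b00111111))
        code_point >>= 6

    # The remaining high payload bits go behind the lead-byte prefix.
    out.append(lead | code_point)
    return bytes(reversed(out))


encoded = utf8_encode(0x063E)
print(encoded.hex(" ").upper())                  # D8 BE
print(" ".join(f"{b:08b}" for b in encoded))     # 11011000 10111110
```

Comparing the result with Python's built-in `"\u063e".encode("utf-8")` is a quick way to check the hand-rolled version.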
ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE·U+063E
Character Information
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF8 | D8 BE | 11011000 10111110 |
UTF16 (big Endian) | 06 3E | 00000110 00111110 |
UTF16 (little Endian) | 3E 06 | 00111110 00000110 |
UTF32 (big Endian) | 00 00 06 3E | 00000000 00000000 00000110 00111110 |
UTF32 (little Endian) | 3E 06 00 00 | 00111110 00000110 00000000 00000000 |
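The rows of this table can be reproduced with Python's built-in codecs. The sketch below is only an illustration; the `rows` dictionary and the label strings are names chosen here to mirror the table, while the codec names (`utf-8`, `utf-16-be`, and so on) are standard Python codec identifiers.

```python
ch = "\u063e"  # ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE

# Table label -> Python codec name. The explicit -be/-le codecs emit no BOM.
codecs_by_label = {
    "UTF8": "utf-8",
    "UTF16 (big Endian)": "utf-16-be",
    "UTF16 (little Endian)": "utf-16-le",
    "UTF32 (big Endian)": "utf-32-be",
    "UTF32 (little Endian)": "utf-32-le",
}

rows = {}
for label, codec in codecs_by_label.items():
    data = ch.encode(codec)
    hex_str = data.hex(" ").upper()                 # e.g. "D8 BE"
    bin_str = " ".join(f"{b:08b}" for b in data)    # e.g. "11011000 10111110"
    rows[label] = (hex_str, bin_str)
    print(f"{label} | {hex_str} | {bin_str} |")
```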
Description
The Unicode character U+063E, Arabic Letter Farsi Yeh with Two Dots Above, is a letter of the Arabic script: the Farsi Yeh base form with two dots added above. It is encoded in the Arabic block (U+0600 to U+06FF), which contains the core Arabic letters together with additions for other languages written in Arabic script, such as Persian (Farsi) and Turkic languages. It is described as representing a palatal consonant similar to the English "y" sound in "yes" (IPA /j/), although the exact sound it denotes depends on the orthography that uses it. Encoding the character correctly also matters technically: software that stores, renders, searches, or translates text in these languages needs the right code point to reproduce what the author wrote.
How to type the ؾ symbol on Windows
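Hold Alt and type 1598 (the decimal value of the code point 0x063E) on the numeric keypad. Support for decimal Unicode Alt codes varies by application, so if this does not produce ؾ, copy the character from Character Map instead.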
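The Alt code is simply the code point written in decimal. A short Python check of that conversion, with variable names chosen here for illustration:

```python
code_point = 0x063E          # hexadecimal code point of the character
alt_code = str(code_point)   # decimal digits to type on the numeric keypad
print(alt_code)              # 1598
print(chr(int(alt_code)))    # converts the decimal value back to the character
```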