Step 1: Determine the UTF-8 encoding bit layout
The character ١ has the Unicode code point U+0661. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+0661 to binary:
00000110 01100001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11011001 10100001
ARABIC-INDIC DIGIT ONE·U+0661
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | D9 A1 | 11011001 10100001 |
UTF16 (big Endian) | 06 61 | 00000110 01100001 |
UTF16 (little Endian) | 61 06 | 01100001 00000110 |
UTF32 (big Endian) | 00 00 06 61 | 00000000 00000000 00000110 01100001 |
UTF32 (little Endian) | 61 06 00 00 | 01100001 00000110 00000000 00000000 |
Description
U+0661 is a Unicode character known as ARABIC-INDIC DIGIT ONE. It holds an essential position in Arabic numerals, specifically in the Arabic-Indic script system, which has been widely adopted across various linguistic regions. This digit symbolizes the value of one and is frequently employed in digital texts, calculations, and numerical data representation to express quantities and values accurately. The Arabic-Indic script originated from India in the 6th century AD and has since been adapted for use in other languages. It showcases a rich cultural and historical significance, reflecting the evolution of mathematical notation and the interplay of diverse civilizations throughout human history. In today's digital world, U+0661 contributes to precise communication and data handling across languages that utilize this numeral system, emphasizing the versatility and relevance of the Unicode standard in facilitating global information exchange.
How to type the ١ symbol on Windows
Hold Alt and type 1633 on the numpad. Or use Character Map.