Step 1: Determine the UTF-8 encoding bit layout
The character ઁ has the Unicode code point U+0A81. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+0A81 to binary:
00001010 10000001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100000 10101010 10000001
GUJARATI SIGN CANDRABINDU·U+0A81
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E0 AA 81 | 11100000 10101010 10000001 |
UTF16 (big Endian) | 0A 81 | 00001010 10000001 |
UTF16 (little Endian) | 81 0A | 10000001 00001010 |
UTF32 (big Endian) | 00 00 0A 81 | 00000000 00000000 00001010 10000001 |
UTF32 (little Endian) | 81 0A 00 00 | 10000001 00001010 00000000 00000000 |
Description
U+0A81 Gujarati Sign Candrabindu is a unique character in the Unicode Standard, specifically used in the Gujarati language. It plays a significant role in digital text by enabling accurate representation of the Gujarati script. The character is used to denote a specific syllable structure in this language, which is essential for correct pronunciation and comprehension. In the context of linguistics, U+0A81 Gujarati Sign Candrabindu is an indispensable part of the Gujarati script system. It is used to indicate a particular combination of vowel sounds, helping readers to correctly articulate words. This character also contributes to the rich cultural heritage of the Gujarati language by enabling more precise communication and expression of ideas in this language. From a technical perspective, U+0A81 Gujarati Sign Candrabindu is encoded as part of the Gujarati Extended script block in Unicode, ensuring its correct representation on various digital platforms. Its accurate representation is crucial for ensuring the effective communication and preservation of the Gujarati language in the digital age. In summary, U+0A81 Gujarati Sign Candrabindu is a vital character in the Gujarati script, playing an essential role in digital text by accurately representing specific syllable structures in the language. Its presence contributes to the rich cultural and linguistic context of Gujarati and helps maintain its integrity in the digital world.
How to type the ઁ symbol on Windows
Hold Alt and type 2689 on the numpad. Or use Character Map.