Step 1: Determine the UTF-8 encoding bit layout
The character ഼ has the Unicode code point U+0D3C. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+0D3C to binary:
00001101 00111100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100000 10110100 10111100
MALAYALAM SIGN CIRCULAR VIRAMA·U+0D3C
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E0 B4 BC | 11100000 10110100 10111100 |
UTF16 (big Endian) | 0D 3C | 00001101 00111100 |
UTF16 (little Endian) | 3C 0D | 00111100 00001101 |
UTF32 (big Endian) | 00 00 0D 3C | 00000000 00000000 00001101 00111100 |
UTF32 (little Endian) | 3C 0D 00 00 | 00111100 00001101 00000000 00000000 |
Description
U+0D3C is the Unicode code point for the Malayalam Sign Circular Virama, a unique character in digital text used specifically within the Malayalam script, which is predominantly employed for writing the Malayalam language, spoken primarily in the Indian state of Kerala and by the Malayali people. The Circular Virama serves as a crucial typographic element in this script. Its primary function is to act as an orthographic break between two characters, effectively separating words or breaking down long strings of text into more easily readable segments. This allows for a fluid and natural flow of reading in the Malayalam language, while also maintaining its phonetic structure. Given the significance of this character in Malayalam typography, U+0D3C plays an indispensable role in accurately rendering the written form of the Malayalam language on digital platforms.
How to type the ഼ symbol on Windows
Hold Alt and type 3388 on the numpad. Or use Character Map.