Step 1: Determine the UTF-8 encoding bit layout
The character ⰲ has the Unicode code point U+2C32. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2C32 to binary:
00101100 00110010
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10110000 10110010
GLAGOLITIC SMALL LETTER VEDE·U+2C32
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 B0 B2 | 11100010 10110000 10110010 |
UTF16 (big Endian) | 2C 32 | 00101100 00110010 |
UTF16 (little Endian) | 32 2C | 00110010 00101100 |
UTF32 (big Endian) | 00 00 2C 32 | 00000000 00000000 00101100 00110010 |
UTF32 (little Endian) | 32 2C 00 00 | 00110010 00101100 00000000 00000000 |
Description
The Unicode character U+2C32, known as GLAGOLITIC SMALL LETTER VEDE, holds a significant position in the world of typography and digital text. It is specifically associated with Glagolitic script, an ancient writing system that originated in the 9th century among Slavs living under Byzantine influence. This character, like other Glagolitic letters, represents specific phonetic or semantic elements within the language it serves. In digital text usage, U+2C32 contributes to preserving and promoting the cultural richness of languages that utilize Glagolitic script, such as Old Church Slavonic and Old Bulgarian. It enables accurate representation of these languages in digital platforms and software, supporting linguistic research and preservation efforts. The Glagolitic script holds historical significance as it was one of the first Slavic alphabets developed, predating Cyrillic by over a century. The development of this script reflects the early history of Slavic writing systems and their cultural contexts, showcasing the influence of various civilizations on the Slavic linguistic landscape. Technically, U+2C32 is part of the Glagolitic block of Unicode, which includes 60 characters ranging from U+2C00 to U+2C5F, offering comprehensive coverage of the Glagolitic script. The inclusion of such specialized scripts in Unicode demonstrates the versatility and inclusivity of this coding standard, accommodating a wide range of languages and writing systems. In conclusion, U+2C32 (GLAGOLITIC SMALL LETTER VEDE) is an essential character in digital text representation for Glagolitic script-based languages. Its presence in Unicode supports cultural preservation, linguistic research, and the continued use of these ancient scripts in modern computing environments.
How to type the ⰲ symbol on Windows
Hold Alt and type 11314 on the numpad. Or use Character Map.