Step 1: Determine the UTF-8 encoding bit layout
The character ᗶ has the Unicode code point U+15F6. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+15F6 to binary:
00010101 11110110
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10010111 10110110
CANADIAN SYLLABICS CARRIER KHO·U+15F6
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 97 B6 | 11100001 10010111 10110110 |
UTF16 (big Endian) | 15 F6 | 00010101 11110110 |
UTF16 (little Endian) | F6 15 | 11110110 00010101 |
UTF32 (big Endian) | 00 00 15 F6 | 00000000 00000000 00010101 11110110 |
UTF32 (little Endian) | F6 15 00 00 | 11110110 00010101 00000000 00000000 |
Description
U+15F6 is a Canadian Syllabics Carrier character used in the digital representation of text. It serves as a neutral syllable carrier in the Cree and other Canadian aboriginal syllabic writing systems. This character allows for the encoding of various combinations of syllables, enabling the accurate representation of words in these languages. In a cultural context, it plays an essential role in preserving and promoting Indigenous languages, facilitating literacy and communication among Canadian First Nations communities. In linguistic terms, U+15F6 helps maintain the unique phonological structure of syllabic writing systems, which differ from alphabetic systems by representing syllables rather than individual letters or sounds. Its technical significance lies in its ability to support the encoding of these languages in digital environments, ensuring their preservation and accessibility in the modern world.
How to type the ᗶ symbol on Windows
Hold Alt and type 5622 on the numpad. Or use Character Map.