Step 1: Determine the UTF-8 encoding bit layout
The character ᒊ has the Unicode code point U+148A. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+148A to binary:
00010100 10001010
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10010010 10001010
CANADIAN SYLLABICS CAAI·U+148A
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 92 8A | 11100001 10010010 10001010 |
UTF16 (big Endian) | 14 8A | 00010100 10001010 |
UTF16 (little Endian) | 8A 14 | 10001010 00010100 |
UTF32 (big Endian) | 00 00 14 8A | 00000000 00000000 00010100 10001010 |
UTF32 (little Endian) | 8A 14 00 00 | 10001010 00010100 00000000 00000000 |
Description
U+148A is a Canadian Syllabics character representing the CAAI glyph. In digital text, it serves as a phonetic symbol for the syllable "Caai" in the Cree language, which is widely spoken by Indigenous peoples across Canada. The Canadian Syllabics characters play a crucial role in preserving and promoting indigenous languages by enabling accurate representation of their unique phonology. These glyphs are integral to the development and maintenance of digital resources for educational, linguistic, and cultural purposes. While not widely used outside of Canadian Indigenous communities, U+148A contributes significantly to the rich tapestry of global typography by representing a distinct and valuable linguistic tradition.
How to type the ᒊ symbol on Windows
Hold Alt and type 5258 on the numpad. Or use Character Map.