Step 1: Determine the UTF-8 encoding bit layout
The character ᑤ has the Unicode code point U+1464. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+1464 to binary:
00010100 01100100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10010001 10100100
CANADIAN SYLLABICS WEST-CREE TWAA·U+1464
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 91 A4 | 11100001 10010001 10100100 |
UTF16 (big Endian) | 14 64 | 00010100 01100100 |
UTF16 (little Endian) | 64 14 | 01100100 00010100 |
UTF32 (big Endian) | 00 00 14 64 | 00000000 00000000 00010100 01100100 |
UTF32 (little Endian) | 64 14 00 00 | 01100100 00010100 00000000 00000000 |
Description
U+1464 is the Unicode character code for "CANADIAN SYLLABICS WEST-CREE TWAA" (ᐧ). This character plays a significant role in digital text, specifically within the context of Canadian Aboriginal Syllabics. These symbols are used in written communications across Indigenous communities in Canada, particularly among the Cree and other First Nations peoples. The use of these characters helps preserve linguistic heritage, while also facilitating communication in areas where traditional oral languages are still spoken but not standardized or widely taught. In a technical context, the Canadian Aboriginal Syllabics were designed to be written with a monospaced font on a manual typewriter, which is a unique attribute compared to other writing systems. U+1464, as part of the West-Cree set of characters, contributes to the rich tapestry of Indigenous languages and culture in Canada, while also supporting digital literacy and communications for these communities.
How to type the ᑤ symbol on Windows
Hold Alt and type 5220 on the numpad. Or use Character Map.