Step 1: Determine the UTF-8 encoding bit layout
The character Ī has the Unicode code point U+012A. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+012A to binary:
00000001 00101010
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11000100 10101010
LATIN CAPITAL LETTER I WITH MACRON·U+012A
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | C4 AA | 11000100 10101010 |
UTF16 (big Endian) | 01 2A | 00000001 00101010 |
UTF16 (little Endian) | 2A 01 | 00101010 00000001 |
UTF32 (big Endian) | 00 00 01 2A | 00000000 00000000 00000001 00101010 |
UTF32 (little Endian) | 2A 01 00 00 | 00101010 00000001 00000000 00000000 |
Description
The Unicode character U+012A is the "LATIN CAPITAL LETTER I WITH MACRON." This typographical symbol is primarily used in digital text for various purposes, including transcribing languages that utilize diacritical marks or creating specific phonetic representations. The macron over the capital letter 'I' serves to lengthen the vowel sound, aiding in pronunciation and clarification of speech. While it is most commonly seen in languages like Irish, Cornish, and some dialects of Scottish Gaelic, it can also be used in linguistic research or in creating custom alphabets for specific communication needs. The character's role in digital text highlights the importance of Unicode in facilitating a wide range of language expressions and maintaining accuracy in written communication across different languages and scripts.
How to type the Ī symbol on Windows
Hold Alt and type 0298 on the numpad. Or use Character Map.