Step 1: Determine the UTF-8 encoding bit layout
The character æ has the Unicode code point U+00E6. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of
0x0080
to0x07ff
.
Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format:110xxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+00E6 to binary:
11100110
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11000011 10100110
LATIN SMALL LETTER AE·U+00E6
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | C3 A6 | 11000011 10100110 |
UTF16 (big Endian) | 00 E6 | 00000000 11100110 |
UTF16 (little Endian) | E6 00 | 11100110 00000000 |
UTF32 (big Endian) | 00 00 00 E6 | 00000000 00000000 00000000 11100110 |
UTF32 (little Endian) | E6 00 00 00 | 11100110 00000000 00000000 00000000 |
Description
The Unicode character U+00E6, with the codepoint 00E6 and the char value æ, is a significant element of digital text, particularly in languages such as German, Swedish, Danish, Norwegian, and Icelandic. Representing the letter "AE," this character holds cultural and linguistic importance, reflecting the evolution of alphabets over time. Originally, "æ" was represented by the ligature "ae" in Old English due to its unique pronunciation. Over time, regional variations led to changes in usage, with modern pronunciations varying between "ä" (German) and a separate vowel sound in Scandinavian languages. The character U+00E6 can be found within the Latin-1 Supplement Unicode block, ranging from codepoints 128 to 255. This block is an essential collection of characters used for various text formatting and typography purposes. Its characters contribute to the readability and visual appeal of written content across a wide range of applications.
How to type the æ symbol on Windows
Hold Alt and type 0230 on the numpad. Or use Character Map.