Step 1: Determine the UTF-8 encoding bit layout
The character ᛙ has the Unicode code point U+16D9. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+16D9 to binary:
00010110 11011001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10011011 10011001
RUNIC LETTER SHORT-TWIG-MADR M·U+16D9
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 9B 99 | 11100001 10011011 10011001 |
UTF16 (big Endian) | 16 D9 | 00010110 11011001 |
UTF16 (little Endian) | D9 16 | 11011001 00010110 |
UTF32 (big Endian) | 00 00 16 D9 | 00000000 00000000 00010110 11011001 |
UTF32 (little Endian) | D9 16 00 00 | 11011001 00010110 00000000 00000000 |
Description
U+16D9 (RUNIC LETTER SHORT-TWIG-MADR M) is a character from the Unicode standard, which plays a significant role in digital text representation. This particular character belongs to the Runic alphabet, an ancient script system that has been widely used across various European cultures. The Runic alphabet comprises a set of characters used for writing in Runic languages, such as Old Norse and Old English. In its typical usage, RUNIC LETTER SHORT-TWIG-MADR M represents the sound "m" or a variation thereof. It is employed to convey specific phonetic distinctions within texts that utilize Runic languages. The character's unique form is derived from its runic origins, which feature angular and stylized shapes to facilitate carving on various surfaces like wood, stone, or metal. This character holds cultural significance in the study of historical linguistics and archaeology, as it offers insight into the linguistic development and social practices of pre-modern European societies. The Runic alphabet's use across different languages and regions demonstrates its adaptability and the richness of its heritage. In a technical context, U+16D9 is essential for accurate digital text representation, particularly when working with historical documents or literature that employs Runic scripts.
How to type the ᛙ symbol on Windows
Hold Alt and type 5849 on the numpad. Or use Character Map.