Step 1: Determine the UTF-8 encoding bit layout
The character ᪫ has the Unicode code point U+1AAB. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+1AAB to binary:
00011010 10101011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10101010 10101011
TAI THAM SIGN SATKAANKUU·U+1AAB
᪫
Character Information
Code Point
U+1AAB
HEX
1AAB
Unicode Plane
Basic Multilingual Plane
Category
Other Punctuation
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 AA AB | 11100001 10101010 10101011 |
UTF16 (big Endian) | 1A AB | 00011010 10101011 |
UTF16 (little Endian) | AB 1A | 10101011 00011010 |
UTF32 (big Endian) | 00 00 1A AB | 00000000 00000000 00011010 10101011 |
UTF32 (little Endian) | AB 1A 00 00 | 10101011 00011010 00000000 00000000 |
HTML Entity
᪫
URI Encoded
%E1%AA%AB
Description
The Unicode character U+1AAB, known as the "TAI THAM SIGN SATKAANKUU," is a specialized typographical symbol used primarily in digital text communication. Its typical usage is within the Thai language, where it serves to mark certain linguistic features or structures in written texts. The TAI THAM SIGN SATKAANKUU holds cultural significance, as it is a part of the traditional Thai writing system and contributes to the richness and depth of the language. Although its usage might be less common in global communication due to its specialized function, it remains an important character in digital text for those who require or prefer using the Thai script.
How to type the ᪫ symbol on Windows
Hold Alt and type 6827 on the numpad. Or use Character Map.