Step 1: Determine the UTF-8 encoding bit layout
The character ᨢ has the Unicode code point U+1A22. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+1A22 to binary:
00011010 00100010
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10101000 10100010
TAI THAM LETTER HIGH KXA·U+1A22
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 A8 A2 | 11100001 10101000 10100010 |
UTF16 (big Endian) | 1A 22 | 00011010 00100010 |
UTF16 (little Endian) | 22 1A | 00100010 00011010 |
UTF32 (big Endian) | 00 00 1A 22 | 00000000 00000000 00011010 00100010 |
UTF32 (little Endian) | 22 1A 00 00 | 00100010 00011010 00000000 00000000 |
Description
U+1A22, also known as TAI THAM LETTER HIGH KXA, is a unique Unicode character that holds significant importance in digital text. It belongs to the Thai script, which is used predominantly for written communication in Thailand and among Thai-speaking communities worldwide. The Thai script is an abugida system, wherein consonants have inherent vowels, with an additional vowel signified by a diacritical mark placed above or below the base character. In the context of TAI THAM LETTER HIGH KXA, it serves as a consonant-vowel combination in the Thai language. The letter "k" represents a voiceless velar plosive sound, while the vowel "a" represents an open front unrounded vowel sound. Together, they form the syllable "ka." This character is commonly used for transcribing and typing Thai words and phrases in digital platforms, facilitating communication and preserving linguistic diversity in the online space. It is important to note that the TAI THAM LETTER HIGH KXA character is part of the extended Thai script, which comprises over 40 characters not found in the original Thai script used on traditional typewriters. The inclusion of this character in digital text has broadened the expressive potential of the Thai language and allowed for greater accuracy in transcription. In conclusion, U+1A22 (TAI THAM LETTER HIGH KXA) is an essential character in the Thai script, contributing to the accurate representation of the Thai language in digital text. Its significance lies in its role as a consonant-vowel combination and its place within the extended Thai script, which serves to enrich and expand the linguistic capabilities of the Thai language in the digital realm.
How to type the ᨢ symbol on Windows
Hold Alt and type 6690 on the numpad. Or use Character Map.