Step 1: Determine the UTF-8 encoding bit layout
The character ะ has the Unicode code point U+0E30. In UTF-8, it is encoded using 3 bytes because its code point is in the range 0x0800 to 0xFFFF.
Therefore the UTF-8 encoding carries 16 payload bits within the 24 bits of the three bytes, in the format:
1110xxxx 10xxxxxx 10xxxxxx
where the x positions are the payload bits.
UTF-8 encoding bit layout by code point range
Codepoint Range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |
Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+0E30 to binary:
00001110 00110000
Those are the payload bits.
Step 3: Fill in the bits to match the bit pattern:
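Obtain the final bytes by arranging the payload bits to match the bit layout. Split the 16 payload bits into groups of 4, 6 and 6 bits (0000, 111000, 110000) and place each group after the 1110, 10 and 10 prefixes:
11100000 10111000 10110000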
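The three steps above can be sketched in Python. This is a minimal illustration rather than a production encoder: the function name `utf8_encode_manual` is hypothetical, and the sketch assumes the input is a valid scalar value (it does not reject surrogate code points U+D800 to U+DFFF, which a strict encoder would).

```python
def utf8_encode_manual(code_point: int) -> bytes:
    """Encode one code point by hand, following the bit layout table.

    Hypothetical helper for illustration; surrogates are not rejected.
    """
    if code_point < 0 or code_point > 0x10FFFF:
        raise ValueError("code point outside the Unicode range")
    if code_point <= 0x7F:
        # 0xxxxxxx: 7 payload bits
        return bytes([code_point])
    if code_point <= 0x7FF:
        # 110xxxxx 10xxxxxx: 5 + 6 payload bits
        return bytes([
            0xC0 | (code_point >> 6),
            0x80 | (code_point & 0x3F),
        ])
    if code_point <= 0xFFFF:
        # 1110xxxx 10xxxxxx 10xxxxxx: 4 + 6 + 6 payload bits
        return bytes([
            0xE0 | (code_point >> 12),
            0x80 | ((code_point >> 6) & 0x3F),
            0x80 | (code_point & 0x3F),
        ])
    # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx: 3 + 6 + 6 + 6 payload bits
    return bytes([
        0xF0 | (code_point >> 18),
        0x80 | ((code_point >> 12) & 0x3F),
        0x80 | ((code_point >> 6) & 0x3F),
        0x80 | (code_point & 0x3F),
    ])


encoded = utf8_encode_manual(0x0E30)
print(" ".join(f"{b:02X}" for b in encoded))   # E0 B8 B0
print(" ".join(f"{b:08b}" for b in encoded))   # 11100000 10111000 10110000
```

Comparing the result with Python's built-in `"\u0e30".encode("utf-8")` is a quick way to check the hand-rolled version.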
THAI CHARACTER SARA A·U+0E30
Character Information
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF8 | E0 B8 B0 | 11100000 10111000 10110000 |
UTF16 (big Endian) | 0E 30 | 00001110 00110000 |
UTF16 (little Endian) | 30 0E | 00110000 00001110 |
UTF32 (big Endian) | 00 00 0E 30 | 00000000 00000000 00001110 00110000 |
UTF32 (little Endian) | 30 0E 00 00 | 00110000 00001110 00000000 00000000 |
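The rows of this table can be reproduced with Python's standard codecs. The sketch below is only a cross-check: the dictionary labels mirror the table, the helper name `hex_and_binary` is hypothetical, and the `-be`/`-le` codec variants are chosen because they emit no byte order mark.

```python
char = "\u0e30"  # THAI CHARACTER SARA A

# Table label -> Python codec name (explicit-endian codecs add no BOM)
encodings = {
    "UTF8": "utf-8",
    "UTF16 (big Endian)": "utf-16-be",
    "UTF16 (little Endian)": "utf-16-le",
    "UTF32 (big Endian)": "utf-32-be",
    "UTF32 (little Endian)": "utf-32-le",
}


def hex_and_binary(data: bytes):
    """Return the bytes as space-separated hex and binary strings."""
    hex_str = " ".join(f"{b:02X}" for b in data)
    bin_str = " ".join(f"{b:08b}" for b in data)
    return hex_str, bin_str


rows = {label: hex_and_binary(char.encode(codec))
        for label, codec in encodings.items()}

for label, (hex_str, bin_str) in rows.items():
    print(f"{label} | {hex_str} | {bin_str} |")
```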
Description
The Unicode character U+0E30 represents the "THAI CHARACTER SARA A" (ะ). It is a vowel sign in the Thai script, not one of the 44 Thai consonants. In written Thai, U+0E30 follows a consonant to mark a short "a" vowel, and it also appears as a component of several compound vowel forms, where it marks the vowel as short. The Thai script, which descends from the Old Khmer script, includes consonants, vowel signs, tone marks, and special signs. U+0E30, along with other Thai characters, enables accurate representation of the Thai language in digital text, including websites, documents, and software applications.
How to type the ะ symbol on Windows
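Hold Alt and type 3632 (the decimal value of 0x0E30) on the numpad. This works only in applications that accept decimal Unicode Alt codes, such as Microsoft Word; elsewhere, use Character Map.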