Step 1: Determine the UTF-8 encoding bit layout
The character ⾤ has the Unicode code point U+2FA4. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2FA4 to binary:
00101111 10100100
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10111110 10100100
KANGXI RADICAL DISTINGUISH·U+2FA4
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 BE A4 | 11100010 10111110 10100100 |
UTF16 (big Endian) | 2F A4 | 00101111 10100100 |
UTF16 (little Endian) | A4 2F | 10100100 00101111 |
UTF32 (big Endian) | 00 00 2F A4 | 00000000 00000000 00101111 10100100 |
UTF32 (little Endian) | A4 2F 00 00 | 10100100 00101111 00000000 00000000 |
Description
U+2FA4, also known as the "Kangxi Radical Distinguish," is a crucial character in digital typography and Unicode representation. It serves as an important building block for the composition of Chinese characters, particularly within the context of the Kangxi Dictionary, a comprehensive encyclopedic reference work compiled by the Qing Dynasty scholar Lei Din-Tao. In this capacity, the Kangxi Radical Distinguish character helps to standardize and streamline the process of character identification and classification, thereby promoting consistency and clarity in written communication. The character's role is particularly vital when it comes to the decomposition of complex Chinese characters into their constituent radicals, which are simpler elements that combine to form a complete character. By employing U+2FA4 as a marker for distinguishing different radicals, digital text can more accurately and efficiently represent the intricate relationships between these various components, thus facilitating a smoother reading experience for users familiar with Chinese orthography. In addition to its technical significance, the Kangxi Radical Distinguish character reflects the rich cultural heritage of the Chinese language and its unique system of character classification. The Kangxi Dictionary itself is considered a monumental achievement in Chinese lexicography, having played an instrumental role in shaping modern Chinese orthography and serving as a foundation for subsequent dictionaries and reference works. To summarize, U+2FA4, or the Kangxi Radical Distinguish, is an indispensable component of digital typography and Unicode representation that facilitates accurate character identification, classification, and decomposition in Chinese text. Its significance lies not only in its technical role but also in its connection to the broader cultural and linguistic context of the Chinese language and the Kangxi Dictionary.
How to type the ⾤ symbol on Windows
Hold Alt and type 12196 on the numpad. Or use Character Map.