Step 1: Determine the UTF-8 encoding bit layout
The character ⼁ has the Unicode code point U+2F01. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+2F01 to binary:
00101111 00000001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100010 10111100 10000001
KANGXI RADICAL LINE·U+2F01
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E2 BC 81 | 11100010 10111100 10000001 |
UTF16 (big Endian) | 2F 01 | 00101111 00000001 |
UTF16 (little Endian) | 01 2F | 00000001 00101111 |
UTF32 (big Endian) | 00 00 2F 01 | 00000000 00000000 00101111 00000001 |
UTF32 (little Endian) | 01 2F 00 00 | 00000001 00101111 00000000 00000000 |
Description
The Unicode character U+2F01 represents the "Kangxi Radical Line" in digital text. This symbol is used primarily within the context of Chinese characters, specifically for those that are classified under the Kangxi Dictionary. In traditional Chinese typography, each character is composed of a radical or basic element, which denotes its meaning and position, along with additional strokes that refine the meaning further. The Kangxi Radical Line symbolizes the presence of this base element within a character. This character-level classification system facilitates communication, learning, and organization in the Chinese language. The Kangxi Dictionary, compiled during the reign of Emperor Kangxi (1654-1722) of the Qing Dynasty, is one of the most comprehensive dictionaries ever created for a single language. It contains over 47,000 characters and their meanings, making it an essential reference work for understanding Chinese language and culture. The U+2F01 character plays a vital role in maintaining this linguistic and cultural heritage by identifying the Kangxi radical within digital texts. This ensures that electronic versions of the dictionary and other Chinese literature can be accurately searched and indexed, preserving their accessibility and relevance for future generations.
How to type the ⼁ symbol on Windows
Hold Alt and type 12033 on the numpad. Or use Character Map.