Step 1: Determine the UTF-8 encoding bit layout
The character ㇁ has the Unicode code point U+31C1. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+31C1 to binary:
00110001 11000001
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100011 10000111 10000001
CJK STROKE WG·U+31C1
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E3 87 81 | 11100011 10000111 10000001 |
UTF16 (big Endian) | 31 C1 | 00110001 11000001 |
UTF16 (little Endian) | C1 31 | 11000001 00110001 |
UTF32 (big Endian) | 00 00 31 C1 | 00000000 00000000 00110001 11000001 |
UTF32 (little Endian) | C1 31 00 00 | 11000001 00110001 00000000 00000000 |
Description
The Unicode character U+31C1 is known as CJK STROKE WG (CJK Stroke Wavy Line), which is primarily utilized in digital text for its typographic role within the CJK (Chinese, Japanese, and Korean) scripts. This specific symbol represents a wavy underline or decorative line that can be applied to other characters to indicate emphasis, style, or separator in certain contexts. Although it doesn't have any inherent meaning itself, its usage is heavily dependent on the cultural, linguistic, and technical context where it appears. For example, in Japanese typography, the CJK STROKE WG can be used to enhance the visual appearance of the text, while in Chinese, it might serve as a separator between different sections or paragraphs. Its primary function is aesthetic, but its use can also impact how readers perceive the digital text. Therefore, the Unicode character U+31C1 CJK STROKE WG plays a significant role in digital text presentation, particularly within the realms of CJK scripts.
How to type the ㇁ symbol on Windows
Hold Alt and type 12737 on the numpad. Or use Character Map.