Step 1: Determine the UTF-8 encoding bit layout
The character 〸 has the Unicode code point U+3038. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+3038 to binary:
00110000 00111000
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100011 10000000 10111000
HANGZHOU NUMERAL TEN·U+3038
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E3 80 B8 | 11100011 10000000 10111000 |
UTF16 (big Endian) | 30 38 | 00110000 00111000 |
UTF16 (little Endian) | 38 30 | 00111000 00110000 |
UTF32 (big Endian) | 00 00 30 38 | 00000000 00000000 00110000 00111000 |
UTF32 (little Endian) | 38 30 00 00 | 00111000 00110000 00000000 00000000 |
Description
The Unicode character U+3038 is known as the "Hangzhou Numeral Ten" and plays a significant role in typography and digital text representation. It is part of the Hangzhou numerals system, which was used by merchants in Hangzhou, China during the Song Dynasty to record business transactions. These numerals are visually distinctive and unique, with each character representing a specific value or digit. U+3038, specifically, is used to represent the number ten in this system. In digital text, it allows for the accurate representation of historical texts and documents that use Hangzhou numerals, ensuring the preservation of cultural and historical context. The character's unique design and historical significance make it an important inclusion in modern typography and Unicode standards, providing a valuable resource for scholars, historians, and linguists studying the history and development of number systems.
How to type the 〸 symbol on Windows
Hold Alt and type 12344 on the numpad. Or use Character Map.