Step 1: Determine the UTF-8 encoding bit layout
The character ㊃ has the Unicode code point U+3283. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+3283 to binary:
00110010 10000011
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100011 10001010 10000011
CIRCLED IDEOGRAPH FOUR·U+3283
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E3 8A 83 | 11100011 10001010 10000011 |
UTF16 (big Endian) | 32 83 | 00110010 10000011 |
UTF16 (little Endian) | 83 32 | 10000011 00110010 |
UTF32 (big Endian) | 00 00 32 83 | 00000000 00000000 00110010 10000011 |
UTF32 (little Endian) | 83 32 00 00 | 10000011 00110010 00000000 00000000 |
Description
U+3283, also known as the Circled Ideograph Four, is a unique character in Unicode that serves a specific role in digital text. It belongs to the Miscellaneous Symbols and Pictographs block, which includes characters with various uses and meanings across different cultures and languages. The Circled Ideograph Four is often used as a symbol for representing the number four within a circle, adding an element of visual interest or emphasis in text. This character is particularly useful when referring to the concept of four in a context where it needs to be distinguished from other numerals or symbols. In some instances, it may also represent a specific ideogram or glyph related to the number four in certain cultural or linguistic contexts. Its usage is primarily technical and aesthetic, rather than linguistic or cultural. Overall, U+3283 plays a niche but essential role in digital typography, providing an easily recognizable and unique representation of the number four within a circular form.
How to type the ㊃ symbol on Windows
Hold Alt and type 12931 on the numpad. Or use Character Map.