Step 1: Determine the UTF-8 encoding bit layout
The character ᯇ has the Unicode code point U+1BC7. In UTF-8, it is encoded using 3 bytes because its codepoint is in the range of
0x0800
to0xffff
.
Therefore we know that the UTF-8 encoding will be done over 16 bits within the final 24 bits and that it will have the format:1110xxxx 10xxxxxx 10xxxxxx
Where thex
are the payload bits.UTF-8 Encoding bit layout by codepoint range Codepoint Range Bytes Bit pattern Payload length U+0000 - U+007F 1 0xxxxxxx 7 bits U+0080 - U+07FF 2 110xxxxx 10xxxxxx 11 bits U+0800 - U+FFFF 3 1110xxxx 10xxxxxx 10xxxxxx 16 bits U+10000 - U+10FFFF 4 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 21 bits Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+1BC7 to binary:
00011011 11000111
. Those are the payload bits.Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
11100001 10101111 10000111
BATAK LETTER PA·U+1BC7
Character Information
Character Representations
Click elements to copyEncoding | Hex | Binary |
---|---|---|
UTF8 | E1 AF 87 | 11100001 10101111 10000111 |
UTF16 (big Endian) | 1B C7 | 00011011 11000111 |
UTF16 (little Endian) | C7 1B | 11000111 00011011 |
UTF32 (big Endian) | 00 00 1B C7 | 00000000 00000000 00011011 11000111 |
UTF32 (little Endian) | C7 1B 00 00 | 11000111 00011011 00000000 00000000 |
Description
U+1BC7 is a typographical character, known as "BATAK LETTER PA". It has an essential role in digital text, particularly for those who utilize the Batak language. The Batak script, from which this letter originates, is a family of scripts used to write various Batak languages spoken across Indonesia and Malaysia. U+1BC7, or "BATAK LETTER PA", is part of the Unicode Standard which aims to provide a unique number for every character, symbol, or glyph. The Unicode Standard has been adopted by a vast majority of software developers as it allows for accurate representation and seamless exchange of text between different platforms and programming languages, thus ensuring consistency in textual data. By enabling proper encoding and decoding of characters like "BATAK LETTER PA", the Unicode Standard ensures that digital texts written in the Batak script are correctly displayed and interpreted across various applications and devices, which in turn promotes cultural preservation and linguistic diversity in the digital realm.
How to type the ᯇ symbol on Windows
Hold Alt and type 7111 on the numpad. Or use Character Map.