Step 1: Determine the UTF-8 encoding bit layout
The character Ə has the Unicode code point U+018F. In UTF-8, it is encoded using 2 bytes because its code point is in the range 0x0080 to 0x07FF.
Therefore, the UTF-8 encoding carries 11 payload bits within a total of 16 bits, in the format: 110xxxxx 10xxxxxx
where the x positions are the payload bits.
UTF-8 encoding bit layout by code point range
Codepoint Range | Bytes | Bit pattern | Payload length |
---|---|---|---|
U+0000 - U+007F | 1 | 0xxxxxxx | 7 bits |
U+0080 - U+07FF | 2 | 110xxxxx 10xxxxxx | 11 bits |
U+0800 - U+FFFF | 3 | 1110xxxx 10xxxxxx 10xxxxxx | 16 bits |
U+10000 - U+10FFFF | 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx | 21 bits |
Step 2: Obtain the payload bits
Convert the hexadecimal code point U+018F to binary: 00000001 10001111. The low 11 bits, 00110001111, are the payload bits.
Step 3: Fill in the bits to match the bit pattern
Obtain the final bytes by arranging the payload bits to match the bit layout (the first 5 payload bits go into the lead byte, the remaining 6 into the continuation byte):
11000110 10001111
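The three steps can be sketched in Python as follows. This is a minimal illustration that handles only the 2-byte range; the function name `encode_utf8_two_byte` is made up for this example and is not part of any library. The result is compared against Python's built-in UTF-8 encoder as a sanity check.

```python
def encode_utf8_two_byte(code_point: int) -> bytes:
    """Encode a code point in the range U+0080..U+07FF as 2 UTF-8 bytes.

    Hypothetical helper for illustration only; real code should use str.encode.
    """
    # Step 1: confirm the code point falls in the 2-byte range.
    if not 0x0080 <= code_point <= 0x07FF:
        raise ValueError("code point is outside the 2-byte UTF-8 range")
    # Step 2: the payload is the low 11 bits, split into 5 + 6 bits.
    high5 = code_point >> 6        # upper 5 payload bits
    low6 = code_point & 0b111111   # lower 6 payload bits
    # Step 3: add the 110 prefix to the lead byte and 10 to the continuation byte.
    return bytes([0b11000000 | high5, 0b10000000 | low6])


encoded = encode_utf8_two_byte(0x018F)
bits = " ".join(f"{byte:08b}" for byte in encoded)
print(encoded.hex(" ").upper(), bits)

# Sanity check against the built-in encoder.
assert encoded == "\u018f".encode("utf-8")
```

The shift and mask split the 11-bit payload at the boundary the bit pattern requires, so no string manipulation of binary digits is needed.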
LATIN CAPITAL LETTER SCHWA·U+018F
Character Information
Character Representations
Encoding | Hex | Binary |
---|---|---|
UTF8 | C6 8F | 11000110 10001111 |
UTF16 (big Endian) | 01 8F | 00000001 10001111 |
UTF16 (little Endian) | 8F 01 | 10001111 00000001 |
UTF32 (big Endian) | 00 00 01 8F | 00000000 00000000 00000001 10001111 |
UTF32 (little Endian) | 8F 01 00 00 | 10001111 00000001 00000000 00000000 |
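The values in this table can be reproduced with Python's built-in codecs. The sketch below assumes Python 3.8 or later (for the separator argument of `bytes.hex`). The codec names are Python's own, and the `-be`/`-le` variants are used because they do not emit a byte order mark.

```python
char = "\u018f"  # LATIN CAPITAL LETTER SCHWA

# Python codec names; the explicit-endianness variants omit the BOM.
codecs = ["utf-8", "utf-16-be", "utf-16-le", "utf-32-be", "utf-32-le"]

rows = {}
for codec in codecs:
    data = char.encode(codec)
    rows[codec] = data.hex(" ").upper()
    bits = " ".join(f"{byte:08b}" for byte in data)
    print(f"{codec:<10} | {rows[codec]:<11} | {bits}")
```

Using the plain `utf-16` or `utf-32` codec names instead would prepend a byte order mark, so the output would no longer match the table.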
Description
The Unicode character U+018F is named "LATIN CAPITAL LETTER SCHWA". It is the capital form of the schwa (ə, U+0259), a letter of the Latin script. In linguistic terms, the schwa represents a mid central vowel that is found in many languages, including English, where it typically occurs in unstressed syllables. The letter is part of the regular alphabet of some languages, such as Azerbaijani, and it also appears in linguistic studies, transcriptions, and discussions related to phonetics and phonology. Its inclusion in the Unicode character set allows such text to be written and exchanged accurately in digital form.
How to type the Ə symbol on Windows
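Hold Alt and type 0399 on the numeric keypad (399 is the decimal value of 0x018F). This works only in applications that interpret Alt codes above 255 as Unicode code points. In Microsoft Word, you can instead type 018F and then press Alt+X. Otherwise, use Character Map.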