LATIN SMALL LETTER SHARP S·U+00DF

ß

Character Information

Code Point
U+00DF
HEX
00DF
Unicode Plane
Basic Multilingual Plane
Category
Lowercase Letter

Character Representations

Click elements to copy
EncodingHexBinary
UTF8
C3 9F
11000011 10011111
UTF16 (big Endian)
00 DF
00000000 11011111
UTF16 (little Endian)
DF 00
11011111 00000000
UTF32 (big Endian)
00 00 00 DF
00000000 00000000 00000000 11011111
UTF32 (little Endian)
DF 00 00 00
11011111 00000000 00000000 00000000
HTML Entity
ß
URI Encoded
%C3%9F

Description

The Unicode character U+00DF, represented by the codepoint 00DF and char ß, is a significant component in digital text, primarily used within German language writing systems. Known as the Latin small letter sharp S (LATIN SMALL LETTER SHARP S), it serves as an equivalent to the combination "ss" in other languages, offering an essential typographical distinction. The character's cultural significance stems from its introduction during 19th-century German orthographic reforms aimed at simplifying spelling rules. Although its use may be considered archaic or regional in modern German typography, the sharp s remains crucial for maintaining historical accuracy and linguistic integrity within digital texts, particularly in academic, literary, and international communication contexts. The Latin small letter sharp S resides in the Latin-1 Supplement Unicode block (678), a versatile collection of 256 characters that cater to various text formatting and typography needs. This character set, encompassing characters from 128 to 255, extends the basic Latin character set to accommodate additional symbols like pilcrows (◊) and en dashes (–), enhancing the readability and overall appearance of digital texts across various applications.

How to type the ß symbol on Windows

Hold Alt and type 0223 on the numpad. Or use Character Map.

  1. Step 1: Determine the UTF-8 encoding bit layout

    The character ß has the Unicode code point U+00DF. In UTF-8, it is encoded using 2 bytes because its codepoint is in the range of 0x0080 to 0x07ff.

    Therefore we know that the UTF-8 encoding will be done over 11 bits within the final 16 bits and that it will have the format: 110xxxxx 10xxxxxx
    Where the x are the payload bits.

    UTF-8 Encoding bit layout by codepoint range
    Codepoint RangeBytesBit patternPayload length
    U+0000 - U+007F10xxxxxxx7 bits
    U+0080 - U+07FF2110xxxxx 10xxxxxx11 bits
    U+0800 - U+FFFF31110xxxx 10xxxxxx 10xxxxxx16 bits
    U+10000 - U+10FFFF411110xxx 10xxxxxx 10xxxxxx 10xxxxxx21 bits
  2. Step 2: Obtain the payload bits:

    Convert the hexadecimal code point U+00DF to binary: 11011111. Those are the payload bits.

  3. Step 3: Fill in the bits to match the bit pattern:

    Obtain the final bytes by arranging the paylod bits to match the bit layout:
    11000011 10011111