U+000A <control> - octets.codes

Character Information

Code Point

U+000A

HEX

000A

Unicode Plane

Basic Multilingual Plane

Category

Control

Character Representations

Click elements to copy

Encoding	Hex	Binary
UTF8	0A	00001010
UTF16 (big Endian)	00 0A	00000000 00001010
UTF16 (little Endian)	0A 00	00001010 00000000
UTF32 (big Endian)	00 00 00 0A	00000000 00000000 00000000 00001010
UTF32 (little Endian)	0A 00 00 00	00001010 00000000 00000000 00000000

Encoding

Hex

Binary

UTF8

00001010

UTF16 (big Endian)

00 0A

00000000 00001010

UTF16 (little Endian)

0A 00

00001010 00000000

UTF32 (big Endian)

00 00 00 0A

00000000 00000000 00000000 00001010

UTF32 (little Endian)

0A 00 00 00

00001010 00000000 00000000 00000000

HTML Entity

URI Encoded

%0A

Description

The Unicode character U+000A, known as LINE SEPARATOR (LS), plays a vital role in the organization of digital text. It is primarily used to separate lines in plaintext data, such as in text files or email messages, where line breaks are significant. This includes various applications like PostScript and other page description languages. However, it should not be confused with the CARRIAGE RETURN (U+000D), which returns the cursor to the beginning of the current line on monospaced devices. The LS character is a part of the Unicode standard, which covers an extensive range of characters, symbols, and emojis used in modern text communication across various languages and platforms. This specific character belongs to the Basic Latin Unicode block (U+0000 to U+007F), a foundational component of the Unicode system that includes essential characters for controlling codes and special symbols used in programming languages, text documents, and multiple other applications. In terms of cultural, linguistic, or technical context, the LS character is crucial for ensuring proper formatting in digital texts, as it signifies where line breaks should occur. Its role becomes particularly significant when dealing with multilingual texts, as different languages may have specific requirements regarding line breaks and text alignment. The use of U+000A ensures consistency and readability across various platforms and devices. In summary, the LS character is a vital component in digital text organization, specifically used to separate lines in plaintext data. Its usage is essential for proper formatting in multilingual texts, ensuring consistency and readability across different platforms and devices. The character belongs to the Basic Latin Unicode block (U+0000 to U+007F), a fundamental part of the Unicode system that includes essential characters for controlling codes and special symbols used in various applications.

How to type the symbol on Windows

Hold Alt and type 0010 on the numpad. Or use Character Map.

Step 1: Determine the UTF-8 encoding bit layout

The character has the Unicode code point U+000A. In UTF-8, it is encoded using 1 byte because its codepoint is in the range of 0x0000 to 0x007f.

Therefore we know that the UTF-8 encoding will be done over 7 bits within the final 8 bits and that it will have the format: 0xxxxxxx
Where the x are the payload bits.

UTF-8 Encoding bit layout by codepoint range

Codepoint Range	Bytes	Bit pattern	Payload length
U+0000 - U+007F	1	0xxxxxxx	7 bits
U+0080 - U+07FF	2	110xxxxx 10xxxxxx	11 bits
U+0800 - U+FFFF	3	1110xxxx 10xxxxxx 10xxxxxx	16 bits
U+10000 - U+10FFFF	4	11110xxx 10xxxxxx 10xxxxxx 10xxxxxx	21 bits

Step 2: Obtain the payload bits:
Convert the hexadecimal code point U+000A to binary: 00001010. Those are the payload bits.
Step 3: Fill in the bits to match the bit pattern:
Obtain the final bytes by arranging the paylod bits to match the bit layout:
00001010

<control>·U+000A

Character Information

Character Representations

Description

How to type the symbol on Windows

UTF-8: How to encode (U+000A)

Step 1: Determine the UTF-8 encoding bit layout

Step 2: Obtain the payload bits:

Step 3: Fill in the bits to match the bit pattern:

UTF-16: How to encode (U+000A)