ISO 2047

ISO 2047 (Information processing – Graphical representations for the control characters of the 7-bit coded character set) ISO/IEC 646 describes a graphical representation of the control characters for debugging purposes, such as may be found in the character generator of a computer terminal; it also establishes a two-letter abbreviation of each control character. In addition, RFC 1345 "Character Mnemonics & Character Sets" is cited as the ISO 2047 two-letter abbreviation of the control character. ISO 2047, ECMA-17[1] in Europe, GB/T 3911-1983 in China, that corresponds to KS X 1010[2] in Korea (formerly KS C 5713) has been established as a standard. It was enacted "graphical representation of information exchange capabilities for character" JIS X 0209:1976 (former JIS C 6227) in Japan, and was abolished on January 20, 2010.

The three-letter abbreviation (such as "ESC"), or caret notation (such as "^[") are used much more often than this standard.

Character Table

Sign position Representation of the code table on Name Symbol representation [n 1] Symbol representation
0/0 NUL Null ⎕(U+2395) NU
0/1 TC₁ (SOH) Start of Heading ⌈(U+2308) SH
0/2 TC₂ (STX) Start of Text ⊥(U+22A5) SX
0/3 TC₃ (ETX) End of Text ⌋(U+230B) EX
0/4 TC₄ (EOT) End of Transmission ⌁(U+2301) ET
0/5 TC₅ (ENQ) Enquiry ⊠(U+22A0) EQ
0/6 TC₆ (ACK) Acknowledge ✓(U+2713) AK
0/7 BEL Bell ⍾(U+237E) BL
0/8 FE₀ (BS) Backspace ⌫(U+232B) BS
0/9 FE₁ (HT) Horizontal Tabulation ⪫(U+2AAB) HT
0/10 FE₂ (LF) Line Feed ≡(U+2261) LF
0/11 FE₃ (VT) Vertical Tabulation ⩛(U+2A5B) VT
0/12 FE₄ (FF) Form Feed ↡(U+21A1) FF
0/13 FE₅ (CR) Carriage Return ⪪(U+2AAA) CR
0/14 SO Shift Out ⊗(U+2297) SO
0/15 SI Shift In ⊙(U+2299) SI
1/0 TC₇ (DLE) Data Link Escape ⊟(U+229F) DL
1/1 DC₁ Device Control 1 ◷(U+25F7) D1
1/2 DC₂ Device Control 2 ◶(U+25F6) D2
1/3 DC₃ Device Control 3 ◵(U+25F5) D3
1/4 DC₄ Device Control 4 ◴(U+25F4) D4
1/5 TC₈ (NAK) Negative Acknowledge ⍻(U+237B) NK
1/6 TC₉ (SYN) Synchronization ⎍(U+238D) SY
1/7 TC₁₀ (ETB) End of Transmission Block ⊣(U+22A3) EB
1/8 CAN Cancel ⧖(U+29D6) CN
1/9 EM End of Medium ⍿(U+237F) EM
1/10 SUB Substitute Character ␦(U+2426) SB
1/11 ESC Escape ⊖(U+2296) EC
1/12 IS₄ (FS) File Separator ◰(U+25F0) FS
1/13 IS₃ (GS) Group Separator ◱(U+25F1) GS
1/14 IS₂ (RS) Record Separator ◲(U+25F2) RS
1/15 IS₁ (US) Unit Separator ◳(U+25F3) US
2/0 SP Space △(U+25B3) SP
7/15 DEL Delete ␥(U+2425) DT

Notes

  1. ^ The Unicode code charts specifically cite ISO 2047 for characters U+2301, U+237B, U+237E, U+237F, and U+2426.[3][4]

References

  1. ^ ECMA-17, Graphic Representation of the Control Characters of the ECMA 7-Bit Coded Character Set for Information Interchange (withdrawn)
  2. ^ KS X 1010-2007 Graphical representations control characters for Information interchange
  3. ^ The Unicode Standard, Version 6.2. Miscellaneous Technical. Range: 2300–23FF.
  4. ^ The Unicode Standard, Version 6.2. Control Pictures. Range: 2400–243F.
ASCII

ASCII ( (listen) ASS-kee), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Most modern character-encoding schemes are based on ASCII, although they support many additional characters.

ASCII is the traditional name for the encoding system; the Internet Assigned Numbers Authority (IANA) prefers the updated name US-ASCII, which clarifies that this system was developed in the US and based on the typographical symbols predominantly in use there.ASCII is one of the IEEE milestones.

Control Pictures

Control Pictures is a Unicode block containing characters for graphically representing the C0 control codes, and other control characters.

Control character

In computing and telecommunication, a control character or non-printing character is a code point (a number) in a character set, that does not represent a written symbol. They are used as in-band signaling to cause effects other than the addition of a symbol to the text. All other characters are mainly printing, printable, or graphic characters, except perhaps for the "space" character (see ASCII printable characters).

All entries in the ASCII table below code 32 (technically the C0 control code set) are of this kind, including CR and LF used to separate lines of text. The code 127 (DEL) is also a control character. Extended ASCII sets defined by ISO 8859 added the codes 128 through 159 as control characters, this was primarily done so that if the high bit was stripped it would not change a printing character to a C0 control code, but there have been some assignments here, in particular NEL. This second set is called the C1 set.

These 65 control codes were carried over to Unicode. Unicode added more characters that could be considered controls, but it makes a distinction between these "Formatting characters" (such as the Zero-width non-joiner), and the 65 Control characters.

The Extended Binary Coded Decimal Interchange Code (EBCDIC) character set contains 65 control codes, including all of the ASCII control codes as well as additional codes which are mostly used to control IBM peripherals.

List of International Organization for Standardization standards, 1-4999

This is a list of published International Organization for Standardization (ISO) standards and other deliverables. For a complete and up-to-date list of all the ISO standards, see the ISO catalogue.The standards are protected by copyright and most of them must be purchased. However, about 300 of the standards produced by ISO and IEC's Joint Technical Committee 1 (JTC1) have been made freely and publicly available.

Unicode control characters

Many Unicode control characters are used to control the interpretation or display of text, but these characters themselves have no visual or spatial representation. For example, the null character (U+0000 control characters) is used in C-programming application environments to indicate the end of a string of characters. In this way, these programs only require a single starting memory address for a string (as opposed to a starting address and a length), since the string ends once the program reads the null character.

ISO standards by standard number
1–9999
10000–19999
20000+

This page is based on a Wikipedia article written by authors (here).
Text is available under the CC BY-SA 3.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.