WIPOSTAD (v3.0)

Related Links

Search WIPOSTAD

Shortcuts

Actions

Language: English | Español | Français

Standard ST.31

Version 1.0

RECOMMENDED STANDARD CODED CHARACTER SETS FOR THE EXCHANGE OF MACHINE‑READABLE RECORDS OF PATENT DOCUMENTS
Standard adopted during the 1980’s

TABLE OF CONTENTS


Introduction

1.

This recommended Standard provides character sets for the coding of patent documents for the purpose of exchange on machine‑readable records, e.g., on magnetic tape. The character sets include those characters, numerals and special symbols likely to be encountered during the processing of patent documents.

2.

For the purposes of this recommended Standard the expression “patent document” comprises patents for invention, plant patents, inventors’ certificates, design patents, utility certificates, utility models, documents of addition thereto and published applications therefor.

3.

In compiling this recommended Standard, the following ISO standards were consulted:

  • ISO 6937:  This standard refers to:
    • A basic character set—International reference Version (ISO 646)
    • Tables of accented characters
    • Code extension technique (ISO 2022)
  • ISO 5428:  Greek alphabet
  • ISO 6862:  Mathematical symbols.

Code pages

4.

The character sets are coded by means of a number of Code Pages. Each Code Page has a theoretical capacity of (16x16) 256 positions. In practice the first 64 positions of each Code Page are reserved for control codes, leaving a maximum of 192 positions usable for characters.

5.

The recommended Standard at present comprises the following Code Pages:

  • Code Page 0:  standard Roman alphabet, numerals, punctuation, accented characters
  • Code Page 1:  Greek alphabet and some mathematical symbols
  • Code Page 2:  further mathematical symbols
  • Code Page 3:  Cyrillic character set
  • Code Page 8:  further special print characters
  • Code Page 9:  KANJI character set, where each character is represented by a two‑byte code.

The above Code Pages are fully defined in the Appendices to this Standard.

6.

The character sets will be extended when new or previously unnecessary character sets or symbols become known. Moreover, further Code Pages will be elaborated to cover characters used in alphabets other than those defined in Code Pages 0, 1, 2, 3, 8 and 9.

7.

In the tables accompanying each of the Code Pages, each of the codes provided for are defined by a print of the character concerned together with a description of the character. The code itself is quoted using two characters, the first character being the column of the code table with the second character being the row of the code table.

Code page switching

8.

The mechanism for switching between Code Pages is an escape sequence in the data stream which indicates both that a switch must be performed and gives the identity of the Code Page (nn) required.

The selected Code Page remains active until a subsequent escape sequence is encountered indicating another Code Page.

However, switching to Code Page 9 would signal the start of two‑byte representation. To escape from Code Page 9, the appropriate (two‑byte) escape sequence must be used as follows:

Extended character set

CODE PAGE: 0

CODE CHAR DESCRIPTION CODE CHAR DESCRIPTION
00 1A
01 1B
02 1C
03 1D
04 1E
05 1F
06 20
07 21
08 22
09 23
0A 24
0B 25 LF Line Feed control code
0C NP New Page control code 26
0D CR Carriage Return control code 27 ESC Escape control code
0E 28
0F 29
10 2A
11 2B
12 2C
13 2D
14 2E
15 NL New Line control code 2F
16 30
17 31
18 32
19 33
CODE CHAR DESCRIPTION
00
01
02
03
04
05
06
07
08
09
0A
0B
0C NP New Page control code
0D CR Carriage Return control code
0E
0F
10
11
12
13
14
15 NL New Line control code
16
17
18
19
1A
1B
1C
1D
1E
1F
20
21
22
23
24
25 LF Line Feed control code
26
27 ESC Escape control code
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33

CODE PAGE: 1

CODE CHAR DESCRIPTION CODE CHAR DESCRIPTION
00 1A
01 1B
02 1C
03 1D
04 1E
05 1F
06 20
07 21
08 22
09 23
0A 24
0B 25 LF Line Feed control code
0C NP New Page control code 26
0D CR Carriage Return control code 27 ESC Escape control code
0E 28
0F 29
10 2A
11 2B
12 2C
13 2D
14 2E
15 NL New Line control code 2F
16 30
17 31
18 32
19 33
CODE CHAR DESCRIPTION
00
01
02
03
04
05
06
07
08
09
0A
0B
0C NP New Page control code
0D CR Carriage Return control code
0E
0F
10
11
12
13
14
15 NL New Line control code
16
17
18
19
1A
1B
1C
1D
1E
1F
20
21
22
23
24
25 LF Line Feed control code
26
27 ESC Escape control code
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33

CODE PAGE: 2

CODE CHAR DESCRIPTION CODE CHAR DESCRIPTION
00 1A
01 1B
02 1C
03 1D
04 1E
05 1F
06 20
07 21
08 22
09 23
0A 24
0B 25 LF Line Feed control code
0C NP New Page control code 26
0D CR Carriage Return control code 27 ESC Escape control code
0E 28
0F 29
10 2A
11 2B
12 2C
13 2D
14 2E
15 NL New Line control code 2F
16 30
17 31
18 32
19 33
CODE CHAR DESCRIPTION
00
01
02
03
04
05
06
07
08
09
0A
0B
0C NP New Page control code
0D CR Carriage Return control code
0E
0F
10
11
12
13
14
15 NL New Line control code
16
17
18
19
1A
1B
1C
1D
1E
1F
20
21
22
23
24
25 LF Line Feed control code
26
27 ESC Escape control code
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33

CODE PAGE: 3

(*) Characters of the Cyrillic alphabet which coincide in their representation (but not in the meaning) with the corresponding characters of the Latin alphabet.

CODE CHAR DESCRIPTION CODE CHAR DESCRIPTION
00 1A
01 1B
02 1C
03 1D
04 1E
05 1F
06 20
07 21
08 22
09 23
0A 24
0B 25 LF Line Feed control code
0C NP New Page control code 26
0D CR Carriage Return control code 27 ESC Escape control code
0E 28
0F 29
10 2A
11 2B
12 2C
13 2D
14 2E
15 NL New Line control code 2F
16 30
17 31
18 32
19 33
CODE CHAR DESCRIPTION
00
01
02
03
04
05
06
07
08
09
0A
0B
0C NP New Page control code
0D CR Carriage Return control code
0E
0F
10
11
12
13
14
15 NL New Line control code
16
17
18
19
1A
1B
1C
1D
1E
1F
20
21
22
23
24
25 LF Line Feed control code
26
27 ESC Escape control code
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33

CODE PAGE: 8

CODE CHAR DESCRIPTION CODE CHAR DESCRIPTION
00 1A
01 1B
02 1C
03 1D
04 1E
05 1F
06 20
07 21
08 22
09 23
0A 24
0B 25 LF Line Feed control code
0C NP New Page control code 26
0D CR Carriage Return control code 27 ESC Escape control code
0E 28
0F 29
10 2A
11 2B
12 2C
13 2D
14 2E
15 NL New Line control code 2F
16 30
17 31
18 32
19 33
CODE CHAR DESCRIPTION
00
01
02
03
04
05
06
07
08
09
0A
0B
0C NP New Page control code
0D CR Carriage Return control code
0E
0F
10
11
12
13
14
15 NL New Line control code
16
17
18
19
1A
1B
1C
1D
1E
1F
20
21
22
23
24
25 LF Line Feed control code
26
27 ESC Escape control code
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33

CODE PAGE: 9

[The alphabet and Japanese characters in the JPO 2‑byte character set are based upon JIS 6226‑1979, and other characters are based upon JIS 6226‑1983. The table consists of 187 pages and due to its length is not reproduced here. The International Bureau has a copy available for consultation.]

[End of Standard]