I70 Character Encoding

Identifies a scheme for representing a set of graphical characters with bit patterns.

Type: Identifier (ID)
Length: Min 1 / Max 2
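
The type and length constraints above can be checked mechanically. The following is a minimal sketch, not part of the X12 definition: the helper name `is_valid_i70` is hypothetical, and it assumes the value arrives as a string of plain decimal digits without leading zeros, with valid codes being the integers 1 through 37 listed under Codes.

```python
def is_valid_i70(value: str) -> bool:
    """Return True if value looks like a valid I70 code (hypothetical helper)."""
    # Length constraint from the element definition: min 1, max 2 characters.
    if not 1 <= len(value) <= 2:
        return False
    # Assumption: codes are ASCII decimal digits written without leading zeros.
    if not (value.isascii() and value.isdigit()) or value.startswith("0"):
        return False
    # Codes 1 through 37 are defined in the code list.
    return 1 <= int(value) <= 37
```

Under these assumptions, `"1"` and `"37"` are accepted, while `""`, `"01"`, `"38"`, and `"100"` are rejected.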
Codes
Code Description
1 US-ASCII
Latin, Swahili, Hawaiian and American English without most typographic frills.
2 EBCDIC-US
Extended Binary Coded Decimal Interchange Code, used on IBM mainframes.
3 ISO 646
4 ISO 8859-1
North America, Western Europe, Latin America, the Caribbean, Canada, Africa.
5 ISO 8859-2
Eastern Europe.
6 ISO 8859-5
The Cyrillic alphabet. Bulgarian, Belarusian, Russian and Macedonian.
7 ISO 8859-7
The modern Greek alphabet and mathematical symbols derived from the Greek.
8 ISO 8859-3
SE Europe, Esperanto, miscellaneous others.
9 ISO 8859-4
Scandinavia/Baltics (and others not in ISO-8859-1).
10 ISO 8859-6
The Arabic alphabet.
11 ISO 8859-8
The Hebrew alphabet.
12 ISO 8859-9
The Turkish alphabet. Same as ISO-8859-1 except Turkish characters replace Icelandic.
13 ISO 8859-15
Nordic alphabets. Lappish, Nordic, Eskimo.
14 ISO 2022
15 ISO 2375
16 ISO 10646
17 GB18030
Simplified and traditional Chinese characters.
18 EUC-JP
Encodes the Japanese character set standards JIS X 0208, JIS X 0212, and JIS X 0201.
19 ISO-2022-JP
JIS encodings.
20 ISO-2022-JP-2
Multilingual extension of ISO-2022-JP.
21 ISO-2022-KR
Encodes ASCII and the Korean double-byte character set (KS X 1001).
22 EUC-KR
Extended Unix Code (EUC) for Korean. EUC encodings are used primarily for Japanese, Korean, and simplified Chinese.
23 ISO-2022-CN-EXT
Extends ISO-2022-CN with additional Guobiao standards.
24 ISO-2022-CN
Supports the character sets GB 2312 (for simplified Chinese) and CNS 11643 (for traditional Chinese).
25 Big5
Chinese character encoding method used in Taiwan, Hong Kong, and Macau for traditional Chinese characters.
26 ISO-10646-UCS-2
Universal Character Set. Fixed-length 16 bits (2 bytes). Replaced by UTF-16.
27 ISO-10646-UCS-4
Universal Character Set coded in 4 octets. It is now treated simply as a synonym for UTF-32.
28 SCSU
Standard Compression Scheme for Unicode. Technical Standard for reducing the number of bytes needed to represent Unicode text.
29 UTF-7
Universal Character Set. 7-bit Unicode Transformation Format.
30 UTF-16BE
Universal Character Set. 16-bit Unicode Transformation Format Big Endian Byte Order.
31 UTF-16LE
Universal Character Set. 16-bit Unicode Transformation Format Little Endian Byte Order.
32 UTF-16
Universal Character Set. 16-bit Unicode Transformation Format.
33 UTF-8/CESU-8
Universal Character Set. 8-bit Unicode Transformation Format.
34 UTF-32
Universal Character Set. 32-bit Unicode Transformation Format.
35 UTF-32BE
Universal Character Set. 32-bit Unicode Transformation Format Big Endian Byte Order.
36 UTF-32LE
Universal Character Set. 32-bit Unicode Transformation Format Little Endian Byte Order.
37 BOCU-1
Binary Ordered Compression for Unicode (BOCU) is a MIME-compatible Unicode compression scheme. It combines the wide applicability of UTF-8 with the compactness of the Standard Compression Scheme for Unicode (SCSU).
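
A receiver that needs to act on an I70 value could map each code to a decoder. The following is a minimal sketch using Python's standard-library codec names, not an official mapping. The names `I70_TO_PYTHON_CODEC` and `decode_with_i70` are hypothetical, treating EBCDIC-US as code page 037 is an assumption, and codes without a standard-library codec (for example SCSU and BOCU-1) are deliberately left out.

```python
# Hypothetical mapping from a subset of I70 codes to Python codec names.
I70_TO_PYTHON_CODEC = {
    "1": "ascii",
    "2": "cp037",       # assumption: EBCDIC-US treated as IBM code page 037
    "4": "latin_1",     # ISO 8859-1
    "5": "iso8859_2",
    "6": "iso8859_5",
    "7": "iso8859_7",
    "17": "gb18030",
    "18": "euc_jp",
    "22": "euc_kr",
    "25": "big5",
    "30": "utf_16_be",
    "31": "utf_16_le",
    "32": "utf_16",
    "33": "utf_8",      # plain UTF-8 only; CESU-8 is not handled here
    "34": "utf_32",
    "35": "utf_32_be",
    "36": "utf_32_le",
}


def decode_with_i70(data: bytes, code: str) -> str:
    """Decode data using the codec mapped to the given I70 code."""
    try:
        codec = I70_TO_PYTHON_CODEC[code]
    except KeyError:
        # Unmapped codes are reported rather than guessed at.
        raise ValueError(f"no codec mapped for I70 code {code!r}") from None
    return data.decode(codec)


# The same letter "A" under the two UTF-16 byte orders (codes 30 and 31).
print(decode_with_i70(b"\x00A", "30"))  # prints "A"
print(decode_with_i70(b"A\x00", "31"))  # prints "A"
```

Raising an error for unmapped codes, rather than falling back to a default encoding, keeps a wrong guess from silently corrupting text.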
