Code page 932 (IBM)

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

Code page 932 (abbreviated as CP932, also known by the IANA name Windows-31J) is Microsoft's extension of Shift JIS to include NEC special characters (Row 13), NEC selection of IBM extensions (Rows 89 to 92), and IBM extensions (Rows 115 to 119). The coded character sets are JIS X0201:1997, JIS X0208:1997, and these extensions. Windows-31J is often mistaken for Shift JIS: while similar, the distinction is significant for computer programmers wishing to avoid mojibake, and a good reason to use the world-widely unambiguous UTF-8 instead. The windows-31J name however is IANA's and not recognized by Microsoft, which historically has used shift_jis instead.

In Japanese editions of Windows, this code page is referred to as "ANSI", since it is the operating system's default 8-bit encoding, even though ANSI was not involved in its definition.

Code page 932 contains standard 7-bit ASCII codes, and Japanese characters are indicated by the high bit set to 1. Some code points in this page require a second byte, so characters use either 8 or 16 bits for encoding.

Notice that in the CP932.TXT mapping table linked below, code 0x5C is mapped to U+005C REVERSE SOLIDUS (\). This is often a source of confusion because in many Japanese fonts, this code is displayed as a Yen symbol, which would normally be represented as U+00A5 YEN SIGN (¥) in Unicode. However, on Windows systems, code 0x5C in code page 932 behaves as a reverse solidus (backslash) in all respects other than how it is displayed by some fonts.

External links