1 2001 11 3 1 2 2 2 2.1 ISO/IEC 646 IRV US-ASCII................................. 2 2.2 ISO/IEC 8859 JIS X 0201..................................... 4 2.3 ISO/IEC 2022............................... 6 2.4 JIS............................................... 11 2.5 ISO/IEC 10646-1 Unicode.................................... 12 3 13 3.1 IRV JIS........................................ 13 3.2........................................ 14 3.3 JIS JIS....................................... 14 3.4............................................ 15 4 16 1
1 Unicode 2 2 coded charactor set charactor encoding 2.1 ISO/IEC 646 IRV US-ASCII 1 US-ASCII ASCII American National Standard Code for Information Interchange ANSI American National Standards Institute 1 ANSI X3.4 ANSI ISO International Organization for Standardization: 2 ISO/IEC 646 ASCII US-ASCII ISO/IEC 646 IRV International Reference Version: ISO/IEC 646 7 bit 4 2 1. control charactors : 00H 1FH 2. SP SPACE : 20H 3. graphic characters : 21H 7EH 4. DEL DELETE : 7FH CL SP GL DEL SP DEL GL ISO/IEC 646 1 2 http://www.ansi.org/ http://www.iso.org/ 2
0 1 2 3 4 5 6 7 0 0 @ P p 1! 1 A Q a q 2 " 2 B R b r 3 # 3 C S c s 4 $ 4 D T d t 5 % 5 E U e u 6 & 6 F V f v 7 7 G W g w 8 ( 8 H X h x 9 ) 9 I Y i y A * : J Z j z B + ; K [ k { C, < L \ l D - = M ] m } E. > N ˆ n F /? O o 1: IRV/US-ASCII SP CL GL DEL 2: ISO/IEC 646 3
2.1.1 JIS ISO/IEC 646 JIS X 0201 JIS X 0201 2.2.1 IRV JIS IRV 5CH JIS \ REVERSE SOLIDUS Y= YEN SIGN 7EH TILDE OVER LINE IRV JIS IRV ISO/IEC 646 2.2 ISO/IEC 8859 2.2 ISO/IEC 8859 JIS X 0201 ISO ISO/IEC 646 8 bit ISO/IEC 8859 ISO/IEC ISO/IEC 646 2 3 1. CL CL control charactors : 00H 1FH 2. GL GL graphic characters : 20H 7FH SP,DEL 21H 7EH 3. CR CR control charactors : 80H 9FH 4. GR GR graphic characters : A0H FFH A1H FEH CL GL CR GR 3: ISO/IEC 8859 ISO/IEC 8859 CL,GL IRV GR ISO/IEC 8859 ISO/IEC 8859-1 Latin-1 4
3 ISO/IEC 8859 2.2.1 JIS X 0201 ISO/IEC 8859 JIS X 0201 JIS C 6220 JIS X 0201 GL IRV GR 4 0 1 2 3 4 5 6 7 8 9 A B C D E F 0 0 @ P p 1! 1 A Q a q 2 " 2 B R b r 3 # 3 C S c s 4 $ 4 D T d t 5 % 5 E U e u 6 & 6 F V f v 7 7 G W g w 8 ( 8 H X h x 9 ) 9 I Y i y A * : J Z j z B + ; K [ k { C, < L Y= l D - = M ] m } E. > N ˆ n F /? O o 4: JIS X 0201 8 bit JIS X 0201 7 bit 2.3.2 8 bit 2.2.2 IBM/PC CL,CR 8 bit GL IRV US-ASCII PC 932 JIS X 0201 3 ISO/IEC 8859 2001 3 ISO/IEC 8859-1 10,13 15 5
DOS/Windows 2.3 ISO/IEC 2022 ISO ISO/IEC 2022 ISO/IEC 2022 JIS X 0202 1 ISO/IEC 2022 C0 C1 CL GL CR GR G0 G1 G2 G3 94 96 94 n 96 n 1: ISO/IEC 2022 CL/CR/GL/GR ISO/IEC 2022 JIS X 0202 G0 G3 GL GR 94 96 94 n 96 n ISO/IEC 2022 CR/GR 7 bit C0/C1 6
2.3.5 LS0/LS1 5 G0 GR SI/LS0 G0 GL 0FH SO/LS1 G1 GL 0EH [SI] [SO] LS2 G2 GL 1BH 6EH [ESC] n LS3 G3 GL 1BH 6FH [ESC] o LS1R G1 GR 1BH 7EH [ESC] LS2R G2 GR 1BH 7DH [ESC] } LS3R G3 GR 1BH 7CH [ESC] SS2 G2 GL/GR 8EH 1BH 4EH SS3 G3 GL/GR 8FH 1BH 4FH [SS2] [ESC] N [SS3] [ESC] O 5: ISO/IEC 2022 1 G2 G3 GL GR G0 G3 6 96 96 n G0 94 n 94 G0 1BH 28H Ft [ESC] ( Ft 94 G1 1BH 29H Ft [ESC] ) Ft 94 G2 1BH 2AH Ft [ESC] * Ft 94 G3 1BH 2BH Ft [ESC] + Ft 96 G1 1BH 2DH Ft [ESC] - Ft 96 G2 1BH 2EH Ft [ESC]. Ft 96 G3 1BH 2FH Ft [ESC] / Ft 94 n G0 1BH 24H 28H Ft 1BH 24H Ft 94 n G1 1BH 24H 29H Ft [ESC] $ ) Ft 94 n G2 1BH 24H 2AH Ft [ESC] $ * Ft 94 n G3 1BH 24H 2BH Ft [ESC] $ + Ft 96 n G1 1BH 24H 2DH Ft [ESC] $ - Ft 96 n G2 1BH 24H 2EH Ft [ESC] $. Ft 96 n G3 1BH 24H 2FH Ft [ESC] $ / Ft [ESC] $ ( Ft [ESC] $ Ft 6: ISO/IEC 2022 7
G0 2 ISO/IEC 2022 Ft 7 Ft 94 @ ISO/IEC 646 B H I J ISO/IEC 646 IRV US-ASCII JIS X 0201 JIS X 0201 96 A ISO/IEC 9959-1 GR B ISO/IEC 9959-2 GR 94 n @ JIS X 0208-1978 JIS C 6226 A GB 2312-80 B JIS X 0208-1983,1990,1997 C KS C 5601-1987 D JIS X 0212-1990 O P JIS X 0213 1 JIS X 0213 2 7: ISO/IEC 2022 ISO/IEC 2022 ISO/IEC 2022 JIS X 0201,0208,0212,0213 ISO/IEC 2022 JIS 0201 JIS X 0208,0213 JIS X 0212 JIS X 0208 JIS X 0213 obsolete 2.3.1 : ISO/IEC 8859 ISO/IEC 8859 ISO/IEC 2022 G0 IRV G1 ISO/IEC 8859-x GR G0 GL G1 GR 8
2.3.2 : JIS X 0201 7 bit JIS X 0201 8 bit 2.2.1 JIS X 0201 7 bit ISO/IEC 2022 G0 JIS X 0201 IRV G1 JIS X 0201 G0 GL JIS X 0201 G1 GL G0 G0 JIS X 0208,0212,0213 ISO-2022-JP 2.3.3 ISO/IEC 2022 2.3.3 : ISO-2022-JP ISO-2022-JP RFC1468 4 JIS X 0208 ISO-2022-JP ISO/IEC 2022 RFC2822 5 RFC822 7 bit G0 IRV US-ASCII G0 GL G0 JIS X 0201 1BH 28H 42H [ESC] ( B ISO/IEC 646 IRV US-ASCII 1BH 28H 4AH [ESC] ( J JIS X 0201 1BH 24H 40H [ESC] $ @ JIS C 6226-1978 1BH 24H 42H [ESC] $ B JIS X 0208-1983,1990,1997 0DH 0AH [CR] [LF] IRV IRV JIS X 0201 IRV 4 5 http://www.asahi-net.or.jp/ bd9y-ktu/dtd f/rfc f/rfc1468j.html Internet Message Format: http://www.puni.net/ mimori/rfc/rfc2822.txt 9
RFC1554 6 ISO-2022-JP ISO-2022-JP-2 JIS JIS X 0212 ISO/IEC 8859 G2 Unicode 2.5 JIS X 0213 ISO-2022-JP-3 7 ISO-2022-JP-3 1BH 28H 42H [ESC] ( B ISO/IEC 646 IRV US-ASCII 1BH 24H 42H [ESC] $ B JIS X 0213 1 ISO-2022-JP 1BH 24H 28H 4FH [ESC] $ ( O JIS X 0213 1 1BH 24H 28H 50H [ESC] $ ( P JIS X 0213 2 JIS X 0213 1 8 JIS JIS X 0208 JIS 2 JIS ISO-2022-JP [ESC] $ B 1 ISO-2022-JP-3 RFC 2.3.4 : EUC-JP EUC Extended UNIX Code UNIX EUC EUC EUC-JP EUC ISO/IEC 2022 EUC-JP G0 IRV JIS X 0201 G1 JIS X 0208 G2 JIS X 0201 G3 JIS X 0212 G0 GL G1 GR G2 G3 EUC-JP JIS X 0201 2 byte JIS X 0201 JIS X 0212 UNIX ISO-2022-JP JIS X 0201 6 http://www.asahi-net.or.jp/ bd9y-ktu/dtd f/rfc f/rfc1554j.html 7 http://www.asahi-net.or.jp/ wq6k-yn/code/enc-x0213.html 8 JIS X 0213 JIS X 0208 JIS X 0208 94 94 2 JIS X 0213 2 94 94 3 10
EUC-JP JIS X 0213 EUC-JISX0213 G1 JIS X 0213 1 G3 JIS X 0213 2 2.3.5 ISO/IEC 6429 JIS X 0211 8 C0 00H NUL NULL 08H BS BACKSPACE 09H HT CHARACTER TABULATION 0AH LF LINE FEED 0CH FF FORM FEED 0DH CR CARRIAGE RETURN 0EH SO/LS1 SHIFT-OUT /LOCKING-SHIFT ONE 1 0FH SI/LS0 SHIFT-IN /LOCKING-SHIFT ZERO 0 1BH ESC ESCAPE C1 8EH SS2 SINGLE-SHIFT TWO 2 8FH SS3 SINGLE-SHIFT THREE 3 8: ESC C0 BS â ISO-2022-JP RFC2822 NUL CR/LF HT FF MIME 9 2.4 JIS JIS DOS/Windows Macintosh JIS 16 bit Multi16 CP/M-86 Microsoft MS JIS ISO/IEC 2022 JIS X 0201 8 bit 4 CR GR JIS X 0208 2 byte GL 9 RFC2046 http://www.asahi-net.or.jp/ bd9y-ktu/dtd f/rfc f/rfc2046j.html 11
JIS JIS X 0212 JIS JIS X 0213 JIS X 0212 JIS Shift JISX0213 JIS 2.5 ISO/IEC 10646-1 Unicode 1984 ISO I18N: InternationalizatioN ISO/IEC 10646 Unicode 10 ISO/IEC 10646-1 JIS X 0221 ISO/IEC 10646-1 UCS Universal Multiple-Octet Coded character Set UTF UCS Transfer Format 2.5.1 UCS UCS UCS-2 UCS-4 UCS-4 ISO/IEC 10646-1 31 bit group plane row cell 4 256 256 1 256 1 128 00 00 BMP Basic Multilingual Plane: BMP BMP 16 bit UCS-2 Unicode UCS-2 Unicode ISO/IEC 10646-1 2.5.2 UTF UTF UTF-16 UTF-8 UTF-7 UTF-16 UCS-2 BMP 16 Surrogate Pair UTF-8 UCS ISO/IEC 2022 UCS IRV GL 2 6 byte 96 n GR GL UTF-8 UTF-7 UTF-8 7 bit Base64 10 http://www.unicode.org/ 12
8 bit 3 JIS 3.1 IRV JIS 2.1.1 IRV JIS 2 TILDE OVER LINE JIS X 0201 2 Windows 7EH ISO/IEC 10646-1 ã ISO/IEC 10646-1 \ REVERSE SOLIDUS Y= YEN SIGN C UCS : printf("total: Y=%d.\n", maney); : Total: Y=646. UCS JIS : printf("total: Y=Y=%d.Y=n", maney); : Total: Y=646. JIS IRV : printf("total: \\%d.\n", maney); : Total: \646. JIS ISO-2022-JP-3 JIS JIS Y= JIS UCS/UTF 13
3.2 JIS X 0201 JUNET UNIX 2.3.4 ISO H 7 I ISO-2022-JP JIS X 0208 1 JIS JIS X 0213 JIS X 0201 ISO-2022-JP-3 JIS X 0201 JIS X 0208,0212,0213 ISO/IEC 10646-1 Halfwidth and Fullwidth Forms JIS ISO/IEC 646 / ISO/IEC 10646-1 JIS IRV 3.3 JIS JIS JIS X 0208 1983 1983 JIS JIS C 6226-1978 11 JIS 1983 JIS JIS X 0208,0212,0213 JIS JIS JIS ISO-2022-JP ISO-2022-JP JIS JIS JIS JIS JIS JIS ISO-2022-JP JIS JIS 11 JIS X 0208 JIS C 6226 14
ISO-2022-JP JIS JIS JIS 1BH 24H 42H 1 byte IRV 1BH 28H 42H JIS JIS RFC1468 JIS 3.4 JIS X 0208 JIS C 6226-1978 JIS JIS 2 IBM : IBM JIS C 6226-1978 NEC : NEC IBM 98 JIS C 6226-1978 1 9 15 85 94 2 12 JIS C 6226-1978 JIS JIS JIS X 0213 13 JIS X 0213 UNIX JIS X 0213 ISO-2022-JP-3 ISO/IEC 10646-1 JIS X 0213 14 UTF-8/UTF-7 12 JIS 2 byte 1 byte 85H 88H EBH FFH 13 JIS JIS 14 JIS X 0213 ISO/IEC 10646-1 360 15
4 Unicode CJK http://euc.jp/i18n/charcode.ja.html Unicode http://euc.jp/i18n/ucsnote.ja.html JIS ISO-2022 http://www.d2.dion.ne.jp/ imady/kcode/kcode jis.html Mew : ASCII http://www.mew.org/newsletters/6.html JIS X 0213 http://www.asahi-net.or.jp/ wq6k-yn/code/enc-x0213.html [1]. JIS 64 I / /., 2001. [2].., 2001. [3],.., 1999. 16