(2) Copyright 2006 Kota Abe
( )!?
: This is a pen. 84 104 105 83 (, encode) ( ) 84 104 105 83 This is a pen. (, decode)
Do you know Tom Riddle? Yes!!
ASCII American Standard Code for Information Interchange 1963 ANSI (American National Standards Institution) 7 7 128 0x00 0x7F 8 (MSB) 0
ASCII! " # $ % & ' ( ) * +, -. / 0 1 2 3 4 5 6 7 8 9 : ; < = >? @ ABCDEFGH I J K L M N O P Q R S T U V W X Y Z [ \ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { } ~ 0x20 ( )
0x09 0x0a 0x0d 0x1b (Horizontal Tabulation, HT) ( Tab ) (Line Feed, LF) ( ) (Carriage Return, CR) ( ) (Escape, ESC) ( )
Windows CR + LF (0x0d + 0x0a) + ( ) UNIX (Linux, MacOS X ) LF (0x0a)
JIS X 0201 Japanese Industrial Standard 7 8 ASCII 0x5c ( ) ( ) Windows Windows 0x7e ( ) ( ) 2 :
JIS (JIS X 0208) (1) 7 8 2 7000 1 2
JIS (JIS X 0208) (2) 2 1 1 + 2 0x21 0x7e 1978 78JIS ( JIS) 1983 83JIS ( JIS) : http://www.infonet.co.jp/ueyama/ip/ binary/x0208txt.html
JIS ( )
JIS X 0212 ( ),, JIS X 0213 (JIS2000) 7 8 2 3, 4 (4344 ) JIS X 0208 〴
JIS X 0201 JIS X 0208
JIS ASCII : or ISO-2022-JP Shift_JIS, EUC-JP!
ISO-2022-JP International Standardization Organization JIS ISO-2022-JP JIS ASCII 0x21 0x7e ASCII JIS
ASCII JIS X 0208 ASCII JIS X 0208 ASCII ASCII JIS X 0201 0x20 0x7e JIS X 0208(78 ) JIS X 0208(83 ) ESC ( B ESC ( J ESC $ @ ESC $ B
(kwrite) ABC DEF jis7 khexedit
Shift_JIS JIS X 0201 JIS X 0201 0x81 0x9F, 0xE0 0xEF Windows MS ( ) : http://www.rtpro.yamaha.co.jp/rt/docs/mi sc/kanji-sjis.html
EUC-JP EUC = Extended UNIX Code ( UNIX ) EUC-JP( EUC) EUC(EUC-KR) UNIX OS ASCII : http://www.rtpro.yamaha.co.jp/rt/docs/mi sc/kanji-euc.html
Kwrite Shift_JIS, EUC-JP khexedit
: ASCII JIS X 0201 ( ) : JIS X 0208 ( ) ASCII JIS
Windows ( NEC ) Macintosh ( )( )( ) i Web
Unicode(1) EUC-KR GB18030 BIG5 TSCII ISO-8859-1!
Unicode(2) Microsoft, Apple, Sun Microsystems, etc. Unicode Consortium http://www.unicode.org/ Windows2000 XP, MacOS X Unicode Java Unicode Unicode
Unicode(3) http://www.unicode.org/charts/
Unicode(4)
Unicode(5) ( )16 ( 65536 ) U+261A JIS CJK (Chinese-Japanese-Korean) JIS?
Unicode(6) Windows2000 XP : MS : SimSun : Gulim
Unicode(7) Unicode UTF-8 1 6 ASCII 1 UTF-16 ( )2
etc. kwrite,, etc. khexedit, etc.
Web <meta> Meta Meta <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=iso-2022-jp > <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=euc-jp > <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=shift_jis >
7 ISO-2022-JP Date: Thu, 09 Oct 2003 12:01:22 +0900 From: Kota Abe <k-abe@media.osaka-cu.ac.jp> Content-Type: text/plain; charset=iso-2022-jp Content-Transfer-Encoding: 7bit
etc.
Base64 6 3 4 6 0 63 0 63 (64 ) 4/3
http://internet.watch.impress.co.jp/ www/column/ogata/index.htm http://euc.jp/i18n/charcode.ja.html 2003