1 2 (2) l ASCIIJISUnicode ISO-2022-JP, Shift_JIS, EUC-JP Web l Copyright 2006-2018 Kota Abe 2018/06/12 3 4 l ()!?
5 6 l : This is a pen. 84 104 105 83 This is a pen. (, encode) () (, decode) l 41 42 43 : A B C B 7 8 l! l l Do you know Tom Riddle? Yes! = l ASCII JIS X 0201 JIS X 0208 (JIS) Unicode
ASCII 9 ASCII l American Standard Code for Information Interchange () l 1963, ASA (ANSI) 0x20 ( ) 10 American Standard Association () American National Standards Institution () l 7 7 = 128 0x000x7F 80 l http://www.me w.org/ 0x7F ( ) 11 12 l 0x09 (Horizontal Tabulation, HT) (Tab) l 0x0a (Line Feed, LF) () l 0x0d (Carriage Return, CR)() l 0x1b (Escape, ESC) () l WindowsUNIX Windows: CR + LF (=0x0d + 0x0a) (2) UNIX (Linux, MacOS X ): LF (=0x0a) (1) l ABC DEF 012 l 16(0xED) ASCII
13 l ASCII (16) ASCII 78 3d 32 33 0a 79 3d 78 2b 35 39 0a? (Windows, UNIX) JIS X 0201 14 l JIS: (Japanese Industrial Standard) l 78 (1969) l ASCII 0x5c () () 0x7e ()() ()()2 : JIS X 0201 15 JIS(JIS X 0208) (1) 16 l 782 (1978) l l JISJIS 7000 l 12 ASCII
JIS(JIS X 0208) (2) 17 JIS() 18 l 21 (1!) 1+2 0x210x7e 0x3021 = l 1978 78JIS (JIS) 1983 83JIS (JIS) l etc. l JIS X 0208 (+ ARIB) 19 l JIS X 0213 (JIS2000, JIS2004) 782 JIS X 0208 3, 4 (4300) 11,233 () WindowsVistaJIS2004 MSMS 20 l ()? ) l JIS X 0201 JIS X 0208 ()
21 22 l JISASCII l : or l ISO-2022-JP JIS l Shift_JIS, EUC-JP JISASCII Shift_JIS (SJIS): Windows, MacOS X EUC-JP: UNIX ASCII ASCII JIS X 0208 ASCII JIS X 0208 JIS X 02010x200x7e JIS X 0208(78) JIS X 0208(83) ASCII ESC ( B ESC ( J ESC $ @ ESC $ B ESC=0x1b 23 l Shift_JIS 82 b2 96 bc 93 9a 21 21 : 10x200x7fASCII ( JIS X 0201) l 0xED 24 l : ASCII JIS X 0201 () : JIS X 0208 () l ASCIIJIS l
25 Unicode (1) 26 l Windows ⅠⅡⅢⅣⅤ (JIS2000) (Unicode6) l Windows: l Web l EUC-KR GB18030 BIG5 TIS-620 ISO-8859-1 l! Unicode (2) 27 Unicode () 28 l! Unicode Consortium http://www.unicode.org/ Microsoft, Apple, Oracle, etc. l Windows, OSX Unicode Web Unicode Unicode http://www.unicode.org/charts/
Unicode 29 Unicode 30 l 16 U+261A ( ) 21 () U+0000 U+10FFFF l JIS l CJK (Chinese-Japanese-Korean) JIS ()() Unicode emoji 31 l Unicode (emoji) Google (Unicode6, 2010/10) macos () Unicode emoji 32 l l l Unicode 85 /
Unicode 33 l Unicode UTF-8 14 U+0000~U+007F (ASCII)1 U+0080~U+07FF 2 U+0800~U+10FFFF 3 () UTF-16 2 U+100004 34 l Unicode U+XXXX 35 Web 36 l <meta> l Meta Meta <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=iso-2022-jp > <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=euc-jp > <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=shift_jis > <meta HTTP-EQUIV= Content-Type CONTENT= text/html; charset=utf-8 >
Web (2) 37 38 l HTMLUnicode U+3042 HTML あ あ 10 16 Unicode HTML l 7 l l ISO-2022-JP l Date: Thu, 09 Oct 2003 12:01:22 +0900 From: Kota Abe <k-abe@media.osaka-cu.ac.jp> Content-Type: text/plain; charset=iso-2022-jp Content-Transfer-Encoding: 7bit Excel 39 40 l l code() ASCIIJIS code( A ) = 65 char() ASCIIJIS char(65) = A 16 hex2dec() 1610 hex2dec( 10 )=16 dec2hex() 1016 dec2hex(10)= A l l l ASCII l l ISO-2022-JP l l Unicode