corpus.indd



Similar documents
1 // BK // BK 1997 PM // BK 2003 // BK DHogrefe Publishing 2002 PM // BK 2000 WR // BK 1986

Modal Phrase MP because but 2 IP Inflection Phrase IP as long as if IP 3 VP Verb Phrase VP while before [ MP MP [ IP IP [ VP VP ]]] [ MP [ IP [ VP ]]]

untitled

-like BCCWJ CD-ROM CiNii NII BCCWJ BCCWJ


1 2 Sample Sample Sample 3 1

Powered by TCPDF ( Title 第 11 講 : フィッシャー統計学 II Sub Title Author 石川, 史郎 (Ishikawa, Shiro) Publisher Publication year 2018 Jtitle コペンハーゲン解

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹

untitled

MOMW_I_,II 利用ガイド.PDF


コーパスに基づく言語学教育研究報告 8

紀要No.9_006王_CS.indd

スライド 1

Microsoft Word - H19_活動報告書案/広報研究会.doc

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

2

大学等における社会人の受け入れ状況調査

untitled

自然言語処理24_705

11_寄稿論文_李_再校.mcd

36 Theoretical and Applied Linguistics at Kobe Shoin No. 20, 2017 : Key Words: syntactic compound verbs, lexical compound verbs, aspectual compound ve

untitled

81_mediaguide_07

([ ]!) name1 name2 : [Name]! name SuperSQL,,,,,,, (@) < >@{ < > } =,,., 200,., TFE,, 1 2.,, 4, 3.,,,, Web EGG [5] SSVisual [6], Java SSedit( ss


pp Excel Excel Excel Microsoft Excel 2015 OS Windows7 Excel2010(Microsoft Office2010) Office Excel 2 Excel 33

訪問看護ステーションにおける安全性及び安定的なサービス提供の確保に関する調査研究事業報告書

untitled


IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

untitled

LWW EJ on Ovid LWW Ovid Online (Ovid Web Gateway) Ovid Online LWW tutorial Ovid Online Refresh Ovid Online LWW Ovid Medline, Cinahl, EBMR, Ovid

自然言語処理21_249

Powered by TCPDF ( Title 明治以前日本水害史年表 Sub Title A chronological table of flood disasters before Meiji era in Japan Author 高木, 勇夫 (Takagi,

長崎県消費生活審議会

pp DC 2,

‚å−w…p…u



(2008) JUMAN *1 (, 2000) google MeCab *2 KH coder TinyTextMiner KNP(, 2000) google cabocha(, 2001) JUMAN MeCab *1 *2 h


pwd

2016

270万回再生レポート

大学における原価計算教育の現状と課題

matsuda.dvi

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing


II III I ~ 2 ~

中堅中小企業向け秘密保持マニュアル


PR映画-1

- 2 -


1 (1) (2)

: Name, Tel name tel (! ) name : Name! Tel tel ( % ) 3. HTML. : Name % Tel name tel 2. 2,., [ ]!, [ ]!, [ ]!,. [! [, ]! ]!,,. ( [ ], ),. : [Name], nam


ACS電子ジャーナル利用マニュアル

Excelfl—‘ãŁª’Í-flO“Z

untitled

AcVBA

Web Web Web Web Web, i

1 2 2


ii II Web Web HTML CSS PHP MySQL Web Web CSS JavaScript Web SQL Web

% 15.8% 14.8% 15.0% 16.0% 16.5% 0.5% 16.1% 15.2% 16.9% 15.7% 17.1% 18.6% 0.4% 21.4% 15.8% 14.8


新入_本文.smd

I II III IV A B C V.

11/’X›ª/’ÓŠ¹

PowerPoint Presentation

A Study of a Change in Japanese Public Relations (2) Mari Mishima PR PR

報告書.PDF

橡第4回行財政改革懇話会会議録.PDF

2 : Open Clip Art Library [4] Microsoft Office PowerPoint Web PowerPoint 2 Yahoo! Web [5] SlideShare Yahoo! Web Yahoo! Web

manual.dvi

Admissions Assistance Office

eBook白書_日本の研究者の声_A4PDF用_ クレジット

1

1

URL

1 3 [1] [2, 3] WWW 2.1 WWW WWW DjVu 3 ( 1) 2 DjVu DjVu DjVu[2] 16 ( ) http

短大29号.indd

p6-18/村松様

IPSJ SIG Technical Report Vol.2014-CLE-12 No /1/31 EFL 1,a) 1 EFL(English as a Foreign Language) EFL 1. [1] EFL (English as a Foreign Language)

計量国語学 アーカイブ ID KK 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as


MS Access ¤λȤ¤˽

LWW EJ on Ovid LWW Ovid Online (Ovid Web Gateway) Ovid Online LWW tutorial Ovid Online Refresh Ovid Online LWW Ovid Medline, Cinahl, EBMR, Ovid

09‘o’–


本文/YAZ325T

fiš„v8.dvi


AP_12_15_yonezawa.indd


すぐに使える!Essbase キューブ開発テクニック集


Windowsユーザーの為のOracle Database セキュリティ入門

Transcription:

22 JC-D-10-02 23 2 c 2011 21

1 I BCCWJ 3 1 BCCWJ 5 1.1 BCCWJ 3..................... 5 1.2 BCCWJ 2...................... 6 2 3 SC 7 2.1 SC SC............. 7 2.1.1 SC SC................... 7 2.1.2...................... 8 2.1.3.......................... 8 2.1.4....................... 9 2.2 SC.................. 21 2.2.1 SC........................ 21 2.2.2.......................... 21 3 23 3.1..................... 23 3.2 SC.................................. 24 3.3 SC.................................. 26 3.4 SC.................................. 28 3.5 SC................................. 30 3.6 SC................................ 32 3.7 SC............................... 34 3.8 SC............................... 36 3.9 SC........................... 38 3.10 SC Yahoo!........................... 40 3.11 SC Yahoo!........................... 42

3.12 SC................................ 44 3.13 SC................................ 46 3.14 SC............................ 48 II 51 4 BCCWJ 53 4.1....................... 53 4.2............................ 53 5 Bibliography.txt 55 5.1............................... 55 5.2............................... 57 5.2.1 ID................................... 57 5.2.2................................... 62 5.2.3..................................... 62 5.2.4..................................... 63 5.2.5................................... 64 5.2.6.................................... 64 5.2.7.................................... 65 5.2.8 ISBN.................................... 65 5.2.9..................................... 65 5.2.10................................... 66 5.2.11 (1) (4).............................. 66 5.2.12 ID................................. 73 5.3................................ 74 5.3.1...................... 74 5.3.2...................... 77 5.3.3...................... 79 5.3.4...................... 80 5.3.5 Yahoo!................. 81 5.3.6 Yahoo!................. 84 5.3.7...................... 90 5.3.8.................. 91

6 Sample.txt 93 6.1............................. 93 6.2............................. 94 6.2.1 ID................................. 94 6.2.2 ID................................... 100 6.2.3........................ 100 6.2.4......................... 101 7 Directory.txt 103 7.1................................ 103 7.2................................ 103 7.2.1 ID................................... 103 7.2.2..................................... 104 7.2.3..................................... 104 7.2.4..................................... 104 8 Sample author.txt 105 8.1............................. 105 8.2........................ 105 8.2.1 ID................................. 105 8.2.2 ID................................... 106 9 107 9.1............................ 107 9.2............................ 110 III 111 10 113

1 2006 Balanced Corpus of Contemporary Written Japanese; BCCWJ5 4 BCCWJ SSG; 2006 BCCWJ 3 3 2011 1 I BCCWJ II III 3

2 2006 9 BCCWJ

I BCCWJ

5 1 BCCWJ 1.1 BCCWJ 3 BCCWJ 3 SC BCCWJ 1.1 1.1: BCCWJ SC SC 2001 2005 3,500 5 SC SC 1986 2005 20 3,000 SC SC SC SC Yahoo!Yahoo! 3,500

6 1 BCCWJ 1.2 BCCWJ 2 BCCWJ 2 2 1 1,000 1 1 1 3 SC SC SC SC SC

7 2 3SC 2.1 SC SC 2.1.1 SC SC BCCWJ SC SC SC 2001 2005 65,471,677,099 SC 1986 2005 13 47,877,656,072 1 1,000 SC 1,000 2.1 3,900 3,000 1,000 1 1.7 SC 3,500 SC 3,000 SC 3,500 BCCWJ 1

8 2 3 SC 2.1: SC SC SC SC 12,604 7,414,118 28,915,059 2,730 1,605,882 4,817,647 1,666 980,000 980,000 17,000 10,000,000 34,712,706 SC 12,604 7,414,118 28,915,059 2.1.2 2006 5 4,534 3,873 980 SC 17,000 SC 12,604 80% 2.1.3 2010 5 2.1 2.2 SC 89.0% 91.0% 89.4% SC 1,000 89.3%

2.1. SC SC 9 2.2: SC SC SC SC 11,212 6,595,294 29,541,361 (89.0%) (89.0%) (102.2%) 2,483 1,460,588 5,687,485 (91.0%) (90.9%) (118.0%) 1,490 876,471 864,364 (89.4%) (89.4%) (88.1%) 15,185 8,932,353 36,093,211 (89.3%) (89.3%) (104.0%) SC 11,242 6,612,941 30,053,412 (89.2%) (89.2%) (103.9%) 893 SC 89.2% SC 102.2% 118.0% 88.1% SC 103.9% SC SC 2.3 2.4 S SC 2001 2005 5 SC 1986 2005 20 5 4 2.5 2.13 2.1.4 2.2 2.2

10 2 3 SC 2.3: SC S S S S S S S S 0. 425 250,000 2.5% 3,900 975,000 363 213,529 2.4% 3,902 833,197 85.4% 1. 674 396,471 4.0% 3,900 1,546,235 610 358,824 4.0% 4,155 1,490,930 90.5% 2. 1,117 657,059 6.6% 3,900 2,562,529 926 544,706 6.1% 4,493 2,447,545 82.9% 3. 3,222 1,895,294 19.0% 3,900 7,391,647 2,721 1,600,588 17.9% 4,495 7,194,570 84.5% 4. 1,316 774,118 7.7% 3,900 3,019,059 1,119 658,235 7.4% 4,021 2,646,734 85.0% 5. 1,199 705,294 7.1% 3,900 2,750,647 1,008 592,941 6.6% 4,127 2,447,023 84.1% 6. 570 335,294 3.4% 3,900 1,307,647 480 282,353 3.2% 4,366 1,232,742 84.2% 7. 846 497,647 5.0% 3,900 1,940,824 728 428,235 4.8% 4,225 1,809,129 86.1% 8. 231 135,882 1.4% 3,900 529,941 198 116,471 1.3% 4,001 466,008 85.7% 9. 2,426 1,427,059 14.3% 3,900 5,565,529 2,557 1,504,118 16.8% 5,070 7,625,880 105.4% n. 578 340,000 3.4% 3,900 1,326,000 502 295,294 3.3% 4,564 1,347,602 86.9% 12,604 7,414,118 74.1% 28,915,059 11,212 6,595,294 73.8% 29,541,361 89.0% 1. 1,927 1,133,529 11.3% 3,000 3,400,588 1,786 1,050,588 11.8% 3,914 4,111,719 92.7% 2. 228 134,118 1.3% 3,000 402,353 193 113,529 1.3% 4,163 472,600 84.6% 3. 119 70,000 0.7% 3,000 210,000 114 67,059 0.8% 3,105 208,197 95.8% 4. 29 17,059 0.2% 3,000 51,176 25 14,706 0.2% 2,258 33,200 86.2% 5. 381 224,118 2.2% 3,000 672,353 323 190,000 2.1% 4,159 790,200 84.8% 6. 47 27,647 0.3% 3,000 82,941 42 24,706 0.3% 2,897 71,569 89.4% 2,730 1,606,471 16.1% 4,819,412 2,483 1,460,588 16.4% 5,687,485 91.0% 628 369,412 3.7% 1,000 369,412 550 323,529 3.6% 1,069 345,956 87.6% 337 198,235 2.0% 1,000 198,235 305 179,412 2.0% 903 162,057 90.5% 702 412,941 4.1% 1,000 412,941 635 373,529 4.2% 954 356,351 90.5% 1,666 980,588 9.8% 980,588 1,490 876,471 9.8% 864,364 89.4% 17,000 10,000,000 100% 34,715,059 15,185 8,932,353 100% 36,093,211 89.3%

2.1. SC SC 11 2.4: SC S S S S S S S S 0. 263 154,706 2.1% 3,900 603,353 249 146,471 2.2% 4,108 601,669 94.7% 1. 617 362,941 4.9% 3,900 1,415,471 560 329,412 5.0% 4,452 1,466,585 90.8% 2. 1,321 777,059 10.5% 3,900 3,030,529 1,133 666,471 10.1% 4,587 3,056,778 85.8% 3. 2,356 1,385,882 18.7% 3,900 5,404,941 2,195 1,291,176 19.5% 4,427 5,716,463 93.2% 4. 797 468,824 6.3% 3,900 1,828,412 663 390,000 5.9% 4,315 1,682,878 83.2% 5. 828 487,059 6.6% 3,900 1,899,529 690 405,882 6.1% 3,983 1,616,570 83.3% 6. 444 261,176 3.5% 3,900 1,018,588 380 223,529 3.4% 4,274 955,392 85.6% 7. 1,070 629,412 8.5% 3,900 2,454,706 897 527,647 8.0% 4,107 2,167,036 83.8% 8. 252 148,235 2.0% 3,900 578,118 217 127,647 1.9% 3,348 427,326 86.1% 9. 4,076 2,397,647 32.3% 3,900 9,350,824 3,765 2,214,706 33.5% 5,063 11,212,003 92.4% n. 583 342,941 4.6% 3,900 1,337,471 493 290,000 4.4% 3,968 1,150,711 84.6% 12,607 7,415,882 100% 28,921,941 11,242 6,612,941 100% 30,053,412 89.2%

12 2 3 SC 2.5: SC 2001 S S S S S S S S 0. 99 58,235 0.6% 3,900 227,118 83 48,824 0.5% 3,902 190,511 83.8% 1. 134 78,824 0.8% 3,900 307,412 116 68,235 0.8% 4,155 283,521 86.6% 2. 244 143,529 1.4% 3,900 559,765 203 119,412 1.3% 4,493 536,557 83.2% 3. 659 387,647 3.9% 3,900 1,511,824 557 327,647 3.7% 4,495 1,472,758 84.5% 4. 249 146,471 1.5% 3,900 571,235 211 124,118 1.4% 4,021 499,071 84.7% 5. 280 164,706 1.6% 3,900 642,353 234 137,647 1.5% 4,127 568,059 83.6% 6. 126 74,118 0.7% 3,900 289,059 108 63,529 0.7% 4,366 277,367 85.7% 7. 177 104,118 1.0% 3,900 406,059 150 88,235 1.0% 4,225 372,760 84.7% 8. 58 34,118 0.3% 3,900 133,059 52 30,588 0.3% 4,001 122,386 89.7% 9. 460 270,588 2.7% 3,900 1,055,294 470 276,471 3.1% 5,070 1,401,707 102.2% n. 67 39,412 0.4% 3,900 153,706 62 36,471 0.4% 4,564 166,437 92.5% 2,553 1,501,765 15.0% 5,856,882 2,246 1,321,176 14.8% 5,891,134 88.0% 1. 371 202,941 2.0% 3,000 608,824 345 202,941 2.3% 3,914 794,257 93.0% 2. 47 27,059 0.3% 3,000 81,176 46 27,059 0.3% 4,163 112,640 97.9% 3. 23 14,706 0.1% 3,000 44,118 25 14,706 0.2% 3,105 45,657 108.7% 4. 6 2,941 0.0% 3,000 8,824 5 2,941 0.0% 2,258 6,640 83.3% 5. 91 35,294 0.4% 3,000 105,882 60 35,294 0.4% 4,159 146,786 65.9% 6. 9 2,353 0.0% 3,000 7,059 4 2,353 0.0% 2,897 6,816 44.4% 547 285,294 2.9% 855,882 485 285,294 3.2% 1,112,797 88.7% 126 74,118 0.7% 1,000 74,118 110 64,706 0.7% 1,069 69,191 87.3% 67 39,412 0.4% 1,000 39,412 61 35,882 0.4% 903 32,411 91.0% 140 82,353 0.8% 1,000 82,353 128 75,294 0.8% 954 71,831 91.4% 333 195,882 2.0% 195,882 299 175,882 2.0% 173,434 89.8%

2.1. SC SC 13 2.6: SC 2002 S S S S S S S S 0. 94 55,294 0.6% 3,900 215,647 82 48,235 0.5% 3,902 188,215 87.2% 1. 139 81,765 0.8% 3,900 318,882 123 72,353 0.8% 4,155 300,630 88.5% 2. 223 131,176 1.3% 3,900 511,588 185 108,824 1.2% 4,493 488,980 83.0% 3. 662 389,412 3.9% 3,900 1,518,706 569 334,706 3.7% 4,495 1,504,487 86.0% 4. 263 154,706 1.5% 3,900 603,353 223 131,176 1.5% 4,021 527,455 84.8% 5. 259 152,353 1.5% 3,900 594,176 219 128,824 1.4% 4,127 531,645 84.6% 6. 112 65,882 0.7% 3,900 256,941 94 55,294 0.6% 4,366 241,412 83.9% 7. 176 103,529 1.0% 3,900 403,765 151 88,824 1.0% 4,225 375,245 85.8% 8. 50 29,412 0.3% 3,900 114,706 42 24,706 0.3% 4,001 98,850 84.0% 9. 477 280,588 2.8% 3,900 1,094,294 525 308,824 3.5% 5,070 1,565,736 110.1% n. 122 71,765 0.7% 3,900 279,882 108 63,529 0.7% 4,564 289,922 88.5% 2,577 1,515,882 15.2% 5,911,941 2,321 1,365,294 15.3% 6,112,579 90.1% 1. 383 224,118 2.2% 3,000 672,353 381 224,118 2.5% 3,914 877,136 99.5% 2. 46 25,294 0.3% 3,000 75,882 43 25,294 0.3% 4,163 105,294 93.5% 3. 25 14,706 0.1% 3,000 44,118 25 14,706 0.2% 3,105 45,657 100.0% 4. 6 3,529 0.0% 3,000 10,588 6 3,529 0.0% 2,258 7,968 100.0% 5. 81 39,412 0.4% 3,000 118,235 67 39,412 0.4% 4,159 163,911 82.7% 6. 10 7,647 0.1% 3,000 22,941 13 7,647 0.1% 2,897 22,152 130.0% 551 314,706 3.1% 944,118 535 314,706 3.5% 1,222,119 97.1% 126 74,118 0.7% 1,000 74,118 110 64,706 0.7% 1,069 69,191 87.3% 67 39,412 0.4% 1,000 39,412 61 35,882 0.4% 903 32,411 91.0% 140 82,353 0.8% 1,000 82,353 125 73,529 0.8% 954 70,148 89.3% 333 195,882 2.0% 195,882 296 174,118 1.9% 171,751 88.9%

14 2 3 SC 2.7: SC 2003 S S S S S S S S 0. 87 51,176 0.5% 3,900 199,588 72 42,353 0.5% 3,902 165,262 82.8% 1. 132 77,647 0.8% 3,900 302,824 125 73,529 0.8% 4,155 305,518 94.7% 2. 227 133,529 1.3% 3,900 520,765 188 110,588 1.2% 4,493 496,910 82.8% 3. 680 400,000 4.0% 3,900 1,560,000 575 338,235 3.8% 4,495 1,520,352 84.6% 4. 282 165,882 1.7% 3,900 646,941 244 143,529 1.6% 4,021 577,125 86.5% 5. 253 148,824 1.5% 3,900 580,412 215 126,471 1.4% 4,127 521,934 85.0% 6. 115 67,647 0.7% 3,900 263,824 94 55,294 0.6% 4,366 241,412 81.7% 7. 175 102,941 1.0% 3,900 401,471 153 90,000 1.0% 4,225 380,215 87.4% 8. 41 24,118 0.2% 3,900 94,059 35 20,588 0.2% 4,001 82,375 85.4% 9. 503 295,882 3.0% 3,900 1,153,941 511 300,588 3.4% 5,070 1,523,983 101.6% n. 130 76,471 0.8% 3,900 298,235 117 68,824 0.8% 4,564 314,083 90.0% 2,625 1,544,118 15.4% 6,022,059 2,329 1,370,000 15.3% 6,129,170 88.7% 1. 388 201,765 2.0% 3,000 605,294 343 201,765 2.3% 3,914 789,653 88.4% 2. 49 18,235 0.2% 3,000 54,706 31 18,235 0.2% 4,163 75,910 63.3% 3. 24 12,353 0.1% 3,000 37,059 21 12,353 0.1% 3,105 38,352 87.5% 4. 6 3,529 0.0% 3,000 10,588 6 3,529 0.0% 2,258 7,968 100.0% 5. 72 36,471 0.4% 3,000 109,412 62 36,471 0.4% 4,159 151,679 86.1% 6. 9 6,471 0.1% 3,000 19,412 11 6,471 0.1% 2,897 18,744 122.2% 548 278,824 2.8% 836,471 474 278,824 3.1% 1,082,306 86.5% 126 74,118 0.7% 1,000 74,118 109 64,118 0.7% 1,069 68,562 86.5% 67 39,412 0.4% 1,000 39,412 62 36,471 0.4% 903 32,943 92.5% 140 82,353 0.8% 1,000 82,353 123 72,353 0.8% 954 69,025 87.9% 333 195,882 2.0% 195,882 294 172,941 1.9% 170,530 88.3%

2.1. SC SC 15 2.8: SC 2004 S S S S S S S S 0. 81 47,647 0.5% 3,900 185,824 68 40,000 0.4% 3,902 156,081 84.0% 1. 151 88,824 0.9% 3,900 346,412 139 81,765 0.9% 4,155 339,737 92.1% 2. 232 136,471 1.4% 3,900 532,235 190 111,765 1.3% 4,493 502,196 81.9% 3. 665 391,176 3.9% 3,900 1,525,588 553 325,294 3.6% 4,495 1,462,182 83.2% 4. 281 165,294 1.7% 3,900 644,647 236 138,824 1.6% 4,021 558,203 84.0% 5. 224 131,765 1.3% 3,900 513,882 186 109,412 1.2% 4,127 451,534 83.0% 6. 120 70,588 0.7% 3,900 275,294 104 61,176 0.7% 4,366 267,094 86.7% 7. 172 101,176 1.0% 3,900 394,588 149 87,647 1.0% 4,225 370,275 86.6% 8. 45 26,471 0.3% 3,900 103,235 38 22,353 0.3% 4,001 89,436 84.4% 9. 517 304,118 3.0% 3,900 1,186,059 548 322,353 3.6% 5,070 1,634,330 106.0% n. 146 85,882 0.9% 3,900 334,941 121 71,176 0.8% 4,564 324,820 82.9% 2,634 1,549,412 15.5% 6,042,706 2,332 1,371,765 15.4% 6,155,888 88.5% 1. 391 208,235 2.1% 3,000 624,706 354 208,235 2.3% 3,914 814,977 90.5% 2. 43 24,706 0.2% 3,000 74,118 42 24,706 0.3% 4,163 102,846 97.7% 3. 22 14,118 0.1% 3,000 42,353 24 14,118 0.2% 3,105 43,831 109.1% 4. 5 2,941 0.0% 3,000 8,824 5 2,941 0.0% 2,258 6,640 100.0% 5. 71 45,294 0.5% 3,000 135,882 77 45,294 0.5% 4,159 188,376 108.5% 6. 9 4,706 0.0% 3,000 14,118 8 4,706 0.1% 2,897 13,632 88.9% 541 300,000 3.0% 900,000 510 300,000 3.4% 1,170,301 94.3% 126 74,118 0.7% 1,000 74,118 112 65,882 0.7% 1,069 70,449 88.9% 67 39,412 0.4% 1,000 39,412 61 35,882 0.4% 903 32,411 91.0% 140 82,353 0.8% 1,000 82,353 127 74,706 0.8% 954 71,270 90.7% 333 195,882 2.0% 195,882 300 176,471 2.0% 174,131 90.1%

16 2 3 SC 2.9: SC 2005 S S S S S S S S 0. 65 38,235 0.4% 3,900 149,118 58 34,118 0.4% 3,902 133,128 89.2% 1. 119 70,000 0.7% 3,900 273,000 107 62,941 0.7% 4,155 261,524 89.9% 2. 192 112,941 1.1% 3,900 440,471 160 94,118 1.1% 4,493 422,902 83.3% 3. 557 327,647 3.3% 3,900 1,277,824 467 274,706 3.1% 4,495 1,234,790 83.8% 4. 240 141,176 1.4% 3,900 550,588 205 120,588 1.4% 4,021 484,880 85.4% 5. 183 107,647 1.1% 3,900 419,824 154 90,588 1.0% 4,127 373,851 84.2% 6. 97 57,059 0.6% 3,900 222,529 80 47,059 0.5% 4,366 205,457 82.5% 7. 145 85,294 0.9% 3,900 332,647 125 73,529 0.8% 4,225 310,633 86.2% 8. 37 21,765 0.2% 3,900 84,882 31 18,235 0.2% 4,001 72,961 83.8% 9. 468 275,294 2.8% 3,900 1,073,647 503 295,882 3.3% 5,070 1,500,124 107.5% n. 113 66,471 0.7% 3,900 259,235 94 55,294 0.6% 4,564 252,340 83.2% 2,216 1,303,529 13.0% 5,083,765 1,984 1,167,059 13.1% 5,252,590 89.5% 1. 395 213,529 2.1% 3,000 640,588 363 213,529 2.4% 3,914 835,696 91.9% 2. 43 18,235 0.2% 3,000 54,706 31 18,235 0.2% 4,163 75,910 72.1% 3. 24 11,176 0.1% 3,000 33,529 19 11,176 0.1% 3,105 34,700 79.2% 4. 5 1,765 0.0% 3,000 5,294 3 1,765 0.0% 2,258 3,984 60.0% 5. 65 33,529 0.3% 3,000 100,588 57 33,529 0.4% 4,159 139,447 87.7% 6. 9 3,529 0.0% 3,000 10,588 6 3,529 0.0% 2,897 10,224 66.7% 541 281,765 2.8% 845,294 479 281,765 3.2% 1,099,961 88.5% 126 74,118 0.7% 1,000 74,118 109 64,118 0.7% 1,069 68,562 86.5% 67 39,412 0.4% 1,000 39,412 60 35,294 0.4% 903 31,880 89.6% 140 82,353 0.8% 1,000 82,353 132 77,647 0.9% 954 74,076 94.3% 333 195,882 2.0% 195,882 301 177,059 2.0% 174,518 90.4%

2.1. SC SC 17 2.10: SC 1986 1990 S S S S S S S S 0. 34 20,000 0.3% 3,900 78,000 32 18,824 0.3% 4,108 77,323 94.1% 1. 92 54,118 0.7% 3,900 211,059 81 47,647 0.7% 4,452 212,131 88.0% 2. 200 117,647 1.6% 3,900 458,824 171 100,588 1.5% 4,587 461,350 85.5% 3. 304 178,824 2.4% 3,900 697,412 282 165,882 2.5% 4,427 734,416 92.8% 4. 106 62,353 0.8% 3,900 243,176 88 51,765 0.8% 4,315 223,368 83.0% 5. 92 54,118 0.7% 3,900 211,059 77 45,294 0.7% 3,983 180,400 83.7% 6. 62 36,471 0.5% 3,900 142,235 56 32,941 0.5% 4,274 140,795 90.3% 7. 167 98,235 1.3% 3,900 383,118 141 82,941 1.3% 4,107 340,638 84.4% 8. 39 22,941 0.3% 3,900 89,471 35 20,588 0.3% 3,348 68,924 89.7% 9. 726 427,059 5.8% 3,900 1,665,529 628 369,412 5.6% 5,063 1,870,156 86.5% n. 137 80,588 1.1% 3,900 314,294 115 67,647 1.0% 3,968 268,421 83.9% 1,959 1,152,353 15.5% 4,494,176 1,706 1,003,529 15.2% 4,577,921 87.1%

18 2 3 SC 2.11: SC 1991 1995 S S S S S S S S 0. 58 34,118 0.5% 3,900 133,059 57 33,529 0.5% 4,108 137,731 98.3% 1. 149 87,647 1.2% 3,900 341,824 125 73,529 1.1% 4,452 327,363 83.9% 2. 322 189,412 2.6% 3,900 738,706 287 168,824 2.6% 4,587 774,312 89.1% 3. 562 330,588 4.5% 3,900 1,289,294 525 308,824 4.7% 4,427 1,367,263 93.4% 4. 186 109,412 1.5% 3,900 426,706 158 92,941 1.4% 4,315 401,048 84.9% 5. 166 97,647 1.3% 3,900 380,824 139 81,765 1.2% 3,983 325,657 83.7% 6. 90 52,941 0.7% 3,900 206,471 76 44,706 0.7% 4,274 191,078 84.4% 7. 271 159,412 2.1% 3,900 621,706 226 132,941 2.0% 4,107 545,987 83.4% 8. 59 34,706 0.5% 3,900 135,353 49 28,824 0.4% 3,348 96,493 83.1% 9. 1,055 620,588 8.4% 3,900 2,420,294 968 569,412 8.6% 5,063 2,882,661 91.8% n. 148 87,059 1.2% 3,900 339,529 123 72,353 1.1% 3,968 287,094 83.1% 3,066 1,803,529 24.3% 7,033,765 2,733 1,607,647 24.3% 7,336,688 89.1%

2.1. SC SC 19 2.12: SC 1996 2000 S S S S S S S S 0. 81 47,647 0.6% 3,900 185,824 80 47,059 0.7% 4,108 193,307 98.8% 1. 194 114,118 1.5% 3,900 445,059 192 112,941 1.7% 4,452 502,829 99.0% 2. 371 218,235 2.9% 3,900 851,118 321 188,824 2.9% 4,587 866,042 86.5% 3. 705 414,706 5.6% 3,900 1,617,353 692 407,059 6.2% 4,427 1,802,183 98.2% 4. 247 145,294 2.0% 3,900 566,647 205 120,588 1.8% 4,315 520,347 83.0% 5. 257 151,176 2.0% 3,900 589,588 212 124,706 1.9% 3,983 496,685 82.5% 6. 135 79,412 1.1% 3,900 309,706 113 66,471 1.0% 4,274 284,103 83.7% 7. 324 190,588 2.6% 3,900 743,294 266 156,471 2.4% 4,107 642,622 82.1% 8. 76 44,706 0.6% 3,900 174,353 66 38,824 0.6% 3,348 129,970 86.8% 9. 1,143 672,353 9.1% 3,900 2,622,176 1,086 638,824 9.7% 5,063 3,234,060 95.0% n. 153 90,000 1.2% 3,900 351,000 132 77,647 1.2% 3,968 308,101 86.3% 3,686 2,168,235 29.2% 8,456,118 3,365 1,979,412 29.9% 8,980,250 91.3%

20 2 3 SC 2.13: SC 2001 2005 S S S S S S S S 0. 90 52,941 0.7% 3,900 206,471 80 47,059 0.7% 4,108 193,307 88.9% 1. 182 107,059 1.4% 3,900 417,529 162 95,294 1.4% 4,452 424,262 89.0% 2. 428 251,765 3.4% 3,900 981,882 354 208,235 3.1% 4,587 955,075 82.7% 3. 785 461,765 6.2% 3,900 1,800,882 696 409,412 6.2% 4,427 1,812,601 88.7% 4. 258 151,765 2.0% 3,900 591,882 212 124,706 1.9% 4,315 538,115 82.2% 5. 313 184,118 2.5% 3,900 718,059 262 154,118 2.3% 3,983 613,828 83.7% 6. 157 92,353 1.2% 3,900 360,176 135 79,412 1.2% 4,274 339,415 86.0% 7. 308 181,176 2.4% 3,900 706,588 264 155,294 2.3% 4,107 637,790 85.7% 8. 78 45,882 0.6% 3,900 178,941 67 39,412 0.6% 3,348 131,939 85.9% 9. 1,152 677,647 9.1% 3,900 2,642,824 1,083 637,059 9.6% 5,063 3,225,126 94.0% n. 145 85,294 1.2% 3,900 332,647 123 72,353 1.1% 3,968 287,094 84.8% 3,896 2,291,765 30.9% 8,937,882 3,438 2,022,353 30.6% 9,158,553 88.2%

2.2. SC 21 2.2 SC 2.2.1 SC SC SC SC SC SC Yahoo!Yahoo! 9 SC SC SC SC Yahoo!Yahoo! 1 SC SC SCYahoo!Yahoo! 2.2.2 SC 2.14

22 2 3 SC 2.14: SC S S 1976 2005 1,006 1,500 500 2005 2007 145 483 120 2008 100 355 400 1976 2005 951 1,696 447 Yahoo! 2004 2005 3,120,839 91,450 1,000 Yahoo! 2008 2009 3,463,413 52,680 1,000 1980 2005 130 253 15 1976 2005 718 348 100 1976 2005 32,925 159 500

23 3 3.1 3.1 3.1: SC S S 2001 2005 485 11,212 2,954 SC 2001 2005 105 2,483 569 2001 2005 64 1,490 86 1986 2005 479 11,242 3,005 SC 1976 2005 1,006 1,500 500 2005 2007 145 483 120 SC 2008 100 355 400 1976 2005 951 1,696 371 Yahoo! 2004 2005 312 91,450 1,000 Yahoo! 2008 2009 346 52,680 1,000 1980 2005 130 253 15 1976 2005 718 348 100 1976 2005 32,925 159 500 SC SC (2006,2007) (2009) (2011)

24 3 3.2 SC SC 2001 2005 5 11,212 2001 2005 5 J-BISC 2001 2005 5 40 2001 2005 317,117 74,911,520 NDC 227 1,135 1 74,911,520 48,539,925,351 SC 2 55 NDC 11 J-BISC NDC 1 0 9 NDC 11 5 2001 2005 5 NDC 3.1

3.2. SC 25 3.1: SCNDC 55 11,212 NDC 3.2 3.2: SCNDC

26 3 3.3 SC SC 2001 2005 5 2,483 2001 2005 5 2001 2005 5 2001 2005 1,259 55,779 10,414,955 53 265 1 10,414,955 10,515,681,636 SC 2 30 6 1. 2. 3. 4. 5. 6. 6 5 2001 2005 5 3.3

3.3. SC 27 3.3: SC 30 2,483 3.4 3.4: SC

28 3 3.4 SC SC 2001 2005 5 1,490 2001 2005 5 16 2001 2005 16 49,625 1,198,189 4 8 211 1,198,189 6,416,070,114 SC 2 80 16 16 5 2001 2005 5 3.5

3.4. SC 29 3.5: SC 80 1,490 3.6 3.6: SC

30 3 3.5 SC SC 1986 2005 20 11,242 1986 2005 20 ISBN SC SC 13 335,721 85,363,019 47,877,656,072 SC SC 2 220 NDC 11 J-BISC NDC 1 0 9 NDC 11 20 1986 2005 20 NDC 3.7

3.5. SC 31 3.7: SCNDC 220 11,242 NDC 3.8 3.8: SCNDC

32 3 3.6 SC SC 1976 2005 30 1,500 1976 2005 30 2001 2005 2001 2005 1997 1976 30 1989 1 40 1,006 SC 2 54 9 9 6 1976 2005 30 5 6 1 1976 1980 2 1981 1985 3 1986 1990 4 1991 1995 5 1996 2000 6 2001 2005

3.6. SC 33 500 1 6 250 1,500 40 1,500 1,500 3.9 3.9: SC

34 3 3.7 SC SC 483 10 11 15 2005 2007 1 145 SC 2 25 10 10 3 3 7,859,456 3.10 25

3.7. SC 35 3.10: SC 483 3.11 3.11: SC

36 3 3.8 SC SC 355 2008 100 2008 100 2008 Web PDF 8 8 1 6 1 1 6 355 3.12

3.8. SC 37 3.12: SC

38 3 3.9 SC SC 1976 2005 30 1,696 1976 2005 30 20 20 951 SC 1971 1976 1 2 951 1,902 1,696 1,696 NDC 3.13

3.9. SC 39 3.13: SCNDC

40 3 3.10 SC Yahoo! SC Yahoo! Q&A Yahoo! 91,450 Yahoo! 2004 10 2005 10 3,120,839 SC Yahoo! Yahoo! 15 82 279 3 > > 2078523513 > > 2078297515 > > 2078297810 279 1 1 1 URL

3.10. SC Yahoo! 41 1,000 1 91,450 279 91,450 14 59 130 91,450 3.14 3.14: SC Yahoo!

42 3 3.11 SC Yahoo! SC Yahoo! Yahoo! 52,680 Yahoo! 3,463,413 SC Yahoo! 1. 2008 4 26 2009 4 25 2. 1,000 3. 1 4. Yahoo! 5. 1 20 Yahoo! 15 54 316 3 > > 555000540 > > 555000549 > > 555002691

3.11. SC Yahoo! 43 1,000 1.8% 52,680 3.15 3.15: SC Yahoo!

44 3 3.12 SC SC 3 253 2002 14 17 1980 1982 8 15 1986 2005 118 34 1959 63 1988 25 1950 54 1979 BCCWJ 3 60 92 101 5 253 3.16

3.12. SC 45 3.16: SC

46 3 3.13 SC SC 1976 2005 30 2009 348 Web http://law.e-gov.go.jp/1976 2005 2009 9 718 SC 6 6 1976 2005 30 5 6 1 1976 1980 2 1981 1985 3 1986 1990 4 1991 1995 5 1996 2000 6 2001 2005 50 3.17 1 6 30 180 100 200 1

3.13. SC 47 3.17: SC 348 3.2 3.2: SC 2 5 2 1 3 1 18 3 22 7 11 4 3 1 5 13 1 6 3 9 4 8 2 12 1 4 10 17 5 18 13 15 36 4 40 1 7 3 11 6 4 3 4 348

48 3 3.14 SC SC1976 2005 30 159 Web http://kokkai.ndl.go.jp/ 77 163 32,986 SC 61 1,000 6,401 77 1975 33 3 48 2 2 6 1976 2005 5 6 1 1976 1980 2 1981 1985 3 1986 1990 4 1991 1995 5 1996 2000 6 2001 2005 4 4

3.14. SC 49 500 159 1 1 48 159 159 159 3.18 3.18: SC

II

53 4 BCCWJ 4.1 BCCWJ BCCWJ BCCWJ Web BCCWJ 4.2 BCCWJ 2011 1 BCCWJ Bibliography.txt Sample.txt ID Directory.txt Sample author.txt

55 5 Bibliography.txt 5.1 Bibliography.txt BCCWJ 5.1 15 5.1: 1. ID Bib ID ID 2. Title 3. Subtitle 4. Number 5. Bib author 6. Publisher 7. Year 8. ISBN ISBN ISBN 9. Size 10. Pages 11. (1) Genre 1 (1) 12. (2) Genre 2 (2) 13. (3) Genre 3 (3) 14. (4) Genre 4 (4) 15. ID Bib author ID ID 5.2 Yahoo!Yahoo! 15

56 5 Bibliography.txt 5.2: Bib ID Title Subtitle Number Bib author Publisher BK 20126734 PM 00070308 2003 8 PN 01030302 2003/3/2 WR 00000003 51 TB 01000009 PR 14212017 2008 17 Yahoo! YC 00297502 Yahoo! Yahoo! Yahoo! YB 00002691 Yahoo! Yahoo! VE 93066308 LA S63HO108 MD 02010001 154 Year ISBN Size Pages Genre 1 Genre 2 Genre 3 Genre 4 Bib author ID 2001 4167105926 16cm 368 9 913 0193 00070104 2003 A5 260 1 2003 37 1976 2006 5 00045734 2008 Yahoo! 2005 Yahoo! 2008 1991 4783708665 19cm 160 00093767 1988 23 2002

5.2. 57 5.2 Bibliography.txt 5.2.1 ID ID Bib ID ID BK 20000563 PM 00010409 PN 01010202 WR 00000001 TB 01000001 PR 01103001 YC 00297787Yahoo! YB 00000549Yahoo! VE 00010001 LA S51HO042 MD 00297787 1 2 BK PM PN WR TB PR YC YB VE LA MD 8 ID ID ID BK 20000215 BK 99131275 BK XXXXXX02 BK XXXXXX40 BK 7501115D BK 8900620D 1 2 BK Book 3 4 11 ID 4 11 ID 4 X ID 2005 10 ID 11 D 1 D

58 5 Bibliography.txt ID ID PM 00010120 PM 12590109 1 2 PM Magazine 3 4 7 ID 8 9 10 11 4 7 0001 1259 1,259 ID 0001 0002 2001 2005 ID 1229 2001 2005 ID 8 9 01 052001 2005 2 10 11 01 52 11 11 11 (Number) ID ID PN 01010125 PN 31041101 1 2 PN Newspaper 3 4 5 ID 6 7 8 11 4 5 01 31 16 ID 01 31ID 79 5.3.3

5.2. 59 6 7 01 052001 2005 2 8 11 0101 12311 1 12 31 4 ID ID WR 00000001 WR 00001006 1 2 WR 3 4 11 ID 4 11 ID 1,006 ID ID ID TB 01000001 TB 91000002 1 2 TB TextBook 3 4 0 = 3 = 6 = 9 = 1 = 4 = 7 = 2 = 5 = 8 = 5 1 = 2 = 3 = 6 11 ID ID PR 01103001 PR 47209008 1 2 PR Public Relations 3 4 8 ID 9 11 4 8 ID 5

60 5 Bibliography.txt 10 11 01 36 112008 11 Yahoo! ID Yahoo! ID YC 00297287 YC 00585157 1 2 YC Yahoo!Yahoo! Chiebukuro 3 4 11 Yahoo! ID BCCWJ 130 Yahoo!81 5.3.5 Yahoo! ID Yahoo! ID YB 00000075 YB 00023084 1 2 YB Yahoo!Yahoo! Blog 3 4 11 Yahoo! ID BCCWJ 316 Yahoo!84 5.3.6 ID ID VE 00010001 VE 99099368 1 2 VE Verse 3 4 11 ID 4 11 ID 4 7 000100028 11 ID

5.2. 61 ID ID LA S51HO042 LA H17HO124 1 2 LA Law 3 4 6 7 8 HO 4 11 4 11 ID Web HTML ID ID MD 00010004 MD 99060001 1 2 MD Minutes of the Diet 3 4 5 6 7 01 = 05 = 02 = 06 = 03 = 07 = 04 = 08 = 8 11 ID 4 5 76 051976 2005 2 91 5.3.8

62 5 Bibliography.txt 5.2.2 Title ; Yahoo!Yahoo! Yahoo!Yahoo! 1 ; Yahoo!Yahoo! 1 5.2.3 Subtitle 4

5.2. 63 Yahoo!Yahoo! 5.2.4 Number 6 3() 2002 4 15 15 16 750 80 49 4467 2001/10/24 2008 12 17 ( 55 63 ) 154 Yahoo!Yahoo!

64 5 Bibliography.txt 5.2.5 Bib author ; ; A. ; ; ; ; 103 Directory.txt ; ; Yahoo!Yahoo! 5.2.6 Publisher ; () Yahoo!Yahoo!Yahoo!

5.2. 65 ; Yahoo!Yahoo! Yahoo! 1 5.2.7 Year4 2001 Yahoo! 2005 1 Yahoo! 2008 1 5.2.8 ISBN ISBN ISBN ISBN 4889916687 ISBN 2007 13 10 ISBN 5.2.9 Size 20cm A4 23cm

66 5 Bibliography.txt 5.2.10 Pages 222 5.2.11 (1) (4) (1) (4) Genre 1 Genre 4 5.3: (1) (2) (3) (4) 9 913 0193 1 3 Yahoo! Yahoo! 35 (1) (2) (3) NDC 9 1 + NDC 9 3 C

5.2. 67 (1) (1) NDC 9 1 0 4 8 1 5 9 2 6 3 7 2005 10 NDC (2) (2) NDC 9 3 002 992 NDC 2 2 74 5.3.1 3 3 9 2005 10 NDC (3) (3) C 0000 9979 C 4 1 2 3 4 C 1 0 3 6 I 9 1 4 7 II 2 5 8

68 5 Bibliography.txt C 2 0 3 6 9 1 4 7 2 5 8 C 3 4 76 5.3.1 C (1) (2) (3) (4) (1) 1 4 2 5 3 6 77 5.3.2 (4) 2... (1)

5.2. 69 (1) (1) 79 5.3.3 (1) (1) (1) 9 80 5.3.4 (1) (2) (3) (1) (1)

70 5 Bibliography.txt (2) (2) (3) (3) 123456 (1) (2) (1) (1) (2) (2) Yahoo! Yahoo! (1) (2) (3)

5.2. 71 Yahoo! 14 59 130 3 Yahoo! JAPAN PC Yahoo!81 5.3.5 Yahoo! Yahoo! (1) (2) (3) Yahoo! 15 54 316 3 Yahoo! Yahoo!84 5.3.6

72 5 Bibliography.txt (1) (1) (1) (1) 43 04 90 5.3.7 (1) (2) (3) (1) (1) 2 1 (2) (2) 4

5.2. 73 (3) (3) 59 91 5.3.8 5.2.12 ID ID Bib author IDBib author ID ID Directory.txt ID Directory ID ID 103 7.2.1 00685074 00254659 ; 00184422 00113880 ; 00166885 ; 00124738 00037561 ID8 0 ID ; Directory.txt ID ID ID

74 5 Bibliography.txt 5.3 5.3.1 NDC 2 (2) 3 NDC 2 2 3 3 9 00 01. 02. 03 04. 05 06 07. 08.. 09.. 20 21 22. 23. 24 25 26 27. 28 29.. 10 11 12 13 14 15. 16 17 18 19 30 31 32 33 34 35 36 37 38.. 39.

5.3. 75 40 41 42 43 44. 45. 46. 47 48 49. 70. 71 72. 73 74. 75 76. 77. 78. 79. 50. 51. 52 53. 54. 55.. 56. 57 58 59. 80 81 82. 83 84 85 86 87 88 89 60 61 62 63 64. 65 66 67 68. 69 90 91 92. 93 94 95 96 97 98 99

76 5 Bibliography.txt C (3) 4 C 3 4 C NDC 2 2 00 01 02 04 10 11 12 14 15 16 20 21 22 23 25 26 30 31-32 33 34 36 37 39 40 41 42 43 44 45 47 50 51 52 53 54 55 56 57 58 60 61 62 63 65 70 71 72 73 74 75 76 77 79 80 81 82 84 85 87 90 91 92 93 95 97 98

5.3. 77 5.3.2 (1) (3) 6 27 71 3 1

78 5 Bibliography.txt 1 2 3 4 5 6

5.3. 79 5.3.3 ID 4 5 ID 01 31 16 ID ID ID 01 17 02 18 03 19 04 20 05 21 06 22 09 23 10 24 11 25 12 26 13 27 14 28 15 30 16 31 ID 070829

80 5 Bibliography.txt 5.3.4 (1) 9 / / / / / / / / ODA / / / / / / / / / /1976 2005

5.3. 81 5.3.5 Yahoo! Yahoo! (1) (3) 14 59 130 3 CM PC AV AV

82 5 Bibliography.txt

5.3. 83 Yahoo! JAPAN Yahoo! Yahoo! Yahoo! Yahoo! Yahoo! Yahoo!

84 5 Bibliography.txt 5.3.6 Yahoo! Yahoo! (1) (3) 15 54 316 3 Windows Macintosh UNIX

5.3. 85 UFO

86 5 Bibliography.txt

5.3. 87

88 5 Bibliography.txt CLUB KEIBA Yahoo! Yahoo! Yahoo! Yahoo! Yahoo! Yahoo! Yahoo! Yahoo! Yahoo! Yahoo!

5.3. 89

90 5 Bibliography.txt 5.3.7 (1) 01 02 03 04 05 07 08 09 10 11 12 14 15 16 17 19 20 21 23 24 25 26 27 28 29 30 31 32 33 34 35 37 38 39 40 42 43 44 45 46 47 49 50

5.3. 91 5.3.8 (2) (3) 4

93 6 Sample.txt 6.1 Sample.txt BCCWJ ID 6.1 4 6.1: 1. ID Sample ID ID 2. ID Bib ID ID 3. Sampling page 4. Sampling point Sample ID Bib ID Sampling Sampling page point PB10 00047 BK 20205918 163 5D PM11 00053 PM 10550109 76 9F PN1a 00013 PN 01010225 4 6C LBa1 00004 BK 86049602 230 2H OW6X 00009 WR 00000066 285 4C OT01 00008 TB 01000002 31 8A OP00 00001 PR 01103001 OB0X 00001 BK 75079014 358 4D Yahoo! OC01 00001 YC 00297514 Yahoo! OY01 00005 YB 00010571 OV0X 00001 VE 00010001 OL3X 00072 LA H01HO058 OM11 00001 MD 80010001

94 6 Sample.txt 6.2 Sample.txt 6.2.1 ID ID Sample ID ID PB10 00047 SC PM11 00053 SC PN1a 00013 SC LBa1 00004 SC OW6X 00009 SC OT01 00008 SC OP01 00008 SC OB0X 00001 SC OC01 00001 SC Yahoo! OY01 00005 SC Yahoo! OV0X 00001 SC OL1X 00001 SC OM11 00001 SC 1 P L O SC 2 B M N W T P C Y V L SC 3 4 1 2 5 ID

6.2. 95 SC ID SC ID PB10 00001 PB5n 00141 1 P SC Publication 2 B Book 3 1 5 1 2001 3 2003 5 2005 2 2002 4 2004 4 0 9,n NDC 1 0 48 1 59 2 6n 3 7 5 6 10 NDC SC ID SC ID PM11 00002 PM56 00004 1 P SC Production 2 B Magazine 3 1 5 1 2001 3 2003 5 2005 2 2002 4 2004 4 1 6 14 25 36 5 6 10

96 6 Sample.txt SC ID SC ID PN1a 00001 PN5o 00021 1 P SC Publication 2 N Newspaper 3 1 5 1 2001 3 2003 5 2005 2 2002 4 2004 4 a o afk bgl chm dio ej 5 6 10 SC ID SC ID LBa0 00002 LBtn 00025 1 L SC Library 2 B Book 3 a t a 1986 h 1993 o 2000 b 1987 i 1994 p 2001 c 1988 j 1995 q 2002 d 1989 k 1996 r 2003 e 1990 l 1997 s 2004 f 1991 m 1998 t 2005 g 1992 n 1999 4 0 9,n NDC 1 0 48 1 59 2 6n 3 7 5 6 10 NDC

6.2. 97 SC ID SC ID OW1X 00000 OW6X 03369 1 O SC 2 W White Paper 3 1 6 1 1 1976 1980 2 2 1981 1985 3 3 1986 1990 4 4 1991 1995 5 5 1996 2000 6 6 2001 2005 4 X 5 6 10 SC ID SC ID OT01 00002 OT91 00009 1 O SC 2 T TextBook 3 0 9 0 = 5 = 1 = 6 = 2 = 7 = 3 = 8 = 4 = 9 = 4 1 3 1 = 2 = 3 = 5 6 10 SC ID SC ID OP00 00001 OP99 00003 1 O SC 2 P Public Relation 3 4 00 99 100 5 6 10

98 6 Sample.txt SC ID SC ID OB0X 00001 OB6X 00257 1 O SC 2 B Best-seller 3 0 6 0 0 1975 4 4 1991 1995 1 1 1976 1980 5 5 1996 2000 2 2 1981 1985 6 6 2001 2005 3 3 1986 1990 4 X 5 6 10 SC Yahoo! ID SC Yahoo! ID OC01 00001 OC15 01173 1 O SC 2 C Yahoo!Chiebukuro 3 4 01 15 ID 01 02 PC 03 04 05 06 08 09 10 11 12 13 14 Yahoo! JAPAN 15 5 6 10 ID 07 Yahoo!

6.2. 99 SC Yahoo! ID SC Yahoo! ID OY01 00005 OY15 09456 1 O SC 2 Y Yahoo! Blog 3 4 01 15 ID 01 02 03 04 05 06 07 08 09 10 11 12 13 14 Yahoo! 15 5 6 10 SC ID SC ID OV0X 00001 OV2X 00108 1 O SC 2 V Verse 3 0 2 0 1 2 4 X 5 6 10

100 6 Sample.txt SC ID SC ID OL1X 00001 OL6X 00066 1 O SC 2 L Law 3 1 6 1 1 1976 1980 2 2 1981 1985 3 3 1986 1990 4 4 1991 1995 5 5 1996 2000 6 6 2001 2005 4 X 5 6 10 SC ID SC ID OM11 00001 OM68 00001 1 O SC 2 M Minutes of the Diet 3 1 6 1 1 1976 1980 4 4 1991 1995 2 2 1981 1985 5 5 1996 2000 3 3 1986 1990 6 6 2001 2005 4 1 8 1 5 2 6 3 7 4 8 5 6 10 6.2.2 ID ID Bib ID ID ID Bibliography.txt ID Bib ID ID 57 5.2.1 6.2.3 Sampling page 6.2.4

6.2. 101 6.2.4 Sampling point 1 0 9 A J 10 10 3E

103 7 Directory.txt 7.1 Directory.txt Directory.txt 4 1. ID Directory ID ID 2. Name 3. Sex 4. BirthYear 10 Directory ID Name Sex BirthYear 634 1910 98948 1940 153494 1950 840303 2000130 2502212 NHK X 7.2 Directory.txt 7.2.1 ID ID Directory ID ID

104 7 Directory.txt 4078 ID 31535 ID 2505106 ID ID ID 7.2.2 Name 7.2.3 Sex 7.2.4 BirthYear19501960 10

105 8 Sample author.txt 8.1 Sample author.txt Sample author.txt 2 1. ID Sample ID ID 2. ID Directory ID ID Sample ID Directory ID PB10 00022 107107 PM43 00020 303855 LBl9 00073 327382 LBl9 00073 556836 1 LBl9 00073 ID ID Yahoo!Yahoo! 8.2 Sample author.txt 8.2.1 ID ID Sample ID ID ID Sample.txt ID Sample ID

106 8 Sample author.txt ID 94 6.2.1 8.2.2 ID ID Directory ID ID ID Directory.txt ID Directory ID ID 103 7.2.1

107 9 9.1 4 9.1 Microsoft Access MySQL SQL Server 4 ID 9.2 1 Microsoft Excel ID

108 9 Bibliography.txt 1. Bib ID ID 2. Title 3. Subtitle 4. Number 5. Bib author 6. Publisher 7. Year 8. ISBN ISBN 9. Size 10. Pages 11. Genre 1 (1) 12. Genre 2 (2) 13. Genre 3 (3) 14. Genre 4 (4) 15. Bib author ID ID Sample.txt 1. Sample ID ID 2. Bib ID ID 3. Sampling page 4. Sampling point 5. Status 6. Core Directory.txt 1. Directory ID ID 2. Name 3. Sex 4. BirthYear Sample author.txt 1. Sample ID ID 2. Directory ID ID 9.1:

9.1. 109 1. Sample ID ID 2. Bib ID ID 3. Title 4. Subtitle 5. Number 6. Bib author 7. Publisher 8. Year 9. ISBN ISBN 10. Size 11. Pages 12. Genre 1 (1) 13. Genre 2 (2) 14. Genre 3 (3) 15. Genre 4 (4) 16. Bib author ID ID 17. Sampling page 18. Sampling point 19. Status 20. Core 21. Author ID ID 22. Author 23. Sex 24. BirthYear 9.2:

110 9 9.2 SC 2001 2005 2003 ID PB39 00742, 1905 2 ID BCCWJ

III

113 10 III SSG; 5 [1], (2007)., 18 (JC-D-06-02),. [2],,,,,, (2008)., 19 (JC-D-07-02),. [3], (2008). (2), 19 (JC-D-07-01),. [4],,,,,,, (2009)., 20 (JC-D-08-02),. [5],,,,,,, (2009)., 20 (JC-D-08-01),. [6],,,,,,, (2011).,

114 10 22 (JC-D-10-01),. [7],,,,,,, (2011)., 22 (JC-D-10-02),. [8],,,,,,,, (2006).. 18. 9-16. [9],,,,,, (2007).. 18. 79-88. [10],,,,,,,, (2007). 18. 18. 25-28. [11],,,,,,,, (2007). 19. 19. 3-8. [12] (2007).. 18. 127-136. [13],,,,,, (2008).. 19. 143-152. [14] (2008).. 20. 83-90. [15],,,,,, (2008). (2)

115. 19. 37-46. [16], (2008). (2). 19 (JC-D-07-01),. [17],,,,,,,,,, (2008). 20. 20. 5-10. [18],,,,,,,, (2008). 19. 19. 65-72. [19],,,,,,, (2009). (3). 20. 33-42. [20],,,,,,,,, (2009). 21. 21. 3-8. [21] (2009).. 20. 5-12. [22] (2010).. 21. 47-54. [23] (2010). Yahoo!. 22. 73-80. [24] (2010). Q&A Yahoo!. 21. 55-62.

116 10 [25],,,,,,, (2010). (4). 21. 37-46. [26],,,,,,,,,, (2010). 22. 22. 3-8. [27] (2010).. 21. 5-14. [28] (2010). BCCWJ. 22. 109-112. [29], (2011).. 22. [30], (2011). Yahoo!. 22. [31],,,,,,, (2011). (5). 22. [32],,,,,,,,,, (2011).. 22. [33] (2011).. 22., [34] (2006).. 25(9). 18-27..

117 [35], (2007).. 22. 5-12.. [36] (2007). KOTONOHA. 8. 180-183.. [37] (2008).. 12. 49-69.. [38] Sano, M. & Thomson, E. A. (2008). Japanese Folk Tales: text structure and evaluative expression. Bridging Discourse: ASFLA 2007 Online Proceedings. 1-17. [39] (2009). Yahoo!. 7. 57-68.. [40] (2009).. 140. 26-36.. [41] (2009).. 21 1 122-130.. [42] (2009). 1.2 1.2.1 1.2.2.,. 58-71.. [43] (2009).. 74(1). 183-191.. [44] (2009).. 24(5). 623-631.. [45] (2009).. Japio. 118-121.. [46] Maruyama, T., Yamazaki, M. & Maekawa, K. (2009). Statistical sampling method used in the Balanced Corpus of Contemporary Written Japanese. Current ISSUES in Unity and Diversity of Languages: Collection of the papers selected from the CIL 18. 3864-3876. [47] (2010).. 12. 19-26.. [48] (2010). /. 27(7). 249-269..

118 10 [49] (2010).. 109(390). 37-42.. [50] (2010).. 7. 3-48.. [51] (2010). Q&A. 109(390). 7-12.. [52] (2011).. 110(400). 19-24.. [53], (2011 ).,,. 6.. [54] (2011 ).. 8.. [55], (2011). Yahoo! -. 110(400). 13-18.. [56] (2011 ). QA WebQA Yahoo!. 8. 65-80.. [57] (2011 ). 10.,.. [58] (2011 ). 3.,, IT 5.. [59], (2011 ).. 6. [60],,,,, (2006).. 12. 444-447.

119 [61],,,,, (2006).. 12. 680-683. [62],,,,,,,,,,, (2006).. 12. 440-443. [63],,,,,, (2007).. 13. 708-711. [64],,,,, (2007).. 13. 704-707. [65],,,,,,,,,,,,,,, (2007).. 2007. 239-246. [66],,,,,, (2008). NDC. 14. 939-942. [67] (2008).. 30. 11-32. [68], (2008).. 14. 1097-1100. [69] (2008).. 2008.. [70] (2008). Yahoo!. 22. 114-117. [71] (2008).. 21. 92-95. [72] (2008).. 14. 911-914. [73] (2008). Yahoo!. 2008.

120 10 [74] Sano, M. & Maruyama, T. (2008). Lexical Density in Japanese Texts: classifying text samples in the Balanced Corpus of Contemporary Written Japanese (BCCWJ). Proceedings of 35th International Systemic Functional Congress. 359-364. [75] Sano, M. & Mizusawa, Y. (2008). Describing Japanese Language and Text: applications of systemic functional theory. Columbia University Teachers College Public Seminar. Columbia University Teachers College. [76],,,,,,, (2009).. 15. 196-199. [77], (2009)... 38-43. [78], (2009). Context based register typology. 2009.. [79] (2009).. 23. 28-31. [80] (2009).. 11. [81],,,,,,, (2009).. 15. 618-621. [82] (2009)..,.. 47-50. [83] (2009).. 20. 163-177. [84], (2010).. 88. 59-70. [85] (2010).. 35. 63-72. [86],, (2010). 140. 370-375.

121 [87], (2010). ( ). 25. 182-185. [88], (2010).. 16. 174-177. [89] (2010)... 43-48. [90] (2010). attitude. 2010. [91] (2010)... 49-54. [92] (2010). QA Web. 26. 142-145. [93] (2010). Yahoo!. 2010. [94] (2010).. 16. 150-153. [95] (2010)... 25-30. [96] (2010). BCCWJ. 25 25. 16-17. [97] Kashino, W. & Okumura, M. (2010). An Approach toward Register Classification of Book Samples in the Balanced Corpus of Contemporary Written Japanese. Proc. of PACLIC24. 433-438. [98], (2011 ).. 17. [99] (2011 ).. 17.

122 10 [100] (2011 ).. 27. [101], (2011 ). Yahoo! -. 27. [102], (2011 ). Q&A. 17. [103] (2011 ).. 17. [104] (2006).. NAIST. [105] (2006)... [106] (2006).. 12. [107] (2008). Balanced Corpus of Contemporary Written Japanese -its design and compilation-.. [108] (2008)... [109] (2008).. COE. [110] (2009)...

/ / / 22 23 2 25 190 8561 10 2 JC-D-10-02 c 2011 Data Handling Group, Priority-Area Research Japanese Corpus