計量国語学 アーカイブ ID KK 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as

Similar documents
pp DC 2,

pp R R Word R R R R Excel SPSS R Microsoft Word 2016 OS Windows7 Word2010 Microsoft Office2010 R Emacs ESS R R R R https:

pp Excel Excel Excel Microsoft Excel 2015 OS Windows7 Excel2010(Microsoft Office2010) Office Excel 2 Excel 33

pp Word Excel PowerPoint Microsoft Word Excel PowerPoint Word Excel PowerPoint a 201

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

Kyushu Communication Studies 第2号

29 jjencode JavaScript


Modal Phrase MP because but 2 IP Inflection Phrase IP as long as if IP 3 VP Verb Phrase VP while before [ MP MP [ IP IP [ VP VP ]]] [ MP [ IP [ VP ]]]

NINJAL Research Papers No.10


36 Theoretical and Applied Linguistics at Kobe Shoin No. 20, 2017 : Key Words: syntactic compound verbs, lexical compound verbs, aspectual compound ve

学位研究17号

matsuda.dvi


1. David Murray

i5 Catalyst Case Instructions JP


1 1 tf-idf tf-idf i

, IT.,.,..,.. i

pp


*.E (..).R

1,a) 1,b) TUBSTAP TUBSTAP Offering New Benchmark Maps for Turn Based Strategy Game Tomihiro Kimura 1,a) Kokolo Ikeda 1,b) Abstract: Tsume-shogi and Ts

NINJAL Project Review Vol.3 No.3

_Y05…X…`…‘…“†[…h…•

展示の刹


( ) CD-ROM Web ( ) 1 mp3 2.インターネット 時 代 の 図 書 館 を 巡 る 諸 問 題 2.1 情 報 センターと 図 書 館 の 境 界 の 揺 らぎ Web Gopher FTP 2 3 NACSIS OPAC( )


Web Web Web Web Web, i

<95DB8C9288E397C389C88A E696E6462>

先端社会研究所紀要 第9号☆/2.島村

ネットワーク化するデジタル情報家電の動向


形容詞的過去分詞(Adjectival Past Participle)の選択束縛について

IPSJ SIG Technical Report Vol.2009-HCI-134 No /7/17 1. RDB Wiki Wiki RDB SQL Wiki Wiki RDB Wiki RDB Wiki A Wiki System Enhanced by Visibl

2reN-A14.dvi

The object of this paper is to look into the transition of discourse about Asia in 'The Nippon' one of the most famous newspapers in the period from 1

文を綴る、文を作る

DOUSHISYA-sports_R12339(高解像度).pdf

:. * ** *** **** Little Lord Fauntleroy Little Lord Fauntleroy Frances Eliza Hodgson Burnett, - The Differences between the Initial Edition and First

09‘o’–

三税協力の実質化 : 住民税の所得税閲覧に関する国税連携の効果

-like BCCWJ CD-ROM CiNii NII BCCWJ BCCWJ

IPSJ SIG Technical Report Vol.2014-CE-127 No /12/7 1,a) 2,3 2,3 3 Development of the ethological recording application for the understanding of

3年生における国語表現指導

<30375F97E996D88E812E696E6464>



WikiWeb Wiki Web Wiki 2. Wiki 1 STAR WARS [3] Wiki Wiki Wiki 2 3 Wiki 5W1H Wiki Web 2.2 5W1H 5W1H 5W1H 5W1H 5W1H 5W1H 5W1H 2.3 Wiki 2015 Informa

Core Ethics Vol. Talboks-och punktskriftsbiblioteket TPB DAISY Bookshare BookshareDAISY DAISY DAISYDAISY (Digital Accessible Information System) NGO D

01ⅢⅣⅤⅥⅦⅧⅨⅩ一二三四五六七八九零壱弐02ⅢⅣⅤⅥⅦⅧⅨⅩ一二三四五六七八九零壱弐03ⅢⅣⅤⅥⅦⅧⅨⅩ一二三四五六七八九零壱弐04ⅢⅣⅤⅥⅦⅧⅨⅩ一二三四五六七八九零壱弐05ⅢⅣⅤⅥⅦⅧⅨⅩ一二三四五六七八九零壱弐06ⅢⅣⅤⅥⅦⅧⅨⅩ一二三四五六

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

中国における初現期の都市・都市形成の4段階

<33318FBC89598E81332E696E6464>

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹


Core Ethics Vol. -

表象される奈良: B面の「なら学」のために

井手友里子.indd


J No J. J

藤原京の条坊制‐その実像と意義‐

p _08森.qxd

The structure of chuan yue xiao shuo as a narrative abstract Chuan Yue Xiao Shuo appeared in Chinesewebsite in the late of 1990's as a popular fun fic

Core Ethics Vol. : - : : : -

IPSJ SIG Technical Report Vol.2014-EIP-63 No /2/21 1,a) Wi-Fi Probe Request MAC MAC Probe Request MAC A dynamic ads control based on tra

本文.indd


Vol.57 No

対朝鮮人絹織物移出と繊維専門商社の生産過程への進出

〈評論〉中国映画探訪--高考・成功・精神創傷(入試・出世・心的外傷)

X

はじめに

16_.....E...._.I.v2006


1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth


2

声のことば、文字のことば : 古事記と万葉集から、古代日本の口頭語を考える

(1) a. He has gone already. b. He hasn't gone yet. c. Has he gone yet?

2reA-A08.dvi

スポーツ選手における日常的トレーニングが味覚に及ぼす影響について



Vol.60 No.3 December JACAR Ref. A - -

「向こう見ずな凝視」とフェミニズム

コーパスに基づく言語学教育研究報告 8


5

_09名嶋.indd


国民年金保険料における未納 免除 猶予 追納の分析 Analysis of People's Decision-Making for the Absence of Contribution Payments, the Exemption, the Contribution Postponement


Core Ethics Vol. QOL N N N N N N N K N N

Microsoft Word - hozon-fujimura-HP-伊勢工業高校における造船教育の歴史から学ぶ

Transcription:

計量国語学 アーカイブ ID KK300601 種別 特集 招待論文 A タイトル Webコーパスの概念と種類, 利用価値 語史研究の情報源としてのWebコーパス Title The Concept, Types and Utility of Web Corpora: Web Corpora as a Source of Information for Etymological Studies 著者 田野村忠温 Author TANOMURA Tadaharu 掲載号 30 巻 6 号 発行日 2016 年 9 月 20 日 開始ページ 326 終了ページ 343 著作権者 計量国語学会

30 6 2016 9 pp.326-343. Web Web Web Google Web Web Web 2015a 2015b 2016a 2016b Web 326 2016

BCCWJ 2011 2012 2014 BCCWJ BCCWJ 1,000 BCCWJ Web BCCWJ BCCWJ Web Web Web Web Web Web Web Web 327

Web Web Web Web Web Web Web Web Web Google https://books.google.co.jp/ Google OCR Google Google Web Google Google Google Google Google Web 3.2 Google Google Google Google Google Google Web 328

Google Web http://dl.ndl.go.jp/ http://kokkai.ndl.go.jp/ Google Google Web Web Web Web Web Web Web Web Google 1980 Google Web 1979 Internet Archive https://archive.org/ Wayback Machine Wayback Machine 329

Google 13 1975-2001 - Google 330

Google 1940 15 1954 29 3 1964 39 11 1974 49 2008 20 2014 26 331

1980 SVP CPU central processing unit SVP CPU J63-D 3 1980 55 OS 55 1981 56 Word 1979 1979 54 Web 332

Web Google television Web 20 2008 1920 40 1986 Wikipedia television Wikipedia 1934 television television television Wikipedia 1934 333

Web television 1922 11 8 2 1924 13 television 596 1924 13 10 1 1927 2 4 1 3 2 4 1927 2 2 1 4 1 (2008).NET http://www.nikkoku.net/tomonokai/ 2006 10 25 1928 3 television 5.2 1927 2 12 17 334

4 4 6 1927 2 10 1 12 1 2 12 17 1927 2 12 17 television 1917 6 3 4 1917 6 4 25 1915 4 19 20 television 10 7 1925 14 10 30 335

Television 1927 16 5 8 Bell Laboratory Herbert E. Ives 12 6 1927 16 6 20 Herbert Hoover Bell 24 17 1927 16 9 10 television television Visagraph 15 12 1928 17 1932 21 12 8 television 2008 336

tele- telegraph telephone 4 4 4 4 Tele Vision Vision Tele 4 4 4 337

1929 4 11 3 television tele- television télautographie téléphotographie 21 13 1924 13 tele- 1929 18 27 television television television telegraph telephone television (2007) 1879 12 7 9 1883 3 10 9 2 2 (2007) 338

television television 1932 7 2 24 1933 8 2 24 1934 9 4 23 10 4 1928 3 television 1931 20 15 3 telephone television television telephone (1950) 339

Web Google 4 4 4 Google Google Google 340

Google Google Google 4 Google Google Google Google Web Web Web Web Web 341

5.2 television 1917 1925 1927 2007 19 263-282.. 2007. 2012 BCCWJ 46 59-82.. 2014 BCCWJ 119-151.. 2015a 55 81-137. 2015b Amsterdam 49 9-34.. 2016a 56 123-181. 2016b 50... 2008 95-110.. 1950. 2016 3 13 342

Mathematical Linguistics, Vol.30 No.6 (September 2016) pp.326-343. Invited Paper (A) to the Special Issue The Concept, Types and Utility of Web Corpora: Web Corpora as a Source of Information for Etymological Studies TANOMURA Tadaharu (Osaka University) Abstract: The defining condition of a Web corpus will be that it is a huge amount of text data collected from the Internet. Although Websites such as Google Books, National Diet Library Digital Collections and newspaper archives do not satisfy the condition, they nevertheless cannot be clearly distinguished from typical Web corpora, and thus it may not be groundless to regard them as a type of Web corpus. This article, drawing upon two case studies, will demonstrate that we can easily enhance the level of the description of the history of Japanese as well as Chinese terms of the modern era with the help of information obtainable from those Websites. Keywords: Web corpus, diachronic corpus, modern Japanese and Chinese, etymology, tatiageru (transitivized form of the verb tatiagaru), densi/dianshi (Japanese/ Chinese term for television) 343