

The Function and Practical Use of a Full-Text Search System

Satomi WATANABE
swatan14@cs.reitaku-u.ac.jp
Department of International Economics, Faculty of International Economics, Reitaku University

Abstract
The spread of the Internet, typified by information retrieval on the WWW, has made a wide variety of information easy to obtain. However, as the amount of information has grown, it has become harder to find the information that is actually needed. Information retrieval systems that extract the required information on the Internet or an intranet are therefore attracting attention. This paper examines systems built on a full-text search function. The main part explains the knowledge needed to use a full-text search system. It then presents an example that uses Namazu, a representative free full-text search program, to build a mailing-list search engine that links full-text search with the WWW.

Keywords: information retrieval, full-text search system, Namazu, morphological analysis

DBMS 1 web () 2 UNIX grep 3 grep 1 2

5 () () URL URL URL 263

[1] Lycos spider() Infoseek InfoSeek Robot HTML HTML XML PDF Word ChaSen KAKASI N web CGI

() KAKASI[2] KAKASI(kanji kana simple inverter) 4 KAKASI SKK 5 [3] LARGE LARGE [4] ()

JUMAN[5][6] ChaSen[7] JUMAN version 2.0 [8] [9] NTT ( NTT ) ()[10] Breakfast [11] Windows SuperMorpho-J[12] N 6 1 1 N 6 N N N N ()

Example sentence: 東京都の明日の天気予報を確認する

N=1 (uni-gram)    N=2 (bi-gram)    N=3 (tri-gram)
東                東京             東京都
京                京都             京都の
都                都の             都の明
の                の明             の明日
明                明日             明日の
日                日の             日の天
の                の天             の天気
天                天気             天気予
気                気予             気予報
予                予報             予報を
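The N-gram table above can be reproduced with a few lines of code. The following is a minimal sketch, not the indexing code of any particular search system; the function name `char_ngrams` is hypothetical and chosen only for illustration. It slides a window of N characters across the example sentence one character at a time.

```python
def char_ngrams(text: str, n: int) -> list[str]:
    """Return every run of n consecutive characters in text, left to right."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # A window of width n can start at positions 0 .. len(text) - n.
    return [text[i:i + n] for i in range(len(text) - n + 1)]


# Example sentence taken from the table above.
sentence = "東京都の明日の天気予報を確認する"

for n in (1, 2, 3):
    # Show only the first ten entries, matching the rows of the table.
    print(f"N={n}:", " ".join(char_ngrams(sentence, n)[:10]))
```

Because the window advances by a single character, no dictionary or morphological analysis is involved, at the cost of producing many fragments that are not words.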

1 1 (2 3 ) patricia Patricia Practical Algorithm To Retrieve Information Coded In Alphanumeric () 7

Example: A man in the room.

List No.   Semi-infinite string
1          A man in the room.
2          man in the room.
3          an in the room.
4          n in the room.
5          in the room.
6          n the room.
7          the room.
8          he room.
9          e room.
10         room.
11         oom.
12         om.
13         m.

List No.   Semi-infinite string
1          A man in the room.
2          an in the room.
3          e room.
4          he room.
5          in the room.
6          m.
7          man in the room.
8          n in the room.
9          n the room.
10         om.
11         oom.
12         room.
13         the room.
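The two lists above can be generated mechanically. The sketch below is illustrative only and does not build a Patricia tree; it produces just the semi-infinite strings and their sorted order. It assumes, from the example, that a semi-infinite string starts at every letter or digit and never at a space or the final period. The function name `semi_infinite_strings` is hypothetical.

```python
def semi_infinite_strings(text: str) -> list[str]:
    """Return the suffix of text starting at each alphanumeric character."""
    # Assumption drawn from the example: spaces and punctuation are not
    # used as starting positions.
    return [text[i:] for i, ch in enumerate(text) if ch.isalnum()]


text = "A man in the room."

# First list: suffixes in order of their starting position.
by_position = semi_infinite_strings(text)

# Second list: the same suffixes sorted by character code, so the
# uppercase "A" precedes every lowercase letter.
in_sorted_order = sorted(by_position)

for number, suffix in enumerate(in_sorted_order, start=1):
    print(number, suffix)
```

Once the strings are sorted, all entries that begin with a given query string sit next to each other, which is what makes prefix lookup over the list efficient.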

[13] Inktomi Search Software 4.0 (UltraSeek) Inktomi Search Software 4.0 Ultraseek Inktomi Ultraseek Digital Garage Inktomi Japan MS-Word Excel PowerPoint PDF Verity Information Server Verity Information Server (Search 97 Information Server) VERITY (Super Morpho-J) Namazu[14] Namazu 5 Namazu NTT DoCoMo web

Freya [15] Freya Namazu Namazu Namazu Namazu [14] Windows UNIX OS Web WWW Namazu Namazu Namazu [16][17] Linux FreeBSD Solaris UNIX Windows OS/2 Namazu CGI GUI X Window System Windows web CGI (namazu.cgi) namazu CGI Namazu Namazu nkf() Perl KAKASI( ) ChaSen() MHonArc nkf Perl Namazu C Perl KAKASI ChaSen Windows MS-Word MS-Excel PDF

MHonArc RFC822 MIME HTML 6 Namazu 1 PC CPU Pentium4 1.5GHz 256MB 80G WWW Apache Apache UNIX [18] [19] WWW CGI WWW 5 Namazu Namazu Perl

5 Perl UNIX OS GNU nkf 1.9 8 1.72 KAKASI Text-KAKASI KAKASI Perl Text-KAKASI KAKASI 2 File-MMagic File-MMagic Namazu CPAN 9 MHonArc MHonArc MHonArc [17] Namazu Namazu Namazu 8 nkf1.9 Namazu 9 Comprehensive Perl Archive Network gcc make GNU Make csh (tcsh) sh (bash) impression office[18] 1 1 1 HTML MHonArc

HTML HTML nkf EUC [19] EUC Shift-JIS JIS 1 HTML HTML HTML 3 web Namazu CGI WWW WWW Namazu Apache CGI Namazu CGI Namazu Namazu

                                    1999         2000         2001
Number of documents (files)          821        2,910        2,380
Document size (KB)                 4,076       14,260        8,404
Indexing time (seconds)              690        2,537        2,056
Index size (bytes)             2,867,825   10,145,199    8,518,327
Number of keywords                13,882       32,964       27,861


Namazu (KAKASI ChaSen ) [23] 1999 AND/OR [24] URL

[1] The Web Robots Pages http://www.robotstxt.org/wc/robots.html
[2] KAKASI http://kakasi.namazu.org/
[3] SKK http://openlab.ring.gr.jp/skk/index-j.html
[4] http://www.kusastro.kyoto-u.ac.jp/~baba/dic/free-dic.html
[5] JUMAN http://pine.kuee.kyoto-u.ac.jp/nl-resource/juman.html
[6] JUMAN version 1.0 http://www.naklab.dnj.ynu.ac.jp/~komachi/manual/maincont2.html
[7] http://chasen.aist-nara.ac.jp/index.html.ja
[8] http://cactus.aist-nara.ac.jp/lab/nlt/vi4ma.html
[9] http://www.t.onlab.ntt.co.jp/sumomo/index.html
[10] http://www.iijnet.or.jp/edr/j_index.html
[11] Breakfast http://www.labs.fujitsu.com/free/breakfast/index.html
[12] SuperMorpho-J http://www.omronsoft.co.jp/sp/embedded morpho/
[13] http://www.kusastro.kyoto-u.ac.jp/~baba/wais/other-system.html
[14] Namazu http://www.namazu.org/
[15] Freya http://www.ingrid.org/ja/project/freya/
[16] Namazu 2001
[17] 1998
[18] TA NO.3pp.32001
[19] NO.1pp.22001
[20] MHonArc http://www.mhonarc.org/
[21] impression office http://www.asi.co.jp/imoffice/
[22] MHonArc http://www.shiratori.riec.tohoku.ac.jp/~p-katoh/hack/docs/mhonarc-jp/
[23] http://www.gengokk.co.jp/zenbun.htm
[24] RCAAU http://www.kuamp.kyoto-u.ac.jp/labs/infocom/mondou/index.html
[25] 10 pp.90-99 1999 http://www.ftsanet.com/dbtokyo99/Db99.htm
[26] 1998