1 3 1.1................................. 3 1.2................................... 4 1.2.1................... 4 1.2.2..................... 4 1.2.3.....



Similar documents
1 AND TFIDF Web DFIWF Wikipedia Web Web AND 5. Wikipedia AND 6. Wikipedia Web Ma [4] Ma URL AND Tian [8] Tian Tian Web Cimiano [3] [

,, WIX. 3. Web Index 3. 1 WIX WIX XML URL, 1., keyword, URL target., WIX, header,, WIX. 1 entry keyword 1 target 1 keyword target., entry, 1 1. WIX [2

独立行政法人情報通信研究機構 Development of the Information Analysis System WISDOM KIDAWARA Yutaka NICT Knowledge Clustered Group researched and developed the infor

untitled


2 3, 4, [1] [2] [3]., [4], () [3], [5]. Mel Frequency Cepstral Coefficients (MFCC) [9] Logan [4] MFCC MFCC Flexer [10] Bogdanov2010 [3] [14],,,

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

Vol.6 No (Aug. 2013) Twitter 1,a) , Twitter Twitter Study of Twitter s Follow Mechanism Based on Network

2. Twitter Twitter 2.1 Twitter Twitter( ) Twitter Twitter ( 1 ) RT ReTweet RT ReTweet RT ( 2 ) URL Twitter Twitter 140 URL URL URL 140 URL URL

IPSJ SIG Technical Report Vol.2014-DBS-159 No.6 Vol.2014-IFAT-115 No /8/1 1,a) 1 1 1,, 1. ([1]) ([2], [3]) A B 1 ([4]) 1 Graduate School of Info

[1] [3]. SQL SELECT GENERATE< media >< T F E > GENERATE. < media > HTML PDF < T F E > Target Form Expression ( ), 3.. (,). : Name, Tel name tel

22 Google Trends Estimation of Stock Dealing Timing using Google Trends

Ł\1.pdf

1 Fogg Fogg Behavior Model [1] information cascade [2] TPO [3] Fig. 2 Target area of this paper. 1 Fig. 1 Fogg b

IPSJ SIG Technical Report Vol.2013-HCI-152 No /3/13 1,a) 1,b) 2,c) / GPS Bluetooth(BT) WiFi BT WiFi 1. Bluetooth WiFi 1 / 1 2 a)

2 2.1 SNS web Facebook Google+ SNS web SNS web HITS ANT(Auction Network Trust) web [4] SNS WEB PageRank HITS HITS web authorities, hubs Pagerank web S

DEIM Forum 2012 E Web Extracting Modification of Objec

1 Broder Navigational URL URL Informational Web Transactional Web Web Web 2 Broder [16] SearchLife Broder [16] Daniel [17] Broder

[2][3][4][5] 4 ( 1 ) ( 2 ) ( 3 ) ( 4 ) 2. Shiratori [2] Shiratori [3] [4] GP [5] [6] [7] [8][9] Kinect Choi [10] 3. 1 c 2016 Information Processing So

main.dvi

([ ]!) name1 name2 : [Name]! name SuperSQL,,,,,,, (@) < >@{ < > } =,,., 200,., TFE,, 1 2.,, 4, 3.,,,, Web EGG [5] SSVisual [6], Java SSedit( ss

: ( 1) () 1. ( 1) 2. ( 1) 3. ( 2)

Microsoft Word - toyoshima-deim2011.doc

main.dvi

IT,, i

IPSJ SIG Technical Report Vol.2013-ICS-172 No /11/12 1,a), 1,b) Anomaly Detection 1. 1 Nagoya Institute of Technology 1 Presently with Nagoya In

3 2 2 (1) (2) (3) (4) 4 4 AdaBoost 2. [11] Onishi&Yoda [8] Iwashita&Stoica [5] 4 [3] 3. 3 (1) (2) (3)

WISS 2018 [2 4] [5,6] Query-by-Dancing Query-by- Dancing Cao [1] OpenPose 2 Ghias [7] Query by humming Chen [8] Query by rhythm Jang [9] Query-by-tapp

DEIM Forum 2017 E Netflix (Video on Demand) IP 4K [1] Video on D

20mm 63.92% ConstantZoom U 5

48_14_11.dvi

再発見を試みるユーザ 入力閲覧ページ出力同位ページ 以前に閲覧したページ 同位ページの推定 2. 1 [4], [13] Dubroy [4] [13] 4 [1], [2], [8], [10], [12] Nshmoto [8] Capra [2] Exact Path Su

FIT2014( 第 13 回情報科学技術フォーラム ) RD-002 Web SNS Yuanyuan Wang Gouki Yasui Yuji Hosokawa Yukiko Kawai Toyokazu Akiyama Kazutoshi Sumiya 1. Twitter 1 Facebo

2

27 AR

Web情報検索の新技術と動向

( 1) 3. Hilliges 1 Fig. 1 Overview image of the system 3) PhotoTOC 5) 1993 DigitalDesk 7) DigitalDesk Koike 2) Microsoft J.Kim 4). 2 c 2010

1. [5] Wikipedia 4. ( ) Wikipedia 5. 3 ( ) ( ) ( ) Wikipedia ( ) ( ) 2.2 Global Database of Events, Language and Tone (GDELT) Global Datab

3807 (3)(2) ,267 1 Fig. 1 Advertisement to the author of a blog. 3 (1) (2) (3) (2) (1) TV 2-0 Adsense (2) Web ) 6) 3

( : A8TB2163)

AHPを用いた大相撲の新しい番付編成


main.dvi

Microsoft Word - deim2011_new-ichinose doc

. Yahoo! 1!goo 2 QA..... QA Web Web [1]Web Web Yin [2] Web Web Web. [3] Web Wikipedia 1 2

Vol.20, No.1, 2018 Castillo [10] Yang [11] Sina Weibo 3 Castillo [10] Twitter 4 Twitter [12] Twitter ) 2 Twitter [13] 3. Twitter Twitter 3

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

DEIM Forum 2010 A Web Abstract Classification Method for Revie

2reN-A14.dvi

Honda 3) Fujii 4) 5) Agrawala 6) Osaragi 7) Grabler 8) Web Web c 2010 Information Processing Society of Japan

WII-D 2017 (1) (2) (1) (2) [Tanaka 07] [ 04] [ 10] [ 13, 13], [ 08] [ 13] (1) (2) 2 2 e.g., Wikipedia [ 14] Wikipedia [ 14] Linked Open

2 JAWS web web Share = authorities ReShare = (hubs SNS i j i authorities j i i hubs 1 User i 情報が j によってシェアされる (authorities) j の情報をシェアする (hu

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

卒論タイトル

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

1: 2: 3: 4: 2. 1 Exploratory Search [4] Exploratory Search 2. 1 [7] [8] [9] [10] Exploratory Search

1 4 4 [3] SNS 5 SNS , ,000 [2] c 2013 Information Processing Society of Japan


2015 9

教師情報を必要としないWebページ群のコンテンツ自動抽出ツールの提案

DEIM Forum 2012 E8-4 Wikipedia y

1 Web,.,, Web..,, Web.,,,.,,,., CGI.,, Web, Web.,,. PC,,.

3 Venue Venue Venue Venue Venue Venue SNS [2] Venue Venue [3] Venue Venue Venue [4] / Venue [5] Venue Venue Foursquare Venue Foursquare

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

Berchtold 1) ActiServ 1 ALKAN Fig. 1 ALKAN overview Herren 2) 20 HASC Challenge 3) HASC Challenge ALKAN 4),5) ALKAN i

1.0, λ. Holt-Winters t + h,ỹ t ỹ t+h t = ỹ t + hf t.,,.,,,., Hassan [5],,,.,,,,,,Hassan EM,, [6] [8].,,,,Stenger [9]. Baum-Welch, Baum-Welch (Incremen

2009 2

main.dvi

IPSJ SIG Technical Report Vol.2014-MBL-70 No.49 Vol.2014-UBI-41 No /3/15 2,a) 2,b) 2,c) 2,d),e) WiFi WiFi WiFi 1. SNS GPS Twitter Facebook Twit

DEIM Forum 2009 E

”‰−ofiI…R…fi…e…L…X…g‡ðŠp‡¢‡½„�“õ„‰›Ê‡Ì™ñ”¦

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

[12] Qui [6][7] Google N-gram[11] Web ( 4travel 5, 6 ) ( 7 ) ( All About 8 ) (1) (2) (3) 3 3 (1) (2) (3) (a) ( (b) (c) (d) (e) (1

IPSJ SIG Technical Report Vol.2010-SLDM-144 No.50 Vol.2010-EMB-16 No.50 Vol.2010-MBL-53 No.50 Vol.2010-UBI-25 No /3/27 Twitter IME Twitte

<> <name> </name> <body> <></> <> <title> </title> <item> </item> <item> 11 </item> </>... </body> </> 1 XML Web XML HTML 1 name item 2 item item HTML

和文タイトル

DEIM Forum 2019 H2-2 SuperSQL SuperSQL SQL SuperSQL Web SuperSQL DBMS Pi

Lyra X Y X Y ivis Designer Lyra ivisdesigner Lyra ivisdesigner 2 ( 1 ) ( 2 ) ( 3 ) ( 4 ) ( 5 ) (1) (2) (3) (4) (5) Iv Studio [8] 3 (5) (4) (1) (

DEIM Forum 2014 B Twitter Twitter Twitter 2006 Twitter 201

DEIM Forum 2015 F8-4 Twitter Twitter 1. SNS

DEIM Forum 2014 P3-3 A Foreseeing System of Search Results based on Query Operations on the Graph Interface

_314I01BM浅谷2.indd

地図操作 操作解析 チャンク判定 アプリケーション判定 提示画面決定 1 緯度経度情報抽出 注記 DB 領域 DB Gemini (Geographical Enhanced Map Interface for Navigation on the Internet)

untitled

Web 1 q q Step1) Twitter Step2) (w i, w j ) S(w i, w j ) Step3) q I Twitter MeCab[6] URL 2.2 (w i, w j ) S(w i, w j ) I w i w

和文タイトル

untitled

IPSJ SIG Technical Report Vol.2009-DBS-149 No /11/ Bow-tie SCC Inter Keyword Navigation based on Degree-constrained Co-Occurrence Graph

,,, Twitter,,, ( ), 2. [1],,, ( ),,.,, Sungho Jeon [2], Twitter 4 URL, SVM,, , , URL F., SVM,, 4 SVM, F,.,,,,, [3], 1 [2] Step Entered

SICE東北支部研究集会資料(2017年)

1 Web DTN DTN 2. 2 DTN DTN Epidemic [5] Spray and Wait [6] DTN Android Twitter [7] 2 2 DTN 10km 50m % %Epidemic 99% 13.4% 10km DTN [8] 2

Vol. 46 No. SIG 13(TOD 27) ) ),4) 4) 1 2 5) 2 Cutting Scatter/Gather 6),7) Fractionation 6) Leuski 8)


2014 2


SERPWatcher SERPWatcher SERP Watcher SERP Watcher,

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

14 2 5

GUI(Graphical User Interface) GUI CLI(Command Line Interface) GUI

Transcription:

2012 STUDIES ON RANKING DOCUMENTS WITH QUERY-INTENT SENSITIVITY 11R3129 Shota HATAKENAKA

1 3 1.1................................. 3 1.2................................... 4 1.2.1................... 4 1.2.2..................... 4 1.2.3................ 5 1.2.4......... 5 1.3................................. 5 1.4................................... 6 2 8 2.1.................................... 8 2.2..................... 9 2.2.1........................ 9 2.2.2............................. 10 2.3............................... 10 2.3.1 PageRank...................... 10 2.3.2..................... 11 2.4..................................... 13 2.4.1............................... 13 2.4.2............................... 13 2.4.3................................. 14 2.5..................................... 14 3 PageRank 17 3.1.................................... 17 3.2........................... 18 3.3............................ 19 3.4 PageRank.......................... 19 3.5........... 20 3.5.1 Topic Sensitive PageRank..................... 20 3.5.2..................... 21 1

3.5.3............ 23 3.6..................................... 23 3.6.1............................... 23 3.6.2............................... 24 3.6.3................................. 25 3.7..................................... 26 4 30 4.1................................... 30 4.2................................. 31 4.3.................................. 31 4.4..................................... 32 4.4.1............................... 32 4.4.2............................... 32 4.5..................................... 33 4.6..................................... 34 5 36 5.1.................................... 36 5.2 Rocchio............................ 37 5.3................................... 38 5.3.1............................ 38 5.3.2........................... 39 5.3.3................................. 40 5.4..................................... 40 5.4.1............................... 40 5.4.2............................... 40 5.4.3............................... 41 5.4.4................................. 41 5.5..................................... 41 6 48 49 50 2

1 1.1,,,,, Web, Wiki, blog, twitters,, (query),, (term-matching) (term frequency), (inverse document frequency), TF*IDF,,,,,,,,,,.,,,,,. 3

1.2,. 1.2.1,,, Web (theme) (authoritative) (distributive),, Web PageRank 2 10 90 [1]. 1.2.2,, Web PageRank HITS,??,,, Topic Sensitive PageRank 10 4 Topic Sensitive PageRank [2] 4

1.1: 1.2.3?? Bhattacharyya.[3] 1.2: 1.2.4 5

, Rocchio 5 3 Rocchio [4] 1.3 2 3 PageRank 4 5 6 1.4 1., : 3 (DEIM ), 2011. PageRank 2., : Ranking Documents using Similarity-based PageRanks, IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PacRim), 2011. PageRank 6

3., : PageRank, 4 (DEIM ), 2012., 2, 4., : Ranking Documents with Query and Topic Sensitivity, 7th International Conference on Digital Information Management (ICDIM ), 2012., 2, 5., : Query and Topic Sensitive PageRank for General Documents, 14th IEEE International Symposium on Web Systems Evolution (WSE), 2012., 2, 6.,, :, 11 FIT, 2012.,,, Bhattacharyya.,Bhattacharyya,. 7.,, : Ranking Documents with Query-Topic Sensitivity, International Workshop on Web Information Retrieval Support Systems in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology(WIRSS), 2012.., 7

. 8.,, :, 5 (DEIM ), 2013. Web 8

2 PageRank 2.1,,, Web, Wiki, blog, twitters,, (query),, (term-matching) (term frequency) (inverse document frequency) TF*IDF,, ( ) 9

, Web (theme) (authoritative) (distributive),,,,,,,,,,, Web PageRank HITS, Web PageRank 2.2 2.2.1 d d =< v 1,..., v n > i = 1,..., n w i v i 2 TF*IDF q d i 10

cos(d i q) cos(d i q) cos(d i, q) = d i q d i q q d 1, d 2 d n 2.2.2 2.3 2.3.1 PageRank PageRank Web Web Web PageRank Web PageRank A B PageRank 11

P i PageRank P R i P j PageRank P R j P R i = P j B Pi P R j P R j B Pi P i P R j P j PageRank 1 n PageRank H 1 n t t = t+1 H H 2 PageRank H Web 2 1 1 Web PageRank 1 Web Web 2 Web Web 2 H H H = (1 d)h + d N N Web d 2.3.2 Web Web PageRank Web PageRank Web 12

d i d j cos(d i, d j ) = d i d j d i d j 0 PageRank 2 d 0 d 1 d n N ( ) d 1 d 0 r 01 r 01 = 1 N r 01 d 1 d n d 0 d 0 d 1 d n d 1 d 0 rw 01 rw 01 = w 01 ni=1 w 0i w 01 d 0 d 1 d 0 d 0 1 1 0 PageRank 2 1/N N M M = (1 d)m + d d 0.15 M PageRank 13

A B B C A C ( ) 2.4 2.4.1 2009 1 6 1 1 1 PageRank 10 10 PageRank 10 10 r 1 r 2 10 Sim(r 1, r 2 ) = (A B) k A B r 1 r 2 k PageRank 10 10 2.4.2 1 PageRank PageRank 2 3 PageRank 10 1 10 2 10 10 9 3 90 1 4 2 4 14

1 9996 10000 2.4.3 1 2 10 90 1 2 4 9 4 9 1 9997 10000 1 (PageRank) Web PageRank 2.5 Web 15

2.1: PageRank Top10 PR 1 0.032 2 0.031 3 0.030 4 0.0307 5 0.0294 6 0.0290 7 0.0288 8 0.0288 9 0.0286 10 0.0284 9996 0.001 9997 0.001 9998 0.001 9999 0.001 10000 0.001 16

2.2: TOP10 1 1436.3933 2 1379.4339 3 1348.871501 4 1344.996201 5 1270.10094 6 1258.451663 7 1248.997552 8 1244.844031 9 1242.542849 10 1238.969157 2.3: k Sim (PageRank, ) 10 0.9 17

3 PageRank, 2, 3.1,,,,, Web, Wiki, blog, twitters,,, (query),, (term-matching) (term frequency), (inverse document frequency), TF*IDF,,,,,,,,,, ( ),, 18

,,, (Topic Driftting), Web PageRank HITS,,,, Web PageRank 3.2 2 1 2 19

2 3.3 d d =< v 1,..., v n > i = 1,..., n w i v i 2 TF*IDF q d i cos(d i q) cos(d i q) cos(d i, q) = d i q d i q q d 1, d 2 d n 3.4 PageRank PageRank PageRank PageRank A B PageRank 20

d i PageRank P R i d j PageRank P R j P R i = d j B di P R j P R j B di d i P R j d j PageRank 1 n PageRank H 1 n t t = t+1 H H 2 PageRank H Web 2 1 1 PageRank 1 2 2 H H H = (1 d)h + d N N Web d 3.5 3.5.1 Topic Sensitive PageRank Topic Sensitive PageRank [7] Web ODP ODP PageRank 2 PageRank ODP c j PageRank d PageRank rank jd q PageRank c j q P (c j q ) P (c j q ) = P (c j) P (q c j ) P (q ) P (c j ) P (q c j ) 21

s qd s qd = n P (c j q ) rank jd j ODP 3.5.2 [?] PageRank 2 d i t j T ji T ji = d i t j d i t j t j t j s j t j s j d i d j w ij w ij = d i d j d i d j 0 22

PageRank 2 d 0 d 1 d n N ( ) d 1 d 0 r 01 r 01 = 1 N r 01 d 1 d n d 0 d 0 d 1 d n d 1 d 0 r 01 r 01 = w 01 ni=1 w 0i d 0 d 0 1 1 PageRank PageRank 2 1/N N t j M M = (1 d)m + d d PageRank 0.15 M PageRank s j PageRank PageRank s j d i PageRank P R ji P R ji = s j nk=1 s k P R ji s j d i PageRank P R ji 23

3.5.3 t j t j q m s j Q mj s j w j Q jm = q m w j q m w j Q jm q m d i n v mi = T ji P R ji Q jm j=1 q m d i v mi PageRank [?] 3.6 3.6.1 24

8 ( ) 2010 1 6 1000 1 1 2010 1 6 8 PageRank Topic Sensitive PageRank Topic Sensitive PageRank PageRank 2 ODP 8 PageRank 8 1000 8 8 988 3.6.2 3.1: TSPR MYPR TSPR MYPR 3 6 8 9 9 10 15 15 15 2,3,5 3,4,5 5 10 11 11 5 8 10 9 9 9 8 4 8 5 5 6 11 10 11 1 Topic Sensitive PageRank 1000 25

10 4 Topic Sensitive PageRank 4 2 Topic Sensitive PageRank 3.6.3 1 Topic Sensitive PageRank 3 694 Topic Sensitive PageRank 694 2 3 7 Topic Sensitive PageRank 694 694 Topic Sensitive PageRank 694 3 2 4 7 694 694 6 988 Topic Sensitive PageRank 988 2 5 8 Topic Sensitive PageRank 988 988 988 Topic Sensitive PageRank 988 2 6 8 988 988 4 26

3.2: (TSPR) 4.56E-11 0 0 3.55E-10 0 0 2.44E-08 0 (MYPR) 0.1326 0.1509 0 0.1324 0.1492 0.1358 0.1399 0.1294 (TSPR) 0 0 0 0 0 0 1.07E-06 3.41E-08 (MYPR) 0.1388 0.1331 0 0.1222 0.14063 0.15065 0.1591 0.1552 Topic Sensitive PageRank 1 Topic Sensitive PageRank ODP Web 3.7 27

3.3: Topic Sensitive PageRank PageRank ( ) 677 0.01628549 0.01543551 0.01642703 0.01434725 0.01414995 0.0158754 0.05786429 0.01602105 694 0.02621034 0.02478334 0.02624309 0.02300665 0.02275057 0.02519013 0.06747065 0.02585367 707 0.0279393 0.02634675 0.02806745 0.02461977 0.02420318 0.02692705 0.06919561 0.02758511 740 0.02331051 0.02204983 0.02318837 0.02055867 0.02012315 0.02235665 0.06482445 0.02300719 748 0.02692198 0.02539224 0.02695318 0.02380562 0.02334913 0.02608048 0.06836908 0.02664311 768 0.02320812 0.02191476 0.02323929 0.02053375 0.0201492 0.02245379 0.06472778 0.02300338 770 0.02416074 0.02280096 0.02406911 0.02120447 0.0209023 0.02313225 0.06550857 0.02380637 819 0.02508562 0.02368776 0.02501543 0.02208009 0.02170155 0.02411804 0.06649766 0.02476353 3.4: PageRank * ( ) 677 0.000441584 0.000433684 0 0 0.00034025 0.000388661 0.00144729 0.000433972 694 0.000977919 0.000798366 0 0.000398389 0 0.000732579 0.002108727 0.000688847 707 0.001440559 0.001033526 0 0 0.000820688 0.001343557 0.00249171 0.001186773 740 0.000951179 0.000591478 0 0.000516558 0 0.000810154 0.002673234 0.000597504 748 0.00110303 0 0 0.001230758 0.0009621 0.001061925 0.002487807 0.000899565 768 0.001117554 0.000452364 0 0.000600762 0.000757061 0.000582189 0.001677901 0.0005313 770 0.000867845 0.000759111 0 0 0.000448544 0.000682351 0.002246525 0.000590367 819 0.001420582 0.000596109 0 0.000720661 0 0.000958302 0.002508245 0.000889678 3.5: Topic Sensitive PageRank PageRank ( ) 692 0.02374389 0.02237004 0.02367169 0.02091047 0.02048126 0.02284057 0.0650916 0.02348498 712 0.01978951 0.01869293 0.01978766 0.01746852 0.01709531 0.01899625 0.06135691 0.019642 721 0.02067844 0.01954869 0.02058044 0.0182153 0.01788259 0.01993703 0.06221481 0.02041511 730 0.01943435 0.01839565 0.01966527 0.01715896 0.01693975 0.0189899 0.06083285 0.01911198 742 0.02041298 0.01924279 0.02033717 0.0180109 0.01761391 0.01968302 0.06192326 0.02019657 775 0.00912514 0.00861247 0.00914079 0.00808068 0.00788348 0.00883151 0.050626 0.00898688 782 0.02532507 0.02394848 0.02524539 0.02227661 0.02184676 0.02437671 0.06665876 0.0249562 988 0.02325761 0.02185824 0.02330936 0.02057891 0.02016374 0.02250221 0.02282538 0.06228756 28

3.6: PageRank * ( ) 692 0.001156051 0.000665472 0 0 0 0.000723613 0.001425498 0.00098955 712 0.001026558 0 0 0 0 0.000909314 0.002742536 0.000951096 721 0.000461732 0 0 0.000635344 0.000684857 0.000524384 0.001514467 0.000426173 730 0.000671832 0.000554855 0 0 0.000478327 0.000645225 0.001602642 0.000567606 742 0 0 0 0.000795937 0.000786941 0.000552131 0.001344852 0.000469283 775 0.000226757 0 0 0 0.000194107 0.000449187 0.00087587 0.000324726 782 0.001061882 0.000595193 0 0.000446785 0.000660955 0.001168546 0.00152643 0.000711374 988 0.000565488 0 0 0.000725272 0 0.001175026 0.001189103 0.000974241 3.7: TD 1 677 694 707 740 748 768 770 819 29

3.8: TD 1 692 712 721 730 742 775 782 988 30

4 4.1 Blog twitter 31

4.2 ( ) ( ) 4.3 1 2 Bhattacharyya [6] Bhattacharyya Bhattacharyya 1 L = m u=1 P u Q u p q m u=1 P u = m u=1 Q u = 1 blog twitter Bhattacharyya Bhattacharyya 2 d q = N n=1 tf i Bha i N q d d q tf i d i Bha q i q i Bhattacharyya N d q q q Bhattacharyya q Bhattacharyya 0 Bhattacharyya 32

q Bhattacharyya q Bhattacharyya Bhattacharyya Bhattacharyya q q Bhattacharyya Bhattacharyya 0 Bhattacharyya Bhattacharyya q q d 4.4 4.4.1 2010 1 2 3 1 9,911 1 20 1,000 6,788 10 4.4.2 Bhattacharyya 4.1 Bhattacharyya 4.2 Bhattacharyya 4.3 22 5 4.4 22 15 3 ID2675 33

4.1: Bhattacharyya Bhattacharyya ID 1 2644 0.828794 2653 0.800367 3899 0.756451 1929 0.742017 6489 0.722026 4278 0.720092 6384 0.669799 2056 0.658522 5731 0.648716 4338 0.639959 2281 4.2: Bhattacharyya ( ) Bhattacharyya ID 1.828794138 2644 1.828794138 2653 1.30290872 3974 1.272601265 4953 1.262531914 4789 1.236375728 1930 1.225277334 6021 1.194493536 1101 1.131219219 5742 1.117314461 4718 1.114044308 2057 1.109182 3057 4.5 4.1 4.2 4.3 Bhattacharyya 4.3 Bhattacharyya Bhattacharyya Bhattacharyya 34

4.3: Bhattacharyya ( ) Bhattacharyya ID 2.435927321 2644 2.416154491 2653 2.194493536 1101 1.590458825 3068 1.58998331 5460 1.559216392 4782 1.53386505 2032 1.481068325 2601 1.477399884 4783 1.476763755 6489 1.461201778 4278 1.436617236 2460 1.431435544 3899 4.4 Bhattacharyya Bhattacharyya Bhattacharyya 4.6 Bhattacharyya Bhattacharyya 35

4.4: 5 ID: 1 2675: 31 5 9 25 2 5072: 2 3 4797: 21 7 11 4 4 4955: 28 5 4960: 28 36

5 5.1 Blog twitter,,,, ( ) ( ) 37

3 Rocchio 5.2 Rocchio d d =< v 1,..., v n > i = 1,..., n w i v i 2 TF*IDF q d i cos(d i q) cos(d i q) 38

cos(d i, q) = d i q d i q q d 1, d 2 d n Rocchio [15] q D r D n TF*IDF q q q = q + D R d i D R d i D N d j D N d j D R D N 5.3 5.3.1 1 Bhattacharyya [?] Bhattacharyya Bha = m u=1 P u Q u (0 Bha 1) p q ( m u=1 P u = m u=1 Q u = 1) blog twitter 39

Bhattacharyya Bhattacharyya q q i BC iq 1 BC iq = Bha iq log( ) CO iq Bha iq q i Bhattacharyya CO iq q i 2 Bha CO Bha Bha Bha CO iq CO iq = c a + b c a i b q c i q 5.3.2 q q BC dr N BC iq,n+1 = BC iq,n (1 + w i,d + N + w i,d N ) BC i,n n q i N N + n w i,d + w i,d i i n+1 j dr j,n+1 Nn=1 tf i,j BC iq,n+1 dr j,n+1 = N tf i,j j i N j j CO j CO j = Nn=1 tf i,j CO iq N 40

5.3.3 1. q i BC iq 2. N BC i,n 3. 5.4 5.4.1 2010 1 3 1 7,999 1 20 1,000 6,788 5.4.2 Rocchio 3 =1.0 =0.8 =0.1 7 41

5.4.3 1 Rocchio 3 top10 R) Rocchio 3 5 3 Rocchio 5.4.4 3 7815 2 Rocchio 5 ( ), 1, 4 5 Rocchio 6 5.5 42

5.1: Rocchio FB[0] FB[1] FB[2] FB[3] R) 8 9 10 10 R) 2 7 7 7 R) 1 3 1 1 R) ( ) 0 0 0 0 R) ( ) 3 9 9 10 5 8 8 10 1 5 6 10 1 1 1 2 ( ) 2 3 4 4 ( ) 5 10 10 10 43

5.2: Rocchio 6776 6812 6816 5728 SMAP 6393 6832 6765 5 6 6 10 9 10 SMAP 20 19 7, 44

5.3: 6765 7815 6776 6832 5773??NY, 9 7 5 7, 21 6055 ASEM 6923 3 ASEM 2 11 45

5.4: Rocchio 1663 3686 7551 11?? 1020 1197 PT 24 836?? 1188 10 3 10 12 10 11 11, 18 PT 24 3 18 3 19 24 46

5.5: 3686 4500 4731 10 4268?? 194 4616 3591 12, 25 4 10 14 10 NHK 35 34 30 8 40 9 47

5.6: 2 4500 4268?? 4320 3 4731 10 3686 4351 4235, 25 14 16 3 4 10 12 18 12 48

6,. Web,,,,,,,,, 49

50

[1] Hatakenaka, S. and Miura, T.: Ranking Documents using Similarity-based Page- Ranks, IEEE Pacific Rim Conference on Communications, Computers and Signal Processing(PacRim), 2011. [2] Hatakenaka, S. and Miura, T.: Query and Topic Sensitive PageRank for General Documents, 14th IEEE International Symposium on Web Systems Evolution(WSE), 2012. [3] :, 11 FIT, 2012. [4] :, 5 (DEIM), 2013. [5] S. Brin and L. Page The anatomy of a large scale hypertextual Web search engine. ComputerNetworks and ISDN Systems, 30, 107-117, 1998. [6] Oren. Kurland and Lillian Lee PageRank without hyperlinks: Structural reranking using links induced by language models. Proceedings of the 28th annual international ACM SIGIR, 2005. [7] T. H. Haveliwala Topic-sensitive PageRank. Proceedings of the 11th international conference on World Wide Web, 2002. [8] Amy N. Langville, Carl D. Meyer,,, Google PageRank 2009 [9] J. Kleinberg Bursty and hierarchical structure in streams. Proc. 8th SIGKDD,2002, 91-101. [10] Masaya Murata, Hiroyuki Toda, Yumiko Matsuura and Ryoji Kataoka, A Query Expansion Method Using Access Concentration Sites in Search Result Proceedings of the DataBase and Web symposium, 2007. 51

[11] Hang Cui, Ji-RongWen, Jian-Yun Nie andwei-yingma, Probabilistic Query Expansion Using Quer Logs Proceedingsof the 11th international conference on World Wide Web 2002, 325-332 [12] Georges E. Dupret and Benjamin Piwowarski. A user browsing model to predict search engine click data from past observations ACM SIGIR2008.331-338, [13] KMamoru Komachi, Shimpei Makimoto, Kei Uchiumi, and Manabu Sassano. Learning semanticcategories from clickthrough logs ACLIJCNLP 2009.189-192, [14] Qingshan LIU and Dimitris N METAXAS Unifying Subspace and Distance Metric Learning with Bhattacharyya Coefficient for Image Classification Lecture Notes in Computer Science 2009 Volume 5416/2009 254-267, [15] Rocchio, J.J Relevance fssdback in information retrieval. The SMART Retrieval Systems, pp313-323, Prentice-Hall, 1971. 52