



DEIM Forum 2012 F3-5

A Study of Retrieval in Microblogging Based on Person's Aliases

Yutaro YAMAGUCHI, Satoshi SHIMADA, and Tetsuji SATOH

College of Knowledge and Library Sciences, School of Informatics, University of Tsukuba, 1-2 Kasuga, Tsukuba, Ibaraki, 305-8550 Japan
Graduate School of Library, Information and Media Studies, University of Tsukuba, 1-2 Kasuga, Tsukuba, Ibaraki, 305-8550 Japan
E-mail: {yamaguchi,satoh}@ce.slis.tsukuba.ac.jp, sat@slis.tsukuba.ac.jp

Abstract In microblogging, where users can post comments easily and intuitively, people are referred to by a variety of aliases in addition to their personal names. An alias used in a tweet is not merely a means of referring to a person; it also reflects the context and the user's feelings. In this paper, we propose a method that extracts a person's aliases using a search engine and Wikipedia, and we analyze the topic and polarity of the articles in which the aliases appear. Based on the results, we built a system that, when a user inputs a personal name, retrieves the context and polarity of the articles in which that person's aliases appear.

Key words Microblogging, alias, SVM

1. Twitter 1 Twitter Web

1 http://twitter.com/

Wikipedia 2 Wikipedia Web Wikipedia

2. Web [6] [11] [8] 2 SVM Bollegala [1] 5-gram URL URL 2-gram [6] [11] SVM 3 SVM [8] Bollegala [1] SVM [7] [9] [6] Wikipedia Brendan [3] 4 0.725 David [5] Aniket [4] K-means Affinity Propagation [2] 3 idf Affinity Propagation Web [7] Wikipedia

3.

3.1

3.1.1 Wikipedia Wikipedia [8] Wikipedia 1

2 http://ja.wikipedia.org/
3 4 Conference-Board) 6

1 2 Wikipedia 3. 1. 2 2 Wikipedia Wikipedia 3. 1. 3 Wikipedia Wikipedia Wikipedia 3. 1. 4 Wikipedia [6] [11] 2 3. 2 alias fullname alias fullname 3 [6] alias fullname fullname alias fullname alias [11] alias fullname (1) fullname Web N (2) fullname 5 3 candidate fullname 3 5 3. 2 SVM SVM Support Vector Machine SVM SVM 3. 1. 4 [11] SVM alias fullname 6 Dice(fullname, candidate) OverlapC(fullname, candidate) OverlapN(fullname,

candidate) 6 9

\[ \mathrm{Dice}(\mathit{fullname}, \mathit{candidate}) = \frac{\mathrm{Hits}(\mathit{fullname}, \mathit{candidate})}{\mathrm{Hits}(\mathit{name}) + \mathrm{Hits}(\mathit{candidate})} \tag{1} \]

\[ \mathrm{OverlapC}(\mathit{fullname}, \mathit{candidate}) = \frac{\mathrm{Hits}(\mathit{fullname}, \mathit{candidate})}{\mathrm{Hits}(\mathit{candidate})} \tag{2} \]

\[ \mathrm{OverlapN}(\mathit{fullname}, \mathit{candidate}) = \frac{\mathrm{Hits}(\mathit{fullname}, \mathit{candidate})}{\mathrm{Hits}(\mathit{name})} \tag{3} \]

Hits(name, candidate) name AND candidate Hits(name) Hits(candidate) name candidate

8 SVM
(1) Dice(fullname, candidate)
(2) OverlapC(fullname, candidate)
(3) OverlapN(fullname, candidate)
(4) candidate candidate log(cf(candidate))
(5) candidate fn(candidate)
(6) candidate fp(candidate)
(7) candidate bn(candidate)
(8) candidate bp(candidate)

3. 2 3. 3 3. 1 3. 3. 1 [10] 8,500 tweet

\[ pn(\mathit{tweet}) = \frac{\mathit{posi} - \mathit{nega}}{\mathit{posi} + \mathit{nega}} \tag{4} \]

posi tweet nega tweet pn(tweet) ($-1.0 \le pn(\mathit{tweet}) \le 1.0$) tweet 3 1 4 0.5

3. 3. 2 alias date pn_alias(alias, date)

\[ pn\_alias(\mathit{alias}, \mathit{date}) = \frac{1}{n} \sum_{\mathit{tweet} \in T} pn(\mathit{tweet}) \tag{5} \]

T date n T date pn_alias(alias, date) candidate pn(tweet) pn_alias(alias, date) 4 ($-1.0 \le pn\_alias(\mathit{alias}, \mathit{date}) \le 1.0$)

4. 4. 1 3. 4. 2 Wikipedia 5 3. 1 500 1 SVM F

5 http://ja.wikipedia.org/wiki/Category:
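The co-occurrence measures (1)-(3) and the polarity scores (4)-(5) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The hit counts are assumed to be supplied by the caller (for example, from search-engine result counts), posi and nega are assumed to be the numbers of positive and negative dictionary expressions found in a tweet, and returning 0.0 when a denominator is zero is a choice made here for illustration. All function names are hypothetical.

```python
from typing import Iterable


def dice(hits_both: int, hits_name: int, hits_candidate: int) -> float:
    """Eq. (1): co-occurrence hits over the sum of the individual hit counts."""
    denom = hits_name + hits_candidate
    return hits_both / denom if denom else 0.0  # zero-hit fallback is an assumption


def overlap_c(hits_both: int, hits_candidate: int) -> float:
    """Eq. (2): co-occurrence hits normalized by the candidate's hit count."""
    return hits_both / hits_candidate if hits_candidate else 0.0


def overlap_n(hits_both: int, hits_name: int) -> float:
    """Eq. (3): co-occurrence hits normalized by the name's hit count."""
    return hits_both / hits_name if hits_name else 0.0


def pn_tweet(posi: int, nega: int) -> float:
    """Eq. (4): polarity of one tweet, in the range [-1.0, 1.0]."""
    total = posi + nega
    return (posi - nega) / total if total else 0.0  # no polar words: treated as neutral


def pn_alias(tweet_scores: Iterable[float]) -> float:
    """Eq. (5): mean pn(tweet) over the tweet set T for one alias and date."""
    scores = list(tweet_scores)
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # Hypothetical hit counts, for illustration only.
    print(dice(10, 30, 20), overlap_c(10, 20), overlap_n(10, 40))
    # A tweet with 3 positive and 1 negative expressions.
    print(pn_tweet(3, 1))
    print(pn_alias([0.5, -0.5, 1.0]))
```

Keeping the hit counts as plain integers separates the scoring from the retrieval step, so the same functions work with whatever search backend provides the counts.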

1 SVM SVM 2 fullname 6 fullname

\[ \mathit{Precision} = \frac{R}{N}, \qquad \mathit{Recall} = \frac{R}{C}, \qquad \mathit{F\text{-}measure} = \frac{2 \cdot \mathit{precision} \cdot \mathit{recall}}{\mathit{precision} + \mathit{recall}} \tag{6} \]

R SVM N SVM C 10

4. 3 2 2 4 Twitter Search API 4 2011 8 27 0:00:00 9 20 23:59:59 RT URL 2 3 RT URL 224 70 346 313 3 3 1127 259 1389 985 53 34

5. 5. 1 SVM 3. 1. 4 N 300 SVM LibSVM 6 SVM RBF C-SVC C grid search 9 excite 7 NAVER 8 9 1 9 15 867 SVM 4. 2 Wikipedia 5 6 5 Wikipedia 7 8 Wikipedia 1 4 false negative 5 false positive 1 Web faridyu @faridyu 2 3 4

4 2011 6 27 0:00:00 2011 9 26 0:00:00 geocode = 35.67012719,139.8094368,100km
6 http://www.csie.ntu.edu.tw/~cjlin/libsvm/
7 http://tt.excite.co.jp/people/
8 http://person.naver.jp/issue
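The evaluation measures in Eq. (6) can be sketched in a few lines of Python. The reading of the symbols is an assumption based on the usual definitions: R is the number of correct aliases among the SVM outputs, N is the number of aliases the SVM output, and C is the number of correct aliases. The function name and the zero-denominator fallback are hypothetical.

```python
from typing import Tuple


def evaluate(r: int, n: int, c: int) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) as in Eq. (6).

    r: correct aliases among the classifier outputs (assumed meaning of R)
    n: aliases output by the classifier (assumed meaning of N)
    c: correct aliases in the ground truth (assumed meaning of C)
    """
    precision = r / n if n else 0.0
    recall = r / c if c else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    p, rec, f = evaluate(r=7, n=10, c=14)
    print(f"precision={p:.2f} recall={rec:.2f} F={f:.2f}")
```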

4 4 7 5 1 faridyu Twitter Web Web URL 2 3 4 4 5 4 4 5 4 faridyu @faridyu 8 Wikipedia Wikipedia 9 C 1.34 10 8 0.25 1.00 0.50

5 Wikipedia 824 464 80

6 Wikipedia 428 229 168

10
Precision  Recall  F-measure
0.69       0.65    0.67
0.62       0.67    0.64

11
Precision  Recall  F-measure
0.83       0.70    0.76
0.65       0.80    0.71

5 : 7 : 6 : 8 : 5. 2 5 6 5 6 5 9 2 3 6 9 10 13 5 9 8 6 9 7 8 5 9 8 9 2 AKBINGO! 5. 3 7 8 5 pn alias pn alias 0 7 8 1 5. 4 9 10 jquery PHP MySQL DB Twitter API DB 5. 1 489 1208

9 10

6. SVM 0.83

[1] D. Bollegala, Y. Matsuo, and M. Ishizuka. Automatic discovery of personal name aliases from the web. IEEE Transactions on Knowledge and Data Engineering, Vol. 23, No. 6, pp. 831-844, June 2011.
[2] Brendan J. Frey and Delbert Dueck. Clustering by passing messages between data points. Science, Vol. 315, pp. 972-976, 2007.
[3] Brendan O'Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith. From tweets to polls: Linking text sentiment to public opinion time series. ICWSM-2010, 2010.
[4] Aniket Rangrej, Sayali Kulkarni, and Ashish V. Tendulkar. Comparative study of clustering techniques for short text documents. 20th International World Wide Web Conference (WWW2011), p. 111, 2011.
[5] David A. Shamma, Lyndon Kennedy, and Elizabeth F. Churchill. Peaks and persistence: modeling the shape of microblog conversations. Proceedings of the ACM 2011 Conference on Computer Supported Cooperative Work, pp. 355-358, 2011.
[6],. Web. (DBWS2006), 2006.
[7],. blog. 18 (DEWS2007), 2007.
[8],,,. Web. 19 DEWS2008, 2008.
[9],,,,. Weblog. 22, 2008.
[10],,.. 14, pp. 584-587, 2008.
[11], Danushka Bollegala,,. Web. NLP 2, 2007.

21500091