



Vol. 26, No. 4 SP-C (2011), p. 504

Building up Ontologies with Many Properties from Japanese Wikipedia

Susumu Tamagawa, Keio University, s_tamagawa@ae.keio.ac.jp
Takeshi Morita, Aoyama Gakuin University
Takahira Yamaguchi, Keio University, yamaguti@ae.keio.ac.jp, http://www.yamaguti.comp.ae.keio.ac.jp/

keywords: wikipedia, ontology, property, ontology learning

Summary
This paper discusses how to build ontologies with many properties from Japanese Wikipedia. The ontologies include is-a relationships (rdfs:subClassOf), class-instance relationships (rdf:type), and synonym relations (skos:altLabel), as well as property relations and property types. Property relations are triples, property domains (rdfs:domain), and property ranges (rdfs:range). Property types are object (owl:ObjectProperty), data (owl:DatatypeProperty), symmetric (owl:SymmetricProperty), transitive (owl:TransitiveProperty), functional (owl:FunctionalProperty), and inverse functional (owl:InverseFunctionalProperty). Experimental case studies show that the resulting Japanese Wikipedia Ontology is more useful than DBpedia in applications such as a hub of Linked Data, especially in Japan.

1. WordNet [Bond 09] [ 97] [Buitelaar 05] Web Wikipedia [Nakayama 07] Wikipedia Wikipedia Wikipedia Wikipedia Web Web ( ) RDF Linked Open Data (LOD) LOD RDF Wikipedia DBpedia [Auer 07] RDF DBpedia Wikipedia Wikipedia Infobox Wikipedia 68 13 LOD Wikipedia
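The relation kinds listed in the Summary can be pictured as plain subject-predicate-object triples. The following is only a minimal sketch: the `ex:` prefix, the helper names `term`, `literal`, and `to_lines`, and the class names `Novelist`, `Writer`, and `Place` are hypothetical and do not come from the paper. The instance name and alternative label are borrowed from the comparison examples that appear later in the text.

```python
# Illustrative only: the "ex:" prefix and the class names are assumptions.

def term(name):
    """Turn a label into a prefixed name in a hypothetical ex: namespace."""
    return "ex:" + name.replace(" ", "_")


def literal(text):
    """Wrap a plain string as a quoted literal."""
    return '"' + text + '"'


TRIPLES = [
    # is-a relationship between classes
    (term("Novelist"), "rdfs:subClassOf", term("Writer")),
    # class-instance relationship
    (term("Ryunosuke Akutagawa"), "rdf:type", term("Novelist")),
    # synonym relation
    (term("Ryunosuke Akutagawa"), "skos:altLabel", literal("Chokodo Shujin")),
    # property type, domain, and range
    (term("birth place"), "rdf:type", "owl:ObjectProperty"),
    (term("birth place"), "rdfs:domain", term("Writer")),
    (term("birth place"), "rdfs:range", term("Place")),
    (term("birth date"), "rdf:type", "owl:DatatypeProperty"),
]


def to_lines(triples):
    """Render triples as simple Turtle-like statements."""
    return [f"{s} {p} {o} ." for s, p, o in triples]


if __name__ == "__main__":
    print("\n".join(to_lines(TRIPLES)))
```

Keeping each relation as one triple means the same list can hold class-level statements (rdfs:subClassOf), instance-level statements (rdf:type, skos:altLabel), and property-level statements (rdfs:domain, rdfs:range) side by side.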

( Infobox Infobox ) (Is-a ) ( Wikipedia ) [ 10] W3C OWL Wikipedia 1 Is-a Infobox Wikipedia 8,000 Infobox 1 2 Wikipedia Wikipedia Wikipedia [ 09] Wikipedia Wikipedia Wikipedia Linked Open Data DBpedia 2 Wikipedia 3 4 5 Wikipedia 6

2. Wikipedia DBpedia [Auer 07] Wikipedia RDF Wikipedia Infobox 170 720 Infobox Infobox Infobox YAGO2 [Johannes 10] YAGO WordNet Wikipedia Wikipedia GeoNames 2 wasBornOnDate isLocatedIn 100 Fei Wu & Daniel Weld [Fei 08] Wikipedia Infobox WordNet Is-a Infobox Is-a Infobox Infobox Wikipedia Wikipedia Infobox Wikipedia

1 Wikipedia
2 GeoNames: http://www.geonames.org/

3. Wikipedia Wikipedia OWL () OWL 3 RDFS 4 ( )

(1)
(2) property domain (rdfs:domain)
(3) property range (rdfs:range)
(4) (rdfs:subPropertyOf)
(5) property type
  a object (owl:ObjectProperty)
  b data (owl:DatatypeProperty)
  c symmetric (owl:SymmetricProperty)
  d transitive (owl:TransitiveProperty)
  e functional (owl:FunctionalProperty)
  f inverse functional (owl:InverseFunctionalProperty)

3 1 2 Wikipedia (1) Infobox (2) 1 Infobox [ 10] Infobox Infobox ) Wikipedia Infobox Wikipedia MediaWiki 1 Genre Infobox

3 OWL: http://www.w3.org/TR/owl-ref/
4 RDFS: http://www.w3.org/TR/rdf-schema/

1 Infobox Genre 1 owl:ObjectProperty owl:DatatypeProperty 2 40 Infobox Infobox 40 2009 10 Wikipedia Infobox 20 2000 40 Infobox 14 6000 72% Infobox 2 Wikipedia Wikitext (1) (4) (3) 5 5 (4)
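As an illustration of extracting property triples from an Infobox, the sketch below reads `| key = value` fields from MediaWiki wikitext and emits one triple per value. The decision rule used here (a value containing a `[[wiki link]]` yields an owl:ObjectProperty triple, any other value an owl:DatatypeProperty triple) is an assumption made for this example; the paper's actual rule is not recoverable from this transcription. The wikitext snippet, its field names, and the function name `infobox_triples` are likewise hypothetical.

```python
import re

# Hypothetical wikitext; field names and values are illustrative only.
WIKITEXT = """{{Infobox writer
| name = Ryunosuke Akutagawa
| birth_date = 1892-03-01
| birth_place = [[Tokyo]]
| notable_works = [[Rashomon]], [[Kappa (novel)|Kappa]]
}}"""

# [[Target]] or [[Target|shown text]]; only the link target is captured.
LINK = re.compile(r"\[\[([^\]|]+)(?:\|[^\]]*)?\]\]")
# A template field of the form "| key = value".
FIELD = re.compile(r"^\|\s*([^=|]+?)\s*=\s*(.*)$")


def infobox_triples(article, wikitext):
    """Return (subject, property, value, assumed property type) tuples."""
    triples = []
    for line in wikitext.splitlines():
        match = FIELD.match(line.strip())
        if not match:
            continue  # template header, closing braces, or free text
        prop, value = match.group(1), match.group(2).strip()
        if not value:
            continue  # skip empty fields
        targets = LINK.findall(value)
        if targets:
            # Assumed rule: linked values point at other articles.
            for target in targets:
                triples.append((article, prop, target, "owl:ObjectProperty"))
        else:
            # Assumed rule: unlinked values are treated as literals.
            triples.append((article, prop, value, "owl:DatatypeProperty"))
    return triples


if __name__ == "__main__":
    for triple in infobox_triples("Ryunosuke Akutagawa", WIKITEXT):
        print(triple)
```

A single-line field parser like this ignores nested templates and values that span several lines, both of which occur in real Infoboxes, so it should be read as a picture of the idea rather than a usable extractor.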

3 2 (1) (2) (1) (3) (2) ( 5 ) (4) (3) 2 2 Infobox Infobox 3 2 3 1 [ 10] 3 1 3 Ruby 3 3 3 1 Infobox 2 (1) (2) Is-a 3 1 Wikipedia Wikipedia Wikipedia ( ) Wikipedia Is-a Is-a Infobox 4

5 4 Wikipedia Is-a ( ) Is-a [ 10] 3 4 Wikipedia Infobox Wikipedia 3 1 1 Infobox 3 1 2 Infobox 1 Infobox 5 3 1 2 3 1 3 5 3 1 3 1 1 Infobox OWL 4 X(n) P(n) Y(n) P(n) Y(n) P(n) X(n) X(n) P(n) Y(n) Y(n) P(n) Z(n) P(n) X(n) P(n) Z(n) P(n) X Y(n) 1 P(n) Y X(n) 1 3 1 P(n) X(n) Y(n) P(n) Y(n) P(n) X(n) P(n) P(n) A 6 P(n) X(n)
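The four patterns over X, P, Y, and Z above correspond to the standard OWL readings of symmetric, transitive, functional, and inverse functional properties. The sketch below checks those readings against the (subject, value) pairs of a single property. It is only an illustration: the function names are hypothetical, and whatever thresholds or filters the paper applies before assigning a property type are not recoverable from this transcription. The PS3, PS2, PS chain reuses the example named in the text.

```python
from collections import defaultdict


def symmetric_ratio(pairs):
    """Share of (x, y) pairs whose reverse (y, x) is also asserted."""
    asserted = set(pairs)
    if not asserted:
        return 0.0
    return sum((y, x) in asserted for x, y in asserted) / len(asserted)


def transitive_hits(pairs):
    """Chains X P Y and Y P Z for which X P Z is asserted as well."""
    asserted = set(pairs)
    successors = defaultdict(set)
    for x, y in asserted:
        successors[x].add(y)
    hits = []
    for x, y in asserted:
        for z in successors.get(y, ()):
            if (x, z) in asserted:
                hits.append((x, y, z))
    return hits


def is_functional(pairs):
    """True if every subject X has at most one value Y."""
    values = defaultdict(set)
    for x, y in pairs:
        values[x].add(y)
    return all(len(v) <= 1 for v in values.values())


def is_inverse_functional(pairs):
    """True if every value Y belongs to at most one subject X."""
    return is_functional([(y, x) for x, y in pairs])


if __name__ == "__main__":
    # One property linking game consoles, following the PS3/PS2/PS example.
    consoles = [("PS3", "PS2"), ("PS2", "PS"), ("PS3", "PS")]
    print(transitive_hits(consoles))
    print(is_functional(consoles), is_inverse_functional(consoles))
    print(symmetric_ratio([("A", "B"), ("B", "A"), ("A", "C")]))
```

Because Wikipedia data is incomplete and noisy, a ratio such as `symmetric_ratio` is easier to work with than a strict yes/no test; where to place the cut-off is a design choice that this sketch leaves open.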

1 Infobox 5
59,751   owl:ObjectProperty
36,373   owl:ObjectProperty
30,042   owl:ObjectProperty
25,108   owl:DatatypeProperty
22,239   owl:ObjectProperty

6 Y(n) Y(n) Z(n) P(n) X(n) P(n) Z(n) P(n) 6 PS3 PS2 PS2 PS PS3 PS P(n) X(n) Y(n) P(n) X Y(n) P(n) Y X(n) P(n) 6

4. 3 4 1 4 2 4 3 4 4 4 5

2 5
136,033
102,617
70,839
69,690
66,841

2010 11 Wikipedia (jawiki-latest-pages-articles.xml) 5 MySQL Java 4 1 1 Infobox Wikipedia 3 1 1 7,137 1,962,411 Infobox 171,190 1 Infobox 5 2 Wikipedia 3 1 2 3,980 2,919,470 233,247 2 3 1 1 3 1 2 2 Infobox 10,769 4,867,882 Infobox 319,742 Infobox 148,552 3 5 3

5 Wikipedia: http://download.wikimedia.org/jawiki/

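3 2
Infobox    7,137   1,962,411   171,190   95.2 ± 1.33%
           3,980   2,919,470   233,247   92.5 ± 1.63%
2         10,769   4,867,882   319,742   94.3 ± 1.44%

2,919,470 1,000 (1) [ 87] (1) N n \hat{p} 95% 92.5 ± 1.63% 7 2 4,867,882 94.3 ± 1.44% Infobox 1.5 2.5

\[
\left[\; \hat{p} - 1.96\sqrt{\left(1 - \frac{n}{N}\right)\frac{\hat{p}\,(1 - \hat{p})}{n - 1}},\;\;
\hat{p} + 1.96\sqrt{\left(1 - \frac{n}{N}\right)\frac{\hat{p}\,(1 - \hat{p})}{n - 1}} \;\right]
\tag{1}
\]

In Eq. (1), N is the population size, n is the sample size (1,000 here), and \hat{p} is the proportion judged correct in the sample; the interval is the 95% confidence interval.

4 2 3 1 10,769 3 2 9,486 Infobox Infobox 1,888 8,831 82% Infobox 8 9,486 1,000 (1) 95% 95% 94.8 ± 1.22% 4 26,251 21,140 10,871 10,088 9,299 4 5 4 4 4 3 3 3 4,007 14,053 14,053 1,000 (1) 95% 95% 88.3 ± 1.92% 5 5 5 5 23,195 20,633 15,956 12,821 11,569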
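The interval in Eq. (1) is a normal-approximation 95% confidence interval with a finite population correction. As a check on that reading, the sketch below evaluates it for the sample size of 1,000 and the population sizes quoted in the text; the function name `precision_interval` and its parameter names are chosen here for illustration and are not from the paper.

```python
import math


def precision_interval(p_hat, n, population, z=1.96):
    """Confidence interval of Eq. (1).

    p_hat:      proportion judged correct in the sample
    n:          sample size
    population: size N of the set the sample was drawn from
    z:          normal quantile (1.96 for a 95% interval)
    """
    correction = 1.0 - n / population  # finite population correction
    half_width = z * math.sqrt(correction * p_hat * (1.0 - p_hat) / (n - 1))
    return p_hat - half_width, p_hat + half_width


if __name__ == "__main__":
    # 92.5% correct in a sample of 1,000 out of 2,919,470 triples.
    low, high = precision_interval(0.925, 1000, 2_919_470)
    # Half-width in percentage points: about 1.63
    print(round((high - low) / 2 * 100, 2))
```

With a sample of 1,000 out of several million triples the correction factor is almost 1, so the half-width is driven mainly by the sample proportion and the sample size; the correction only becomes noticeable for the smaller populations such as 9,486 or 14,053.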

6 Is-a 44,766 43,532 22,175 16,370 12,236 Wikipedia Wikipedia Wikipedia 3 3 Is-a 3,234 35,946 35,946 1,000 (1) 95% 95% 92.1 ± 1.65% 6 Is-a 5 owl:DatatypeProperty (rdfs:Literal) 4 1 2 Is-a 6 2 40,262 5,120 4,316 1,886 Is-a 26,209 1,113 9,737 2,121 Is-a 1 Wikipedia Wikipedia 48% 5,120 Infobox 353 Infobox 1,278 3,489 3 3,980 Infobox 1.5 1 4 4 3 4 2,322 1,387 1,387

7 n 8 x 7 8 2,082 1,919 1,514 237 227 30 30 1 486 510 0.95 198 360 0.55 108 258 0.42 2,720 11,016 0.25 57.5% 7 n 7 n 18 75.7% 7 7 156 51.9% 105 59.0% 81 82.7% Infobox Infobox Infobox 4 5 3 5 3 1 4,867,882 10,769 1 3 5 10,927 415 415 45.1% 8 x 8 x 8 8

19 14 18 0.5 34 55,887 3 1 Wikipedia 2 3 5 340 210 3 54 3 1 2 3 1 1 Infobox 3 Wikipedia Infobox 3 5 Wikipedia Infobox 3 PS PS2 PS3 3 3 5 185,700 1 2,267 2,267 54.3% owl:DatatypeProperty owl:DatatypeProperty 3 1 owl:DatatypeProperty owl:ObjectProperty 47,295 1 3,670 3,670 22.4% owl:DatatypeProperty

9 Wikipedia
                            10,769   -       4,867,882
DatatypeProperty               214   -         416,803
ObjectProperty                  99   -         912,746
SymmetricProperty              415   45.1%      21,854
TransitiveProperty             210   0%          1,020
FunctionalProperty           2,267   54.3%     185,700
InverseFunctionalProperty    3,670   22.4%      47,295

10 Wikipedia
                 4,867,882   94.3 ± 1.44%
Infobox          1,962,411   95.2 ± 1.33%
                 2,919,470   92.5 ± 1.63%
(rdfs:domain)        9,486   94.8 ± 1.22%
(rdfs:range)        40,262   90.4 ± 1.81%
                    14,053   88.3 ± 1.92%
Is-a                35,946   92.1 ± 1.65%
                     1,387   57.5%

DVD

5. Wikipedia Wikipedia 5 1 Wikipedia 9 Wikipedia 10 95% 9 10 10,769 4,867,882 Infobox 2 94% 9,486 8,831 82% 2 40,262 5,120 48% 90% 57.5% 1,387 owl:Object/DatatypeProperty (owl:SymmetricProperty) (owl:TransitiveProperty) (owl:FunctionalProperty) (owl:InverseFunctionalProperty) 8 5 2 Wikipedia Wikipedia Wikipedia RDF DBpedia DBpedia Wikipedia RDF DBpedia 2011 1 Wikipedia Infobox (infobox_properties_ja.nt, infobox_property_definitions_ja.nt) 6 11 DBpedia Wikipedia DBpedia 200 DBpedia 700 DBpedia wikiPageUsesTemplate

6 http://wiki.dbpedia.org/downloads

11 Wikipedia DBpedia
                 Wiki-Ont    DBpedia
                   10,769     10,034
                4,867,882  2,840,553
(rdfs:domain)       9,486          -
(rdfs:range)        5,120          -
                  319,742    133,999

DBpedia Infobox wiki Wikipedia Infobox wiki Infobox Wikipedia 1,962,411 DBpedia 100 Wikipedia 8,447 DBpedia 5,056 Wikipedia DBpedia 2.4 DBpedia Infobox Wikipedia 3 1 2 Infobox 9 12 13 13 * ObjectProperty + DatatypeProperty 9 Wikipedia DBpedia 12 Wikipedia DBpedia ( ) DBpedia Wikipedia Ryunosuke Akutagawa Chokodo Shujin ( ) Kappa (short story) 9 City of Paris Parisian (person) Paris ( ) Paris (France) 30 9 DBpedia Wikipedia DBpedia Wikipedia 12 DBpedia Wikipedia Wikipedia Wikipedia DBpedia 9 Wikipedia 2 DBpedia 30 Wikipedia 3 13 DBpedia DBpedia yearsun DatatypeProperty Wikipedia DatatypeProperty DatatypeProperty DBpedia Wikipedia DatatypeProperty Wikipedia DBpedia ObjectProperty Wikipedia ObjectProperty Wikipedia

13 Wikipedia DBpedia DBpedia Wikipedia Genre* * notable works 1915 * birth place*, * children * ( ) relations * death date+ 1927-07-24 + 1927 7 24 birth date+ 1892-03-01 + 1892 3 1 wikiPageUsesTemplate, imagesize, 6 6 7 63 sans 11,840,000 + 11,840,000 km 2 14,518 + 14,518km 2 10,540ha alt maxi 130m :130m alt mini 28m :28m maire ( ) cp 75001-75020 75116 75001-75020 75116 xprecipmm, xsun, région, 22 69, 17 DBpedia Wikipedia DatatypeProperty ObjectProperty DBpedia Linked Data Wikipedia Linked Data

6. Wikipedia Wikipedia Linked Data Wikipedia Is-a Wikipedia DBpedia Linked Data 2010 12 Wikipedia 68 Wikipedia Wikipedia Linked Data Wikipedia WordNet Wikipedia SourceForge.jp 7

7 http://wikipedia-ont.sourceforge.jp/

[Auer 07] Soren Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, Zachary Ives: DBpedia: A Nucleus for a Web of Open Data, 6th International Semantic Web Conference, Vol. 4825, pp. 722-735 (2007)
[Bond 09] Francis Bond, Hitoshi Isahara, Sanae Fujita, Kiyotaka Uchimoto, Takayuki Kuribayashi and Kyoko Kanzaki: Enhancing the Japanese WordNet, 7th Workshop on Asian Language Resources, pp. 1-8 (2009)
[Buitelaar 05] Paul Buitelaar, Philipp Cimiano, Bernardo Magnini (Eds.): Ontology Learning from Text: Methods, Evaluation and Applications, Frontiers in Artificial Intelligence and Applications Series, Vol. 123, IOS Press (2005)
[Fei 08] Fei Wu, Daniel S. Weld: Automatically Refining the Wikipedia Infobox Ontology, International World Wide Web Conference 2008, pp. 634-644 (2008)
[ 97],,,,,,, :, (1997)
[Johannes 10] Johannes Hoffart, Fabian Suchanek, Klaus Berberich, Gerhard Weikum: YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia, Research Report MPI-I-2010-5-007, Max-Planck-Institut für Informatik (2010)
[Miller 95] G. A. Miller: WordNet: A Lexical Database for English, Commun. ACM, Vol. 38, No. 11, pp. 39-41 (1995)
[Nakayama 07] Nakayama, K., Hara, T. and Nishio, S.: Wikipedia Mining for an Association Web Thesaurus Construction, in Proceedings of International Conference on Web Information Systems Engineering, pp. 322-334 (2007)
[ 09],, Erdmann, M.,,,, Wikipedia,, Vol. 24, No. 6, pp. 549-557 (2009)
[ 10],,,, : Wikipedia,, Vol. 25, No. 5, pp. 623-636 (2010)
[ 87] :, (1987)
[Yokoi 95] T. Yokoi: The EDR Electronic Dictionary, Commun. ACM, Vol. 38, No. 11, pp. 42-44 (1995)

2011 1 17

2009 2011 Web 2003 2005 2007 4 (DC2) 2008 4 (PD) 2009 4 2011 4 Web 1979 1984 1989 1997 2004 Web 1992 2002 2007 AAAI IEEE-CS