2009/9 Vol. J92 D No. 9 HTML [3] Microsoft PowerPoint Apple Keynote OpenOffice Impress XML 4 1 (A) (C) (F) 2. 2. 1 1484 Fig. 1 1 An example of slide i



Similar documents
1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

2 : Open Clip Art Library [4] Microsoft Office PowerPoint Web PowerPoint 2 Yahoo! Web [5] SlideShare Yahoo! Web Yahoo! Web

2. Twitter Twitter 2.1 Twitter Twitter( ) Twitter Twitter ( 1 ) RT ReTweet RT ReTweet RT ( 2 ) URL Twitter Twitter 140 URL URL URL 140 URL URL

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

<> <name> </name> <body> <></> <> <title> </title> <item> </item> <item> 11 </item> </>... </body> </> 1 XML Web XML HTML 1 name item 2 item item HTML

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

Vol. 48 No. 3 Mar PM PM PMBOK PM PM PM PM PM A Proposal and Its Demonstration of Developing System for Project Managers through University-Indus

IPSJ SIG Technical Report Vol.2012-HCI-149 No /7/20 1 1,2 1 (HMD: Head Mounted Display) HMD HMD,,,, An Information Presentation Method for Weara

Fig. 1. Example of characters superimposed on delivery slip.

1_26.dvi

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

a) Extraction of Similarities and Differences in Human Behavior Using Singular Value Decomposition Kenichi MISHIMA, Sayaka KANATA, Hiroaki NAKANISHI a

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

3_23.dvi

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

2 Fig D human model. 1 Fig. 1 The flow of proposed method )9)10) 2.2 3)4)7) 5)11)12)13)14) TOF 1 3 TOF 3 2 c 2011 Information

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

xx/xx Vol. Jxx A No. xx 1 Fig. 1 PAL(Panoramic Annular Lens) PAL(Panoramic Annular Lens) PAL (2) PAL PAL 2 PAL 3 2 PAL 1 PAL 3 PAL PAL 2. 1 PAL

H19国際学研究科_02.indd

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

IPSJ SIG Technical Report Vol.2013-GN-87 No /3/ Research of a surround-sound field adjustmen system based on loudspeakers arrangement Ak

3_39.dvi

Fig. 3 3 Types considered when detecting pattern violations 9)12) 8)9) 2 5 methodx close C Java C Java 3 Java 1 JDT Core 7) ) S P S

2006 [3] Scratch Squeak PEN [4] PenFlowchart 2 3 PenFlowchart 4 PenFlowchart PEN xdncl PEN [5] PEN xdncl DNCL 1 1 [6] 1 PEN Fig. 1 The PEN

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

.,,, [12].,, [13].,,.,, meal[10]., [11], SNS.,., [14].,,.,,.,,,.,,., Cami-log, , [15], A/D (Powerlab ; ), F- (F-150M, ), ( PC ).,, Chart5(ADIns

Microsoft Word - toyoshima-deim2011.doc

IPSJ SIG Technical Report Vol.2011-DBS-153 No /11/3 Wikipedia Wikipedia Wikipedia Extracting Difference Information from Multilingual Wiki

1 4 4 [3] SNS 5 SNS , ,000 [2] c 2013 Information Processing Society of Japan

2) TA Hercules CAA 5 [6], [7] CAA BOSS [8] 2. C II C. ( 1 ) C. ( 2 ). ( 3 ) 100. ( 4 ) () HTML NFS Hercules ( )

Vol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

DEIM Forum 2010 A3-3 Web Web Web Web Web. Web Abstract Web-page R

IPSJ SIG Technical Report Vol.2014-DPS-158 No.27 Vol.2014-CSEC-64 No /3/6 1,a) 2,b) 3,c) 1,d) 3 Cappelli Bazen Cappelli Bazen Cappelli 1.,,.,.,

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

IPSJ SIG Technical Report Vol.2014-MBL-70 No.49 Vol.2014-UBI-41 No /3/15 2,a) 2,b) 2,c) 2,d),e) WiFi WiFi WiFi 1. SNS GPS Twitter Facebook Twit

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

DEIM Forum 2012 E Web Extracting Modification of Objec

知識ベースCFD

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

和文タイトル

SEJulyMs更新V7

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR

IPSJ SIG Technical Report Vol.2010-CVIM-170 No /1/ Visual Recognition of Wire Harnesses for Automated Wiring Masaki Yoneda, 1 Ta

sigmusdemo.dvi

2015 9

1 Fogg Fogg Behavior Model [1] information cascade [2] TPO [3] Fig. 2 Target area of this paper. 1 Fig. 1 Fogg b

untitled

Microsoft Word - deim2011_new-ichinose doc

Sobel Canny i

教師情報を必要としないWebページ群のコンテンツ自動抽出ツールの提案

12研究資料02.indd

dsample.dvi

fl™‹ä1.eps

Vol. 9 No. 5 Oct (?,?) A B C D 132

BOK body of knowledge, BOK BOK BOK 1 CC2001 computing curricula 2001 [1] BOK IT BOK 2008 ITBOK [2] social infomatics SI BOK BOK BOK WikiBOK BO

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

IPSJ SIG Technical Report iphone iphone,,., OpenGl ES 2.0 GLSL(OpenGL Shading Language), iphone GPGPU(General-Purpose Computing on Graphics Proc

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

Accuracy Improvement by Compound Discriminant Functions for Resembling Character Recognition Takashi NAKAJIMA, Tetsushi WAKABAYASHI, Fumitaka KIMURA,

Vol.11-HCI-15 No. 11//1 Xangle 5 Xangle 7. 5 Ubi-WA Finger-Mount 9 Digitrack 11 1 Fig. 1 Pointing operations with our method Xangle Xa

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2011-MBL-57 No.27 Vol.2011-UBI-29 No /3/ A Consideration of Features for Fatigue Es

2007/2 Vol. J90 D No Web 2. 1 [3] [2], [11] [18] [14] YELLOW [16] [8] tfidf [19] 2. 2 / 30% 90% [24] 2. 3 [4], [21] 428

Publish/Subscribe KiZUNA P2P 2 Publish/Subscribe KiZUNA 2. KiZUNA 1 Skip Graph BF Skip Graph BF Skip Graph Skip Graph Skip Graph DDLL 2.1 Skip Graph S

学術会議講演 pptx

IPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple

2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server

& Vol.2 No (Mar. 2012) 1,a) , Bluetooth A Health Management Service by Cell Phones and Its Us

Computer Security Symposium October ,a) 1,b) Microsoft Kinect Kinect, Takafumi Mori 1,a) Hiroaki Kikuchi 1,b) [1] 1 Meiji U

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

独立行政法人情報通信研究機構 Development of the Information Analysis System WISDOM KIDAWARA Yutaka NICT Knowledge Clustered Group researched and developed the infor

24 Region-Based Image Retrieval using Fuzzy Clustering

( )

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

IPSJ SIG Technical Report Vol.2014-DBS-159 No.6 Vol.2014-IFAT-115 No /8/1 1,a) 1 1 1,, 1. ([1]) ([2], [3]) A B 1 ([4]) 1 Graduate School of Info

知能と情報, Vol.30, No.5, pp

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

1 2 3 マルチメディア, 分散, 協調とモバイル (DICOMO2013) シンポジウム 平成 25 年 7 月.,.,,.,. Surrogate Diner,., Surrogate Diner,, 3,, Surrogate Diner. An Interface Agent for Ps

Vol. 45 No Web ) 3) ),5) 1 Fig. 1 The Official Gazette. WTO A

DEIM Forum 2010 A Web Abstract Classification Method for Revie

VRSJ-SIG-MR_okada_79dce8c8.pdf

Vol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka

20mm 63.92% ConstantZoom U 5

IPSJ SIG Technical Report Vol.2014-GN-90 No.16 Vol.2014-CDS-9 No.16 Vol.2014-DCC-6 No /1/24 1,a) 2,b) 2,c) 1,d) QUMARION QUMARION Kinect Kinect

Transcription:

a) Structure Extraction from Presentation Slide Information Tessai HAYAMA a), Hidetsugu NANBA, and Susumu KUNIFUJI Web 1. Web Graduate School of Knowledge Science, Japan Advanced Institute of Science and Technology, 1 1, Nomi-shi, 923 1218 Japan Faculty of Information Sciences, Hiroshima City University, 3 4 1, Ozukahigashi, Asaminami-ku, Hiroshima-shi, 731 3194 Japan a) E-mail: t-hayama@jaist.ac.jp [1], [7], [8] Rosenfeld [6] Zhai [9] PDF Web [5] Web Web D Vol. J92 D No. 9 pp. 1483 1494 c 2009 1483

2009/9 Vol. J92 D No. 9 HTML [3] Microsoft PowerPoint Apple Keynote OpenOffice Impress XML 4 1 (A) (C) (F) 2. 2. 1 1484 Fig. 1 1 An example of slide information and its structure.

(F) (E) 2. 2 1 (A) (A) (B) (C) (C) (D) (G) (H) (I) (G) / 3. 2 3. 1 3. 2 3. 1 2 1485

2009/9 Vol. J92 D No. 9 Table 1 1 Score sheet of attribute based on the likelihood of the attribute. Ti1) >Threshold (fontsize1) +1 S1) +1 Ti2) >Threshold (y axis position) +1 S2) Ti3) +1 +1 Ti4) +1 S3) / Ti5) >Threshold (number of characters) +1 +1 S4) >Threshold (fontsize2) +1 S5) >Threshold (number of characters) +1 F 1) / 5 Ta1) F 2) / 4 5 F 3) / 4 Ta2) F 4) / / 4 3 Ta3) 4 F 5) / / Ta4) 3 1 Ta5) / 3 F 6) 4 Ta6) / F 7) <Threshold (number of characters) +1 1 T hreshold (fontsize1), T hreshold (fontsize2), T hreshold (Yaxis position) T hreshold (number of characters) 2 Fig. 2 Flow chart of organizing processing. (1) 1 1 Ti1 Ti2 Ti3 Ti4 Ti5 S1 S2 S3 S4 S5 1486

F 1 F 2 F 3 F 4 / F 5 S6 Ta1 Ta2 Ta3 Ta4 Ta5 / T 6 1 3 Fig. 3 An example of a slide including attributes scores. 3 Object(b) [ ] [3,5,0,0] Object(b) Object(c) (g) (h) (2) (1) (2.1) (2.3) (2.1) Li Attri (1) (2) 1487

2009/9 Vol. J92 D No. 9 Ev (attri) = Attri Val (attri) (if attri cand == attri) MaxScore (attri) (otherwise) Attri Val (attri) Li Attri = Ev ( title ) Ev ( body text ) (1) Ev ( figure ) Ev ( table ). (2) attri Attri Val (attri) attri cand MaxScore (attri) 1 (1) Ev (attri) attri attri (2) Li Attri (1) 2 Object(b) (g) 375 300 object(b) object(g) (2.2) (2.1) 0 3 object(d) object(d) object(f) 3 object(a) 0 (2.3) (2.1) (2.2) (3) / / (2.2) / (4) 3. 2 1 1 5 1488

Web (1) (2) (3) 1 2 / 4 block(a) (b) (a) 4 Fig. 4 Units attribute sequence in a block and it s dividing point. / (b) block(d) block(c) / (i) / / / block(a) block(b) (d) (ii) / / block(c) 3 (4) (2) (3) 1489

2009/9 Vol. J92 D No. 9 4. 4. 1 / / P recision Recall F measure (3) (5) Matched CorrectData Recall = (3) Total CorrectData Matched CorrectData P recision = (4) Total DetectedData 2 Recall P recision F measure = (5) Recall + P recision Matched CorrectData Total CorrectData Total DetectedData Web [4] 98 24.14 2366 7 2 5 18 1 2 3 XML Microsoft Visual Studio C# Microsoft PowerPoint PPT PPT PPT 1 5 5 Unit Object attribute Node- List Unit ID 2 1490

5 XML Fig. 5 An example of XML data outputted by an experimental system developted based on proposal method. 2 Table 2 Parameters of proposal method used in this experiment. Threshold (fontsize1) Threshold (fontsize2) Threshold (Yaxis position) Threshold (number of charactors) 1 24 pt 32 pt 1/4 8 24 pt 4. 2 3 4 3 3 F measure 0.89 0.69 4 4 0.95 0.90 1491

2009/9 Vol. J92 D No. 9 Table 3 3 Accuracy for each attribute results in the organizing process. (2333) (9285) (1905) (46) (2201) Recall 0.97 0.89 0.93 0.96 0.96 Precision 0.99 0.85 0.85 0.98 0.81 F-measure 0.98 0.85 0.89 0.97 0.87 Recall 0.87 0.69 0.64 0.93 0.91 Precision 0.96 0.88 0.63 0.93 0.63 F-measure 0.92 0.77 0.64 0.93 0.74 Table 4 4 Ratio in pages for each correct ratio of results in the structuring process. 1.00 0.99 0.80 0.79 0.60 0.59 0.00 N/A 0.95 0.03 0.04 0.05 0.12 0.90 0.05 0.06 0.07 0.12 0.76 0.07 0.08 0.15 0.12 6 [I] [II] Fig. 6 Slide samples matching/mis-matching structure data extracted by the proposal method to its correct data. 0.95 0.76 1492

/ 95% [I] [II] 6 (C) (D) (E) (B) (C) (D) (E) (E) (D) 5. 95% [2] 21 B 20300046 [1] A. Anjewierden, AIDAS: Incremental logical structure discovery in PDF documents, Proc. 6th International Conference on Document Analysis and Recognition, pp.374 378, 2001. [2] T. Hayama, H. Nanba, and S. Kunifuji, Alignment between a technical paper and presentation sheets using a hidden Markov model, Proc. Active Media Technology 2005, pp.102 106, 2005. [3] T. Ishihara, H. Takagi, T. Itoh, and C. Asakawa, Analyzing visual layout for a non-visual presentation-document interface, Proc. 8th International ACM SIGACCESS Conference on Computers and Accessibility, pp.165 172, 2006. [4] H. Nanba, T. Abekawa, M. Okumura, and S. Saito, Bilingual presri: Integration of multiple research paper databases, Proc. 7th RIAO Conference: Coupling Approaches, Coupling Media and Coupling Languages for Information Retrieval, pp.195 211, 2004. [5] Web vol.45, no.9, pp.2157 2167, 2004. [6] B. Rosenfeld, R. Feldman, and Y. Aumann, Structural extraction from visual layout of documents, Proc. 11th International Conference on Information and Knowledge Management, pp.203 210, 2002. [7] T. Watanabe, Q. Luo, and N. Sugie, Layout recognition of multi-kinds of table-form documents, IEEE Trans. Pattern Anal. Mach. Intell., vol.17, no.4, pp.432 445, 1995. [8] Y. Yang and H. Zhang, HTML page analysis based 1493

2009/9 Vol. J92 D No. 9 on visual cues, Proc. 6th International Conference on Document Analysis and Recognition, pp.859 864, 2001. [9] Y. Zhai and B. Liu, Structured data extraction from the Web based on partial tree alignment, IEEE Trans. Knowl. Data Eng., vol.18, no.12, pp.1614 1628, 2006. 20 12 15 21 4 13 2001 2003 2006 2007 1996 1998 2001 2002 2007 ACL ACM 1974 1982 1986 ICOT 1992 1998 25 1996 2004 1494