1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

Similar documents
1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

untitled

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

Modal Phrase MP because but 2 IP Inflection Phrase IP as long as if IP 3 VP Verb Phrase VP while before [ MP MP [ IP IP [ VP VP ]]] [ MP [ IP [ VP ]]]

Vol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe

IPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

IPSJ SIG Technical Report Vol.2017-ARC-225 No.12 Vol.2017-SLDM-179 No.12 Vol.2017-EMB-44 No /3/9 1 1 RTOS DefensiveZone DefensiveZone MPU RTOS

DEIM Forum 2010 A Web Abstract Classification Method for Revie

A Japanese Word Dependency Corpus ÆüËܸì¤Îñ¸ì·¸¤ê¼õ¤±¥³¡¼¥Ñ¥¹

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

Vol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

NINJAL Project Review Vol.3 No.3

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

soturon.dvi

Q [4] 2. [3] [5] ϵ- Q Q CO CO [4] Q Q [1] i = X ln n i + C (1) n i i n n i i i n i = n X i i C exploration exploitation [4] Q Q Q ϵ 1 ϵ 3. [3] [5] [4]

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

IPSJ SIG Technical Report Vol.2012-IS-119 No /3/ Web A Multi-story e-picture Book with the Degree-of-interest Extraction Function

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

1 1 tf-idf tf-idf i

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

IPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

論文9.indd

2006 [3] Scratch Squeak PEN [4] PenFlowchart 2 3 PenFlowchart 4 PenFlowchart PEN xdncl PEN [5] PEN xdncl DNCL 1 1 [6] 1 PEN Fig. 1 The PEN

A pp CALL College Life CD-ROM Development of CD-ROM English Teaching Materials, College Life Series, for Improving English Communica

12) NP 2 MCI MCI 1 START Simple Triage And Rapid Treatment 3) START MCI c 2010 Information Processing Society of Japan


6 2. AUTOSAR 2.1 AUTOSAR AUTOSAR ECU OSEK/VDX 3) OSEK/VDX OS AUTOSAR AUTOSAR ECU AUTOSAR 1 AUTOSAR BSW (Basic Software) (Runtime Environment) Applicat

COM COM 4) 5) COM COM 3 4) 5) COM COM 6) 7) 10) COM Bonanza 6) Bonanza Hearts COM 7) 10) Hearts 3 2,000 4,000

知能と情報, Vol.30, No.5, pp

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

log F0 意識 しゃべり 葉の log F0 Fig. 1 1 An example of classification of substyles of rap. ' & 2. 4) m.o.v.e 5) motsu motsu (1) (2) (3) (4) (1) (2) mot

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

08医療情報学22_1_水流final.PDF

IPSJ SIG Technical Report Vol.2009-HCI-134 No /7/17 1. RDB Wiki Wiki RDB SQL Wiki Wiki RDB Wiki RDB Wiki A Wiki System Enhanced by Visibl

The 15th Game Programming Workshop 2010 Magic Bitboard Magic Bitboard Bitboard Magic Bitboard Bitboard Magic Bitboard Magic Bitboard Magic Bitbo

大学における原価計算教育の現状と課題

自然言語処理16_2_45

Vol.11-HCI-15 No. 11//1 Xangle 5 Xangle 7. 5 Ubi-WA Finger-Mount 9 Digitrack 11 1 Fig. 1 Pointing operations with our method Xangle Xa

IPSJ SIG Technical Report Vol.2010-GN-74 No /1/ , 3 Disaster Training Supporting System Based on Electronic Triage HIROAKI KOJIMA, 1 KU

21 Pitman-Yor Pitman- Yor [7] n -gram W w n-gram G Pitman-Yor P Y (d, θ, G 0 ) (1) G P Y (d, θ, G 0 ) (1) Pitman-Yor d, θ, G 0 d 0 d 1 θ Pitman-Yor G

< A796BD8AD991E58A77976C2D8CBE8CEA C B B835E2E706466>

IPSJ SIG Technical Report Vol.2009-DBS-149 No /11/ Bow-tie SCC Inter Keyword Navigation based on Degree-constrained Co-Occurrence Graph

【HP用】26.12月号indd.indd

26.2月号indd.indd

26.1月号indd.indd

評論・社会科学 84号(よこ)(P)/3.金子

IPSJ SIG Technical Report Vol.2014-CE-126 No /10/11 1,a) Kinect Support System for Romaji Learning through Exercise Abstract: Educatio

IPSJ SIG Technical Report Vol.2012-HCI-149 No /7/20 1 1,2 1 (HMD: Head Mounted Display) HMD HMD,,,, An Information Presentation Method for Weara


Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

Mimehand II[1] [2] 1 Suzuki [3] [3] [4] (1) (2) 1 [5] (3) 50 (4) 指文字, 3% (25 個 ) 漢字手話 + 指文字, 10% (80 個 ) 漢字手話, 43% (357 個 ) 地名 漢字手話 + 指文字, 21

特集_03-08.Q3C

29 jjencode JavaScript

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

TCP/IP IEEE Bluetooth LAN TCP TCP BEC FEC M T M R M T 2. 2 [5] AODV [4]DSR [3] 1 MS 100m 5 /100m 2 MD 2 c 2009 Information Processing Society of

Web Basic Web SAS-2 Web SAS-2 i

IPSJ SIG Technical Report Vol.2014-HCI-158 No /5/22 1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

<95DB8C9288E397C389C88A E696E6462>

外国語学部 紀要30号(横書)/03_菊地俊一

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth

IPSJ SIG Technical Report Vol.2014-CE-127 No /12/7 1,a) 2,3 2,3 3 Development of the ethological recording application for the understanding of

IPSJ SIG Technical Report Vol.2015-CVIM-196 No /3/6 1,a) 1,b) 1,c) U,,,, The Camera Position Alignment on a Gimbal Head for Fixed Viewpoint Swi

Introduction to Information and Communication Technology (a)

MDD PBL ET 9) 2) ET ET 2.2 2), 1 2 5) MDD PBL PBL MDD MDD MDD 10) MDD Executable UML 11) Executable UML MDD Executable UML

DEIM Forum 2009 E

Microsoft Word - toyoshima-deim2011.doc

Core1 FabScalar VerilogHDL Cache Cache FabScalar 1 CoreConnect[2] Wishbone[3] AMBA[4] AMBA 1 AMBA ARM L2 AMBA2.0 AMBA2.0 FabScalar AHB APB AHB AMBA2.0

20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

1 UD Fig. 1 Concept of UD tourist information system. 1 ()KDDI UD 7) ) UD c 2010 Information Processing S

大学論集第42号本文.indb

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

IPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

untitled

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

97-00


( )

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2011-MBL-57 No.27 Vol.2011-UBI-29 No /3/ A Consideration of Features for Fatigue Es

paper.dvi

2 : Open Clip Art Library [4] Microsoft Office PowerPoint Web PowerPoint 2 Yahoo! Web [5] SlideShare Yahoo! Web Yahoo! Web

07九州工業大学.indd

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

CJL NEWS VOL JANUARY contents

IPSJ SIG Technical Report Vol.2014-IOT-27 No.14 Vol.2014-SPT-11 No /10/10 1,a) 2 zabbix Consideration of a system to support understanding of f

Transcription:

1 2 3 Incremental Linefeed Insertion into Lecture Transcription for Automatic Captioning Masaki Murata, 1 Tomohiro Ohno 2 and Shigeki Matsubara 3 The development of a captioning system that supports the real-time understanding of spoken documents such as lectures and commentaries is required. In monologues, since a sentence tends to be long, each sentence is often displayed in multi lines on the screen, it is necessary to insert linefeeds into a text so that the text becomes easy to read. This paper proposes a technique for incrementally inserting linefeeds into a Japanese spoken monologue as an elemental technique to generate the readable captions. Our method appropriately and incrementally inserts linefeeds into a sentence by machine learning, based on the information such as dependencies, clause boundaries, pauses and line length. An experiment using Japanese speech data has shown the effectiveness of our technique. 1. 1) 2) 4) 5) 6) 1 2 7) 1 Graduate School of Information Science, Nagoya University 2 Graduate School of International Development, Nagoya University 3 Information Technology Center, Nagoya University 1 c 200 Information Processing Society of Japan

1 7.35% 74.0% 2. 2 3 1 2 2 3 linefeed point 3. 1 1 5 4 2 c 200 Information Processing Society of Japan

1 戦争が 2 終わりまして 3 それから 今日までの : 文節 4 4 5 五十年間を 6 便宜的に 1 ~ 7 分けますと 8 私の : 図 5 における文節番号 4 1 5 5 ( 1 ) 考えでは 1 1 (a) ( 2 ) CBAP 8) (b) 3 2 3 ( 3 ) ) (c 1 2 ( 4 ) ) 1 (d 2 3 ( 5 ) ( 6 ) (e 1 3 (f) 1 2 2 3 1 2 1 2 3 (g) (l) (a) (f) 8 5. 3.1 1 4-i) 1 7 1 n B = b 1 b n R = r 1 r n r i b i r i r i = 1 = 0 m j L j = b j 1 bj n j (1 j m) 1 k < n j r j k = 0 k = n j r j k = 1 3.1.1 B P (R B) R P (R B) 1 3.1.2 3 c 200 Information Processing Society of Japan

(a) (d) (g) (j) 1 2 3 4 8 文節 4 8 4 8 1 2 3 1 2 3 7 4 5 6 8 改行挿入判定 改行挿入判定 直後の文節に係らない 改行挿入判定 改行挿入判定 直後の文節に係らない (b) (e) (h) (k) 4 8 4 8 1 2 3 1 2 3 7 4 5 6 8 節 節境界 改行改行改行改行改行ナシナシアリナシアリ 改行挿入判定 改行挿入判定 1 2 3 改行改行ナシアリ 改行挿入判定 改行挿入判定 1 2 3 7 4 5 6 8 (c) (f) (i) (l) 4 8 4 8 係り受け関係 1 2 3 1 2 3 7 4 5 6 8 改行挿入判定 改行挿入判定 改行挿入判定 改行挿入判定 今日今日までのまでの五十年間五十年間を便宜的に分けますと私の 5 4 c 200 Information Processing Society of Japan

P (R B) (1) =P (r 1 1 = 0,, r 1 n 1 1 = 0, r 1 n 1 = 1,, r m 1 = 0,, r m n m 1 = 0, r m n m = 1 B) =P (r 1 1 = 0 B) P (r 1 n 1 1 = 0 r 1 n 1 2 = 0,, r 1 1 = 0, B) P (r 1 n 1 = 1 r 1 n 1 1 = 0,, r 1 1 = 0, B) P (r m 1 = 0 r m 1 n m 1 = 1, B) P (r m n m 1 = 0 r m n m 2 = 0,, r m 1 = 0, r m 1 n m 1 = 1, B) P (r m n m = 1 r m n m 1 = 0,, r m 1 = 0, r m 1 n m 1 = 1, B) P (r j k = 1 rj k 1 = 0,, rj 1 = 0, rj 1 n j 1 = 1, B) 1 B j 1 b j k P (r j k = 0 rj k 1 = 0,, rj 1 = 0, rj 1 n j 1 = 1, B) b j k P (r m n m = 1 r m n m 1 = 0,, r m 1 = 0, r m 1 n m 1 = 1, B) = 1 1 3.1.2 P (r j k = 1 rj k 1 = 0,, rj 1 = 0, rj 1 n j 1 = 1, B) P (r j k = 0 rj k 1 = 0,, r j 1 = 0, rj 1 n j 1 = 1, B) 7) b j k b j k b j k b j k b j k b j k b j 1 bj k b j k bj k b j k bj k b j k 3 2 3 6 7 b j k 4 0.2 0.2 1.0 1.0 3.0 3.0 b j k - - - - - 4. 4.1 10) 6 16 1 15 16 2 14 20,707 5 c 200 Information Processing Society of Japan

6 11) 1,000 20 4.2 = = F = 2 + Julius 12) 4.3 1 F 7.35% (5,711/7,17) 74.0% (5,711/7,625) 77.06 81.21% (5,845/7,17) 7.47% (5,845/7,355) 80.33 100 0 80 ] % 70 [ 60 合 50 割積 40 累 30 20 10 0 本手法 0 2 4 6 8 10 12 14 16 18 20 22 24 7 遅延時間 [ 秒 ] 文単位の手法 1 7) 7 4 4 = / 1.5 7.14 6 c 200 Information Processing Society of Japan

2 F 8.24% (1,517/1,700) 100.00% (1,517/1,517) 4.31 76.30% (4,14/5,47) 68.66% (4,14/6,108) 72.28 3 4.4 (%) 83.2 (68/838) 8.81 (581/588).0 (10/110) 100.00 (31/31) 100.00 (12/12) F 2 183 1,700 3 3 61 65.5% 1,607 1,456 0.60% 3 2.88% 5 15% 151 2.72% 5. 7.35% 74.0% (B) (No. 21700157) 7 c 200 Information Processing Society of Japan

1) vol.1, no.12, pp.1024-102 (2008). 2) G. Boulianne, J.-F. Beaumont, M. Boisvert, J. Brousseau, P. Cardinal, C. Chapdelaine, M. Comeau, P. Ouellet and F. Osterrath: Computer-Assisted Closed- Captioning of Live TV Broadcasts in French, Proc. th ICSLP, no.mon2a2o-1, pp.273-276 (2006). 3) J. Xue, R. Hu and Y. Zhao: New Improvements in Decoding Speed and Latency for Automatic Captioning, Proc. th ICSLP, no.wed1cap-8, pp.1630-1633 (2006). 4) C. Munteanu, G. Penn and R. Baecker: Web-Based Language Modelling for Automatic Lecture Transcription, Proc. 8th Interspeech, no.thd.p3a-2, pp.2353-2356 (2007). 5) D vol.j0-d, no.3, pp.808-814 (2007). 6) vol.j84-d-ii, no.6, pp.888-87, 2001. 7) vol.nl-188, pp.37-44 (2008). 8) CBAP vol.11, no.3, pp.3-68 (2004). ) T. Ohno, S. Matsubara, H. Kashioka, T. Maruyama, H. Tanaka, Y. Inagaki: Dependency Parsing of Japanese Monologue Using Clause Boundaries, Language Resources and Evaluation, vol.40, no.3-4, pp.263-27 (2007). 10) S. Matsubara, A. Takagi, N. Kawaguchi and Y. Inagaki: Bilingual Spoken Monologue Corpus for Simultaneous Machine Interpretation Research, Proc. 3rd LREC, pp.153-15 (2002). 11) L. Zhang: Maximum entropy modeling toolkit for python and c++, http://homepages.inf.ed.ac.uk/ s0450736/maxent toolkit.html (2007) [Online; accessed 6-September-2007]. 12) Julius vol.20 no.1 pp.41 4 (2005) 13) T. Kudo and Y. Matsumoto: Japanese Dependency Analyisis using Cascaded Chunking, Proc. 6th CoNLL, pp.63-6 (2002). 8 c 200 Information Processing Society of Japan