Vol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System

Similar documents
Vol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe

1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

untitled

1 UD Fig. 1 Concept of UD tourist information system. 1 ()KDDI UD 7) ) UD c 2010 Information Processing S

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

IPSJ SIG Technical Report Vol.2013-GN-86 No.35 Vol.2013-CDS-6 No /1/17 1,a) 2,b) (1) (2) (3) Development of Mobile Multilingual Medical

IPSJ SIG Technical Report Vol.2012-HCI-149 No /7/20 1 1,2 1 (HMD: Head Mounted Display) HMD HMD,,,, An Information Presentation Method for Weara

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

Table 1. Reluctance equalization design. Fig. 2. Voltage vector of LSynRM. Fig. 4. Analytical model. Table 2. Specifications of analytical models. Fig

IPSJ SIG Technical Report * Wi-Fi Survey of the Internet connectivity using geolocation of smartphones Yoshiaki Kitaguchi * Kenichi Nagami and Yutaka

10_08.dvi

Study on Throw Accuracy for Baseball Pitching Machine with Roller (Study of Seam of Ball and Roller) Shinobu SAKAI*5, Juhachi ODA, Kengo KAWATA and Yu

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

900 GPS GPS DGPS Differential GPS RTK-GPS Real Time Kinematic GPS 2) DGPS RTK-GPS GPS GPS Wi-Fi 3) RFID 4) M-CubITS 5) Wi-Fi PSP PlayStation Portable

A Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

2014/1 Vol. J97 D No. 1 2 [2] [3] 1 (a) paper (a) (b) (c) 1 Fig. 1 Issues in coordinating translation services. (b) feast feast feast (c) Kran

Vol. 48 No. 3 Mar PM PM PMBOK PM PM PM PM PM A Proposal and Its Demonstration of Developing System for Project Managers through University-Indus

3807 (3)(2) ,267 1 Fig. 1 Advertisement to the author of a blog. 3 (1) (2) (3) (2) (1) TV 2-0 Adsense (2) Web ) 6) 3

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

A pp CALL College Life CD-ROM Development of CD-ROM English Teaching Materials, College Life Series, for Improving English Communica

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

e-learning e e e e e-learning 2 Web e-leaning e 4 GP 4 e-learning e-learning e-learning e LMS LMS Internet Navigware

Vol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka

soturon.dvi

音響モデル triphone 入力音声 音声分析 デコーダ 言語モデル N-gram bigram HMM の状態確率として利用 出力層 triphone: 3003 ノード リスコア trigram 隠れ層 2048 ノード X7 層 1 Structure of recognition syst

Grund.dvi

1

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

3D UbiCode (Ubiquitous+Code) RFID ResBe (Remote entertainment space Behavior evaluation) 2 UbiCode Fig. 2 UbiCode 2. UbiCode 2. 1 UbiCode UbiCode 2. 2

知能と情報, Vol.30, No.5, pp

2

塗装深み感の要因解析

.,,, [12].,, [13].,,.,, meal[10]., [11], SNS.,., [14].,,.,,.,,,.,,., Cami-log, , [15], A/D (Powerlab ; ), F- (F-150M, ), ( PC ).,, Chart5(ADIns

Vol.53 No (Mar. 2012) 1, 1,a) 1, 2 1 1, , Musical Interaction System Based on Stage Metaphor Seiko Myojin 1, 1,a

Web Stamps 96 KJ Stamps Web Vol 8, No 1, 2004

IPSJ SIG Technical Report Vol.2013-SLP-97 No /7/27 1 2,1 1 ( ) ( ) Phyno 1 (Phyno) PC Evaluation of Superiority of Robot Agent in Spoken Dialog

IPSJ SIG Technical Report Vol.2011-UBI-30 No /5/ , 1 1 Evaluation on Effect of Presenting False Information for Biological Information Vi

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

GPGPU

fiúŒ{„ê…Z…fi…^†[…j…–†[…X

,,.,.,,.,.,.,.,,.,..,,,, i

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

Instability of Aerostatic Journal Bearings with Porous Floating Bush at High Speeds Masaaki MIYATAKE *4, Shigeka YOSHIMOTO, Tomoaki CHIBA and Akira CH

A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

(1 ) (2 ) Table 1. Details of each bar group sheared simultaneously (major shearing unit). 208

a) Extraction of Similarities and Differences in Human Behavior Using Singular Value Decomposition Kenichi MISHIMA, Sayaka KANATA, Hiroaki NAKANISHI a

IPSJ SIG Technical Report Vol.2010-SLDM-144 No.50 Vol.2010-EMB-16 No.50 Vol.2010-MBL-53 No.50 Vol.2010-UBI-25 No /3/27 Twitter IME Twitte

IPSJ SIG Technical Report Vol.2011-CE-110 No /7/9 Bebras 1, 6 1, 2 3 4, 6 5, 6 Bebras 2010 Bebras Reporting Trial of Bebras Contest for K12 stud

IPSJ SIG Technical Report Vol.2009-DPS-141 No.23 Vol.2009-GN-73 No.23 Vol.2009-EIP-46 No /11/27 t-room t-room 2 Development of

3_23.dvi

( ) fnirs ( ) An analysis of the brain activity during playing video games: comparing master with not master Shingo Hattahara, 1 Nobuto Fuji

MDD PBL ET 9) 2) ET ET 2.2 2), 1 2 5) MDD PBL PBL MDD MDD MDD 10) MDD Executable UML 11) Executable UML MDD Executable UML

3_39.dvi

Appropriate Disaster Preparedness Education in Classrooms According to Students Grade, from Kindergarten through High School Contrivance of an Educati

untitled

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

Fig. 4. Configuration of fatigue test specimen. Table I. Mechanical property of test materials. Table II. Full scale fatigue test conditions and test

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came


DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

1 Table 1: Identification by color of voxel Voxel Mode of expression Nothing Other 1 Orange 2 Blue 3 Yellow 4 SSL Humanoid SSL-Vision 3 3 [, 21] 8 325

HP HP ELF 7 52

2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server

21 Effects of background stimuli by changing speed color matching color stimulus

独立行政法人情報通信研究機構 Development of the Information Analysis System WISDOM KIDAWARA Yutaka NICT Knowledge Clustered Group researched and developed the infor

Fig. 1. Schematic drawing of testing system. 71 ( 1 )

Fig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

IPSJ SIG Technical Report Vol.2009-CVIM-167 No /6/10 Real AdaBoost HOG 1 1 1, 2 1 Real AdaBoost HOG HOG Real AdaBoost HOG A Method for Reducing

kiyo5_1-masuzawa.indd

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

9_18.dvi

2005ITRC-symposium-presen

IPSJ SIG Technical Report Vol.2014-HCI-158 No /5/22 1,a) 2 2 3,b) Development of visualization technique expressing rainfall changing conditions

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :


1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +

XFEL/SPring-8

0801297,繊維学会ファイバ11月号/報文-01-青山

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

Microsoft Word JELS2009再再投稿丸島スタイル適用01_32-43a.doc

IPSJ SIG Technical Report Vol.2013-GN-87 No /3/ Research of a surround-sound field adjustmen system based on loudspeakers arrangement Ak

Vol.11-HCI-15 No. 11//1 Xangle 5 Xangle 7. 5 Ubi-WA Finger-Mount 9 Digitrack 11 1 Fig. 1 Pointing operations with our method Xangle Xa

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

36 581/2 2012

ISAP- Integrated Structural Analysis Program for Piping Designs : Version IV ISAP- SAP- ISAP- 10 INPULS ( 3D-CAD ) ADAMS- GUI ( Graphical User Interfa

Transcription:

Vol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System Fumiaki Sugaya,,, Toshiyuki Takezawa, Eiichiro Sumita, Yoshinori Sagisaka, and Seiichi Yamamoto ATR-MATRIX speech translation system was developed at ATR Interpreting Telecommunications Research Laboratories (ATR-ITL). In this paper we explain the system s outline and its development process including the initial objective, corpus collection and its overall evaluation. Each of three major components of the system: speech recognition, language translation, and speech synthesis, introduced an innovative corpus-based technology. In the paper, however the explanation is focused to major topics in the overall system, while rendering appropriate references to detail explanations of specific technology. We also explain some experimental results: additional sessions improve the performance of the same task. 1. 1993 ATR ATR Interpreting Telecommunications Research Laboratories ITL 1) ITL ATR-MATRIX 2) ATR ATR Spoken Language Translation Research Laboratories KDDI Presently with KDDI R&D Laboratories, Inc. Presently with Graduate School of Kobe University Presently with Graduate School of Waseda University 3) VERBMOBIL 21) 5) ATR-MATRIX TOEIC ITL ITL ATR-MATRIX 2 ITL 3 ATR-MATRIX 2230

Vol. 43 No. 7 ATR-MATRIX 2231 4 5 ATR-MATRIX 6 ATR-MATRIX 7 8 ITL 9 2. 2.1 ITL ATR 1 ITL Spontaneous speech 15),16) spontaneous speech read speech spontaneous speech 2.2 18) SLDB 17) 9) 12) 98 ATR-MATRIX ATR-MATRIX 3. ATR-MATRIX 3.1 1 ATR-MATRIX 1 19) ITL SPREC 7) TDMT 13) CHATR 20) 1 ATR-MATRIX Fig. 1 Configuration of ATR-MATRIX speech translation system.

2232 July 2002 3.2 Lisp 100 msec PC Pentium III 450 MHz 1 1 Table 1 Task/domain in data collection. 2 Table 2 Rule for conversation proceeding. 4. 4.1 1 4.2 2 4 1 1 10 2 4 4.3 2 1 1 4.4

Vol. 43 No. 7 ATR-MATRIX 2233 3 Table 3 Feature comparison between monolingual and bilingual DB. 2 3 2 1 5. ATR-MATRIX 5.1 HMM ML-SSS 6) N-gram 10) 5.2 ATR-MATRIX TDMT 12) TDMT X Y X X X Y Y to X (( ), ( )...), Y at X (( ),...),... X Y Y to X Y at X X Y 13) JE JK JG EJ 5 6 4 A B C 98% 85% TDMT 1

2234 July 2002 Table 4 4 Rank criteria for translation evaluation. 5 Table 5 Data size used for language translation subsystem. Fig. 2 2 Relationship between translation rate and pattern extraction rate. 6 Table 6 Evaluation results for several language pairs. 2 2 1/2 85.0% 95.3% 5.3 TDMT 1 SPREC N 19) 5.4 TDMT

Vol. 43 No. 7 ATR-MATRIX 2235 Table 7 7 System s specification and host performance. Fig. 3 3 Configuration for end-to-end dialogue experiment. 14) TDMT 8 Table 8 Performances of subsystems. 6. 6.1 3 3),4) 1 SPREC TDMT CHATR 7 barge-in ATR-MATRIX LAN TV 8 ATR-MATRIX SPREC 7) ATR-MATRIX TOEIC MAP-VFS 6.2 3 1 1 1 GUI 2 1 3 5 3 6.3 6.3.1 perplexity 4 perplexity 5 6

2236 July 2002 9 Table 9 Data size of dialog tests. 4 Perplexity Fig. 4 Perplexity along dialogues. 5 Fig. 5 Session time along dialogues. 6 Fig. 6 Word accuracy along dialogues. 1 3 Perplexity 18.3% 23.8% 18.0% 20% 8) 1 3 2 2 6.3.2 1 0 90% 6.3.3 9 ATR SLDB 17) 6.8 SLDB 10.3 7 SLDB 23 330 SLTA1 8 8 A A+B A+B+C 4 A A A+B A B A+B+C A B C 7 8 SLDB 10.3 6.8 3.5 7 8 3.5 2% 10% 7

Vol. 43 No. 7 ATR-MATRIX 2237 Table 10 10 Data size of dialogue tests without attention to machine. Fig. 7 7 Word accuracy vs. sentence length. Fig. 8 8 Translation rate vs. sentence length. 8 7. 6 ATR- MATRIX 22) ATR-MATRIX 7.1 6 PC 9 Fig. 9 Effects of speaking style. TV 2 SPREC 10 18.5 10.3 8 7.2 9 3 8) 11)

2238 July 2002 3 1 2 3 9 7 9 (1) (2) (3) (4) (5) (6) (7) 9 7.4% 1 8.2% 1.2% SPREC 82.5% 83.04% 82.46% 83% 8. 8.1 1 85% 98% 8.2 1 1 1 1 2 3 9. 9.1 5) TOEIC 700 550 150 13 TOEIC 575 ATR-MATRIX

Vol. 43 No. 7 ATR-MATRIX 2239 PC 1 3.8 88.1% 85% 9.2 ATR-MATRIX ATR-MATRIX ATR ATR 1) ASURA Vol.37, No.9, pp.1726 1735 (1996). 2) Takezawa, T., Morimoto, T., Sagisaka, Y., Campbell, N., Iida, H., Sugaya, F., Yokoo, A. and Yamamoto, S.: A Japanese-to-English speech translation system: ATR-MATRIX, Proc. ICSLP 1998, pp.2779 2782 (1998). 3) Sugaya, F., Takezawa, T., Yokoo, A. and Yamamoto, S.: End-to-end evaluation in ATR- MATRIX: Speech translation system between English and Japanese, Proc. Eurospeech99, pp.2431 2434 (1999). 4) ATR-MATRIX SP2000-21, pp.39 45 (June 2000). 5) D-II Vol.J84-D-II, No.11, pp.2362 2370 (2001). 6) Ostendorf, M. and Singer, H.: HMM topology design using maximum likelihood successive state splitting, Computer Speech and Language, Vol.11, No.1, pp.17 41 (1997). 7) ATR-MATRIX 1998 2-Q-20 (Mar. 1998). 8) D-II Vol.J84-D-II, No.1, pp.31 40 (2001). 9) 1999 pp.169 170 (1999). 10) N-gram D- II Vol.J81-D-II, No.9, pp.1929 1936 (1998). 11) N-gram D-II Vol.J83-D-II, No.11, pp.2146 2151 (2000). 12) Vol.6, No.5, pp.63 91 (1999). 13) Sumita, E., Yamada, S., Yamamoto, K., Paul, M., Kashioka, H., Ishikawa, K. and Shirai, S.: Solutions to Problems Inherent in Spokenlanguage Translation: The ATR- MATRIX Approach, Proc. MT Summit 99, pp.229 235 (Sep. 1999). 14)

2240 July 2002 Vol.5, No.4, pp.111 125 (1998). 15) SP2000-95, pp.1 5 (Dec. 2000). 16) 99 SLP-31-2 (2000). 17) Morimoto, T., Uratani, N., Takezawa, T., Furuse, O., Sobashima, Y., Iida, H., Nakamura, A., Sagisaka, Y., Higuchi, N. and Yamazaki, Y.: A speech and language database for speech translation research, Proc. ICSLP 94, pp.1791 1794 (1994). 18) Vol.83, No.8, pp.604 611 (2000). 19) Vol.6, No.2, pp.83 95 (1999). 20) Campbell, N.: CHATR: A high-definition speech re-sequencing systems, Proc. ASA/ASJ Joint Meeting, pp.1223 1228 (1996). 21) Wahlster, W.: verbmobil: foundations of speech-to-speech translation, Springer (2000). 22) pp.117 124 (Feb. 2001). ( 13 11 16 ) ( 14 4 16 ) 57 59 KDD 3 9 ATR 13 4 14 4 KDDI 59 62 ATR ATR 55 57 11 ATR ACL 48 50 NTT 61 ATR IEEE

Vol. 43 No. 7 ATR-MATRIX 2241 47 49 9 ATR ATR 56 3 5 IEEE