Vol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System

Similar documents
1 UD Fig. 1 Concept of UD tourist information system. 1 ()KDDI UD 7) ) UD c 2010 Information Processing S

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

Table 1. Reluctance equalization design. Fig. 2. Voltage vector of LSynRM. Fig. 4. Analytical model. Table 2. Specifications of analytical models. Fig

IPSJ SIG Technical Report * Wi-Fi Survey of the Internet connectivity using geolocation of smartphones Yoshiaki Kitaguchi * Kenichi Nagami and Yutaka

10_08.dvi

Study on Throw Accuracy for Baseball Pitching Machine with Roller (Study of Seam of Ball and Roller) Shinobu SAKAI*5, Juhachi ODA, Kengo KAWATA and Yu

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf

A Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

Vol. 48 No. 3 Mar PM PM PMBOK PM PM PM PM PM A Proposal and Its Demonstration of Developing System for Project Managers through University-Indus

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

A pp CALL College Life CD-ROM Development of CD-ROM English Teaching Materials, College Life Series, for Improving English Communica

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

e-learning e e e e e-learning 2 Web e-leaning e 4 GP 4 e-learning e-learning e-learning e LMS LMS Internet Navigware

soturon.dvi

音響モデル triphone 入力音声 音声分析 デコーダ 言語モデル N-gram bigram HMM の状態確率として利用 出力層 triphone: 3003 ノード リスコア trigram 隠れ層 2048 ノード X7 層 1 Structure of recognition syst

2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC

3D UbiCode (Ubiquitous+Code) RFID ResBe (Remote entertainment space Behavior evaluation) 2 UbiCode Fig. 2 UbiCode 2. UbiCode 2. 1 UbiCode UbiCode 2. 2

塗装深み感の要因解析

Web Stamps 96 KJ Stamps Web Vol 8, No 1, 2004

GPGPU

fiúŒ{„ê…Z…fi…^†[…j…–†[…X

,,.,.,,.,.,.,.,,.,..,,,, i

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

(1 ) (2 ) Table 1. Details of each bar group sheared simultaneously (major shearing unit). 208

IPSJ SIG Technical Report Vol.2011-CE-110 No /7/9 Bebras 1, 6 1, 2 3 4, 6 5, 6 Bebras 2010 Bebras Reporting Trial of Bebras Contest for K12 stud

3_23.dvi

MDD PBL ET 9) 2) ET ET 2.2 2), 1 2 5) MDD PBL PBL MDD MDD MDD 10) MDD Executable UML 11) Executable UML MDD Executable UML

Appropriate Disaster Preparedness Education in Classrooms According to Students Grade, from Kindergarten through High School Contrivance of an Educati

untitled

23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h

Fig. 4. Configuration of fatigue test specimen. Table I. Mechanical property of test materials. Table II. Full scale fatigue test conditions and test

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came


DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

HP HP ELF 7 52

2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server

21 Effects of background stimuli by changing speed color matching color stimulus

Fig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

9_18.dvi

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :


1 Kinect for Windows M = [X Y Z] T M = [X Y Z ] T f (u,v) w 3.2 [11] [7] u = f X +u Z 0 δ u (X,Y,Z ) (5) v = f Y Z +v 0 δ v (X,Y,Z ) (6) w = Z +

XFEL/SPring-8

0801297,繊維学会ファイバ11月号/報文-01-青山

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

HASC2012corpus HASC Challenge 2010,2011 HASC2011corpus( 116, 4898), HASC2012corpus( 136, 7668) HASC2012corpus HASC2012corpus

36 581/2 2012

ISAP- Integrated Structural Analysis Program for Piping Designs : Version IV ISAP- SAP- ISAP- 10 INPULS ( 3D-CAD ) ADAMS- GUI ( Graphical User Interfa

Transcription:

Vol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System Fumiaki Sugaya,,, Toshiyuki Takezawa, Eiichiro Sumita, Yoshinori Sagisaka, and Seiichi Yamamoto ATR-MATRIX speech translation system was developed at ATR Interpreting Telecommunications Research Laboratories (ATR-ITL). In this paper we explain the system s outline and its development process including the initial objective, corpus collection and its overall evaluation. Each of three major components of the system: speech recognition, language translation, and speech synthesis, introduced an innovative corpus-based technology. In the paper, however the explanation is focused to major topics in the overall system, while rendering appropriate references to detail explanations of specific technology. We also explain some experimental results: additional sessions improve the performance of the same task. 1. 1993 ATR ATR Interpreting Telecommunications Research Laboratories ITL 1) ITL ATR-MATRIX 2) ATR ATR Spoken Language Translation Research Laboratories KDDI Presently with KDDI R&D Laboratories, Inc. Presently with Graduate School of Kobe University Presently with Graduate School of Waseda University 3) VERBMOBIL 21) 5) ATR-MATRIX TOEIC ITL ITL ATR-MATRIX 2 ITL 3 ATR-MATRIX 2230

Vol. 43 No. 7 ATR-MATRIX 2231 4 5 ATR-MATRIX 6 ATR-MATRIX 7 8 ITL 9 2. 2.1 ITL ATR 1 ITL Spontaneous speech 15),16) spontaneous speech read speech spontaneous speech 2.2 18) SLDB 17) 9) 12) 98 ATR-MATRIX ATR-MATRIX 3. ATR-MATRIX 3.1 1 ATR-MATRIX 1 19) ITL SPREC 7) TDMT 13) CHATR 20) 1 ATR-MATRIX Fig. 1 Configuration of ATR-MATRIX speech translation system.

2232 July 2002 3.2 Lisp 100 msec PC Pentium III 450 MHz 1 1 Table 1 Task/domain in data collection. 2 Table 2 Rule for conversation proceeding. 4. 4.1 1 4.2 2 4 1 1 10 2 4 4.3 2 1 1 4.4

Vol. 43 No. 7 ATR-MATRIX 2233 3 Table 3 Feature comparison between monolingual and bilingual DB. 2 3 2 1 5. ATR-MATRIX 5.1 HMM ML-SSS 6) N-gram 10) 5.2 ATR-MATRIX TDMT 12) TDMT X Y X X X Y Y to X (( ), ( )...), Y at X (( ),...),... X Y Y to X Y at X X Y 13) JE JK JG EJ 5 6 4 A B C 98% 85% TDMT 1

2234 July 2002 Table 4 4 Rank criteria for translation evaluation. 5 Table 5 Data size used for language translation subsystem. Fig. 2 2 Relationship between translation rate and pattern extraction rate. 6 Table 6 Evaluation results for several language pairs. 2 2 1/2 85.0% 95.3% 5.3 TDMT 1 SPREC N 19) 5.4 TDMT

Vol. 43 No. 7 ATR-MATRIX 2235 Table 7 7 System s specification and host performance. Fig. 3 3 Configuration for end-to-end dialogue experiment. 14) TDMT 8 Table 8 Performances of subsystems. 6. 6.1 3 3),4) 1 SPREC TDMT CHATR 7 barge-in ATR-MATRIX LAN TV 8 ATR-MATRIX SPREC 7) ATR-MATRIX TOEIC MAP-VFS 6.2 3 1 1 1 GUI 2 1 3 5 3 6.3 6.3.1 perplexity 4 perplexity 5 6

2236 July 2002 9 Table 9 Data size of dialog tests. 4 Perplexity Fig. 4 Perplexity along dialogues. 5 Fig. 5 Session time along dialogues. 6 Fig. 6 Word accuracy along dialogues. 1 3 Perplexity 18.3% 23.8% 18.0% 20% 8) 1 3 2 2 6.3.2 1 0 90% 6.3.3 9 ATR SLDB 17) 6.8 SLDB 10.3 7 SLDB 23 330 SLTA1 8 8 A A+B A+B+C 4 A A A+B A B A+B+C A B C 7 8 SLDB 10.3 6.8 3.5 7 8 3.5 2% 10% 7

Vol. 43 No. 7 ATR-MATRIX 2237 Table 10 10 Data size of dialogue tests without attention to machine. Fig. 7 7 Word accuracy vs. sentence length. Fig. 8 8 Translation rate vs. sentence length. 8 7. 6 ATR- MATRIX 22) ATR-MATRIX 7.1 6 PC 9 Fig. 9 Effects of speaking style. TV 2 SPREC 10 18.5 10.3 8 7.2 9 3 8) 11)

2238 July 2002 3 1 2 3 9 7 9 (1) (2) (3) (4) (5) (6) (7) 9 7.4% 1 8.2% 1.2% SPREC 82.5% 83.04% 82.46% 83% 8. 8.1 1 85% 98% 8.2 1 1 1 1 2 3 9. 9.1 5) TOEIC 700 550 150 13 TOEIC 575 ATR-MATRIX

Vol. 43 No. 7 ATR-MATRIX 2239 PC 1 3.8 88.1% 85% 9.2 ATR-MATRIX ATR-MATRIX ATR ATR 1) ASURA Vol.37, No.9, pp.1726 1735 (1996). 2) Takezawa, T., Morimoto, T., Sagisaka, Y., Campbell, N., Iida, H., Sugaya, F., Yokoo, A. and Yamamoto, S.: A Japanese-to-English speech translation system: ATR-MATRIX, Proc. ICSLP 1998, pp.2779 2782 (1998). 3) Sugaya, F., Takezawa, T., Yokoo, A. and Yamamoto, S.: End-to-end evaluation in ATR- MATRIX: Speech translation system between English and Japanese, Proc. Eurospeech99, pp.2431 2434 (1999). 4) ATR-MATRIX SP2000-21, pp.39 45 (June 2000). 5) D-II Vol.J84-D-II, No.11, pp.2362 2370 (2001). 6) Ostendorf, M. and Singer, H.: HMM topology design using maximum likelihood successive state splitting, Computer Speech and Language, Vol.11, No.1, pp.17 41 (1997). 7) ATR-MATRIX 1998 2-Q-20 (Mar. 1998). 8) D-II Vol.J84-D-II, No.1, pp.31 40 (2001). 9) 1999 pp.169 170 (1999). 10) N-gram D- II Vol.J81-D-II, No.9, pp.1929 1936 (1998). 11) N-gram D-II Vol.J83-D-II, No.11, pp.2146 2151 (2000). 12) Vol.6, No.5, pp.63 91 (1999). 13) Sumita, E., Yamada, S., Yamamoto, K., Paul, M., Kashioka, H., Ishikawa, K. and Shirai, S.: Solutions to Problems Inherent in Spokenlanguage Translation: The ATR- MATRIX Approach, Proc. MT Summit 99, pp.229 235 (Sep. 1999). 14)

2240 July 2002 Vol.5, No.4, pp.111 125 (1998). 15) SP2000-95, pp.1 5 (Dec. 2000). 16) 99 SLP-31-2 (2000). 17) Morimoto, T., Uratani, N., Takezawa, T., Furuse, O., Sobashima, Y., Iida, H., Nakamura, A., Sagisaka, Y., Higuchi, N. and Yamazaki, Y.: A speech and language database for speech translation research, Proc. ICSLP 94, pp.1791 1794 (1994). 18) Vol.83, No.8, pp.604 611 (2000). 19) Vol.6, No.2, pp.83 95 (1999). 20) Campbell, N.: CHATR: A high-definition speech re-sequencing systems, Proc. ASA/ASJ Joint Meeting, pp.1223 1228 (1996). 21) Wahlster, W.: verbmobil: foundations of speech-to-speech translation, Springer (2000). 22) pp.117 124 (Feb. 2001). ( 13 11 16 ) ( 14 4 16 ) 57 59 KDD 3 9 ATR 13 4 14 4 KDDI 59 62 ATR ATR 55 57 11 ATR ACL 48 50 NTT 61 ATR IEEE

Vol. 43 No. 7 ATR-MATRIX 2241 47 49 9 ATR ATR 56 3 5 IEEE