

Ver.1.0 2004/3/23

1 CSJ

CSJ [1] CSJ

2 HMM

HTK[2] 3 left-to-right HMM triphone 3000 16 2 MLLR 1 CSJ 10 1 : 3

Table 1:

      787   186
      166    42
    + 953   228    GID   AM/CSJ-APS/hmmdefs.gz

      721   124
      822   134
    + 1543  258    GID   AM/CSJ-SPS/hmmdefs.gz

      1508  310
      988   176
    + 2496  486    GID   AM/CSJ-APS,SPS/hmmdefs.gz

2 CSJ segment.pdf

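2.1

16kHz 16bit 25msec 10msec MFCC 12 ΔMFCC 12 Power 1 25 CMS 2

Table 2:

    Sampling frequency     16 kHz
    Pre-emphasis           0.97
    Window                 Hamming
    Window length          25 ms
    Frame shift            10 ms
    Features               MFCC 12 + ΔMFCC 12 + Power 1 = 25
    Filter bank channels   24
    Normalisation          CMS

\[ \mathrm{Power} = \log \sum_{n=1}^{N} s_n^2 \tag{1} \]

\[ d_t = \frac{\sum_{\theta=1}^{\Theta} \theta \, (c_{t+\theta} - c_{t-\theta})}{2 \sum_{\theta=1}^{\Theta} \theta^2} \tag{2} \]

Θ = 2

The HTK config file is shown in Table 3.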
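Equations (1) and (2) can be sketched in Python as follows. This is a minimal illustration rather than the code used to build the models: the function names `log_power` and `delta` are hypothetical, and repeating the first and last frame at the edges is an assumed padding rule that this document does not state.

```python
import math

def log_power(samples):
    """Equation (1): log of the summed squared samples of one frame."""
    return math.log(sum(s * s for s in samples))

def delta(frames, theta_max=2):
    """Equation (2): regression delta over a list of coefficient vectors."""
    denom = 2 * sum(th * th for th in range(1, theta_max + 1))
    last = len(frames) - 1
    out = []
    for t in range(len(frames)):
        vec = []
        for k in range(len(frames[t])):
            num = 0.0
            for th in range(1, theta_max + 1):
                ahead = frames[min(t + th, last)][k]   # clamp at the end (assumed padding)
                behind = frames[max(t - th, 0)][k]     # clamp at the start (assumed padding)
                num += th * (ahead - behind)
            vec.append(num / denom)
        out.append(vec)
    return out
```

With Θ = 2 the denominator is 2(1 + 4) = 10, so a coefficient that rises by one per frame gets a delta of (1·2 + 2·4)/10 = 1 away from the edges. Note that `log_power` raises an error for an all-zero frame, which a real front end would have to guard against.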

    SOURCEFORMAT = NOHEAD
    SOURCEKIND = WAVEFORM
    SOURCERATE = 625
    TARGETKIND = MFCC_E_D_Z
    TARGETRATE = 100000.0
    SAVECOMPRESSED = F
    SAVEWITHCRC = F
    WINDOWSIZE = 250000.0
    USEHAMMING = T
    PREEMCOEF = 0.97
    NUMCHANS = 24
    NUMCEPS = 12
    ZMEANSOURCE = T
    ENORMALISE = F
    ESCALE = 1.0
    TRACE = 0
    RAWENERGY = F

Table 3: HTK config file
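HTK expresses its time-valued parameters in units of 100 ns, so the values in the config file can be cross-checked against the analysis conditions (16 kHz sampling, 25 ms window, 10 ms shift). The short sketch below does that conversion; the helper names are hypothetical and only the three time-valued parameters are considered.

```python
HTK_UNIT_SECONDS = 100e-9  # HTK time values are given in units of 100 ns

def htk_units_to_ms(value):
    """Convert an HTK time value (100 ns units) to milliseconds."""
    return value * HTK_UNIT_SECONDS * 1000.0

def sample_rate_hz(source_rate):
    """Convert SOURCERATE (sample period in 100 ns units) to a rate in Hz."""
    return 1.0 / (source_rate * HTK_UNIT_SECONDS)

# Time-valued parameters copied from the config file.
config = {"SOURCERATE": 625, "TARGETRATE": 100000.0, "WINDOWSIZE": 250000.0}

if __name__ == "__main__":
    print("sampling rate [Hz]:", round(sample_rate_hz(config["SOURCERATE"])))
    print("window length [ms]:", round(htk_units_to_ms(config["WINDOWSIZE"]), 3))
    print("frame shift   [ms]:", round(htk_units_to_ms(config["TARGETRATE"]), 3))
```

A sample period of 625 units is 62.5 µs, which corresponds to the 16 kHz sampling rate given in the analysis conditions.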

2.2

4 42 q sp silb sile 500 2.3 N a: o:

Table 4:

    a i u e o a: i: u: e: o:
    N w y j my ky by gy ny hy ry py
    p t k ts ch b d g z m n s sh h f r
    q sp silb sile

2.3

CSJ CSJ ? W (W ; ) --> (?, ) --> 5 500 silb sile 500 20 500 sp sp sp

Table 5:

    a i u e o
    ka ki ku ke ko
    ga gi gu ge go
    sa sh i su se so
    za ji zu ze zo
    ta ch i ts u te to
    da ji zu de do
    na ni nu ne no
    ha hi fu he ho
    ba bi bu be bo
    pa pi pu pe po
    ma mi mu me mo
    ra ri ru re ro
    wa o
    ya yu yo
    ky a ky u ky o
    gy a gy u gy o
    sh a sh u sh o
    ja ju jo
    ch a ch u ch o
    ny a ny u ny o
    hy a hy u hy o
    by a by u by o
    py a py u py o
    my a my u my o
    ry a ry u ry o
    ie sh e je ti tu ch e ts a ts i ts e ts o di du du nie he fa fi fe fo hy u bi me wi we wo ka ga sui ji teyu ba bi bu be bo
    N q :

2.4

silb sile sp IPA 3 6

Table 6:

    a:-k+a    a-k+a
    -a+ky     *-a+k
    ky-a+*    y-a+*

2.5

1 1 7 2 CSJ 3000

3 http://www.itakura.nuee.nagoya-u.ac.jp/~takeda/ipa/
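The triphone labels in the table above follow the HTK naming pattern `left-centre+right`. As a minimal sketch of how such labels are formed from a phoneme sequence, the hypothetical helper below expands a list of phonemes into triphone names. It deliberately ignores any special treatment of silb, sile and sp and any label substitutions, since the rules for those are not recoverable from this text.

```python
def to_triphones(phones):
    """Turn a phoneme sequence into HTK-style l-c+r triphone labels."""
    labels = []
    for i, centre in enumerate(phones):
        label = centre
        if i > 0:
            # prepend the left context
            label = phones[i - 1] + "-" + label
        if i < len(phones) - 1:
            # append the right context
            label = label + "+" + phones[i + 1]
        labels.append(label)
    return labels

if __name__ == "__main__":
    print(to_triphones(["k", "a", "k", "a"]))
```

For the sequence k a k a, the third phoneme becomes `a-k+a`, the same form as the entries listed in the table; the first and last phonemes only get one-sided context in this sketch.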

Table 7:

    L_Nasal           N-, n-, m-                   R_Nasal           +N, +n, +m
    L_Bilabial        p-, b-, f-, m-, w-           R_Bilabial        +p, +b, +f, +m, +w
    L_DeltalAlveolar  t-, d-, ts-, z-, s-, n-      R_DeltalAlveolar  +t, +d, +ts, +z, +s, +n
    L_PalatoAlveola   ch-, j-, sh-                 R_PalatoAlveola   +ch, +j, +sh
    L_Velar           k-, g-                       R_Velar           +k, +g
    L_Glottal         h-                           R_Glottal         +h
    L_YOUON           y-
    L_SOKUON          q-                           R_SOKUON          +q
    L_R               r-                           R_R               +r
    L_N               N-                           R_N               +N
    L_A               a-                           R_A               +a
    L_I               i-                           R_I               +i
    L_U               u-                           R_U               +u
    L_E               e-                           R_E               +e
    L_O               o-                           R_O               +o
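Each entry in the table above pairs a class name with a set of left contexts (`x-`) or right contexts (`+x`). The sketch below shows one way to test which of these classes a given triphone label satisfies. It is an illustration only: just six of the classes are copied in, the underscore-joined class names are an assumed spelling, and the helper names are hypothetical.

```python
# A few of the context classes, copied from the table.
# The underscore-joined names are an assumed spelling.
QUESTIONS = {
    "L_Nasal": ("left", {"N", "n", "m"}),
    "R_Nasal": ("right", {"N", "n", "m"}),
    "L_Velar": ("left", {"k", "g"}),
    "R_Velar": ("right", {"k", "g"}),
    "L_A": ("left", {"a"}),
    "R_A": ("right", {"a"}),
}

def split_triphone(label):
    """Split 'l-c+r' into (left, centre, right); missing contexts are None."""
    left, rest = label.split("-", 1) if "-" in label else (None, label)
    centre, right = rest.split("+", 1) if "+" in rest else (rest, None)
    return left, centre, right

def matching_questions(label):
    """Return the names of all classes that the triphone label satisfies."""
    left, _, right = split_triphone(label)
    context = {"left": left, "right": right}
    return sorted(
        name
        for name, (side, phones) in QUESTIONS.items()
        if context[side] in phones
    )

if __name__ == "__main__":
    print(matching_questions("a-k+a"))
    print(matching_questions("m-a+N"))
```

Keeping the side and the phoneme set together in one table entry means that adding the remaining classes only requires extending `QUESTIONS`.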

3

[3] 4 CSJ [4] 5 - - HTK [2] : LM/csj.htkdic 2 <sil> <sp> <sil> 1000msec <sp>

Table 8:

    <sil> [<sil>] silb
    <sil> [<sil>] sile
    <sp> [<sp>] sp
    + [ ] t e N
    + / [ ] j u: d e: b i:
    + / [ ] ju:rokupi:pi:esu
    + / [ ] ju:rokupi:piesu
    + [ ] wanwe:
    + [ ] wane:
    + / [ ] i ch i i: a: r u b i:
    + / [ ] nijiqke:
    + / [ ] nijuqke:
    + [ ] ts u: e:
    + / [ ] n i: d e: k e:
    + / [ ] n i: d i: k e:

CSJ CSJ 0.2 CSJ CSJ 2596 6.67M 4 3 25,300 27,249

4 pos.pdf
5 wdb.pdf
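The dictionary entries above follow the HTK dictionary layout of a word, an optional output symbol in square brackets, and a phoneme sequence. A minimal sketch of reading such a line is given below. The function name is hypothetical, and the sketch assumes that the word and the bracketed output symbol contain no spaces and that no pronunciation probability field is present.

```python
def parse_htk_dict_line(line):
    """Split 'WORD [OUTPUT] p1 p2 ...' into (word, output, phones)."""
    fields = line.split()
    word = fields[0]
    if len(fields) > 1 and fields[1].startswith("[") and fields[1].endswith("]"):
        # Explicit output symbol, e.g. "<sil> [<sil>] silb"
        return word, fields[1][1:-1], fields[2:]
    # No output symbol: the word itself is what the recogniser outputs
    return word, word, fields[1:]

if __name__ == "__main__":
    for entry in ["<sil> [<sil>] silb", "<sil> [<sil>] sile", "<sp> [<sp>] sp"]:
        print(parse_htk_dict_line(entry))
```

Both `<sil>` entries map the same word to different phoneme models (silb and sile), so a loader built on this would need to keep a list of pronunciations per word rather than a single one.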

4

3 N-gram CMU-Cambridge SLM toolkit ver.2[5] 6 2-gram csj.2gram.gz 3-gram csj.3gram.gz back-off Witten-Bell N-gram <sil> <sp> CSJ 30 : 10 7 9 CSJ 2592 6.67M 25K 0.7M 2.6M

Table 9:

                  2,592
              6,671,844
    1-gram       25,300
    2-gram      731,728
    3-gram    2,611,952

    : <sil> <sp>

5 CSJ

CSJ 10 3 1 10 test-set 1 5 5 10 test-set 2 5 5 10 test-set 3 10 [6][10] 3 2002 10 CSJ

6 http://mi.eng.cam.ac.uk/~prc14/toolkit.html
7 A01M0007, A01M0035, A01M0074, A02M0117, A03M0100, A05M0031, A06M0134, 3 [6][7][8][9]
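To make the back-off with Witten-Bell discounting mentioned above concrete, the sketch below estimates bigram probabilities on a toy corpus. It is a textbook-style illustration under stated assumptions, not a reproduction of what the CMU-Cambridge toolkit does internally: the function names are hypothetical, the unigram distribution is left unsmoothed, and the toy sentences are invented for the example.

```python
from collections import Counter, defaultdict

def train(sentences):
    """Collect unigram counts and bigram follower counts."""
    unigrams = Counter()
    bigrams = defaultdict(Counter)
    for words in sentences:
        unigrams.update(words)
        for prev, cur in zip(words, words[1:]):
            bigrams[prev][cur] += 1
    return unigrams, bigrams

def unigram_prob(unigrams, w):
    """Maximum-likelihood unigram probability (unsmoothed, for brevity)."""
    return unigrams[w] / sum(unigrams.values())

def bigram_prob(unigrams, bigrams, prev, w):
    """Back-off bigram probability with Witten-Bell discounting."""
    followers = bigrams.get(prev)
    if not followers:
        return unigram_prob(unigrams, w)   # history never seen: use unigram
    total = sum(followers.values())        # tokens observed after prev
    types = len(followers)                 # distinct words observed after prev
    if w in followers:
        return followers[w] / (total + types)
    reserved = types / (total + types)     # mass kept aside for unseen words
    unseen_mass = 1.0 - sum(unigram_prob(unigrams, v) for v in followers)
    if unseen_mass <= 0.0:
        return 0.0                         # every known word already follows prev
    return reserved * unigram_prob(unigrams, w) / unseen_mass

if __name__ == "__main__":
    uni, bi = train([["a", "b", "a", "c"], ["a", "b"]])
    for w in sorted(uni):
        print("P(%s | a) = %.3f" % (w, bigram_prob(uni, bi, "a", w)))
```

The number of distinct followers of a history controls how much probability is reserved for unseen words, so histories followed by many different words back off more heavily than histories with a few frequent followers.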

30 CSJ test-set 2 A01M0056 ID S05M0613, R00M0187, D01M0019, D04M0056, D02M0028, D03M0017

Table 10: CSJ

    test-set 1 (10; 10)
        A01M0097 A01M0110 A01M0137 A03M0106 A03M0112
        A03M0156 A04M0051 A04M0121 A04M0123 A05M0011
    test-set 2 (10; 5 + 5)
        A01M0056 A01M0141 A02M0012 A03M0016 A06M0064
        A01F0001 A01F0034 A01F0063 A03F0072 A06F0135
    test-set 3 (10; 5 + 5)
        S00M0008 S00M0070 S00M0079 S00M0112 S00M0213
        S00F0019 S00F0066 S00F0148 S00F0152 S01F0105

[1] T. Kawahara, H. Nanjo, T. Shinozaki, and S. Furui. Benchmark Test for Speech Recognition using the Corpus of Spontaneous Japanese. In Proc. ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, pp. 135-138, 2003.

[2] P.C. Woodland, C.J. Leggetter, J.J. Odell, V. Valtchev, and S.J. Young. The 1994 HTK Large Vocabulary Speech Recognition System. In IEEE Int'l Conf. on Acoustics, Speech & Signal Processing (ICASSP), Vol. 1, pp. 73-76, 1995.

[3] .., pp. 21-28, Feb. 2001.

[4] ,.., pp. 33-38, Feb. 2002.

[5] P.R. Clarkson and R. Rosenfeld. Statistical Language Modeling using the CMU-Cambridge Toolkit. In Proc. European Conf. Speech Communication & Technology (EUROSPEECH), pp. 2707-2710, 1997.

[6] ,.., Vol. 43, No. 7, pp. 2098-2107, 2002.

[7] T. Shinozaki and S. Furui. Towards Automatic Transcription of Spontaneous Presentations. In Proc. European Conf. Speech Communication & Technology (EUROSPEECH), pp. 491-494, 2001.

[8] H. Nanjo and T. Kawahara. Speaking-Rate Dependent Decoding and Adaptation for Spontaneous Lecture Speech Recognition. In IEEE Int'l Conf. on Acoustics, Speech & Signal Processing (ICASSP), pp. 725-728, 2002.

[9] ,,,.., Vol. J86-DII, No. 4, pp. 450-459, 2003.

[10] T. Shinozaki and S. Furui. Analysis on Individual Differences in Automatic Transcription of Spontaneous Presentations. In IEEE Int'l Conf. on Acoustics, Speech & Signal Processing (ICASSP), Vol. 1, pp. 729-732, 2002.

606-8501 4F kawahara@i.kyoto-u.ac.jp