VTLN Maximum Likelihood Liniear Regression; MLLR [3] x Ax + c MLLR A, c SI / [] [] SI Localized Affine Invarian Feaure; LAIF [] LAIF LAIF MFCC / Merin
|
|
- かねろう のじま
- 5 years ago
- Views:
Transcription
1 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE {suzuki,qiao,mine,hirose}@gavou-okyoacjp (Localized Affine Invarian Feaures; LAIF LAIF LAIF LAIF LAIF LAIF MFCC MFCC MFCC+ MFCC+LAIF MFCC+ MFCC 37% Speech recogniion using localized affine invarian feaures Masayuki SUZUKI, Yu QIAO, Nobuaki MINEMATSU, and Keikichi HIROSE Grad School of Engineering, Univ of Tokyo 7 3, Hongo, Bunkyo-ku, Tokyo, 3 Japan Grad School of Info Sci and Tech, Univ of Tokyo 7 3, Hongo, Bunkyo-ku, Tokyo, 3 33 Japan {suzuki,qiao,mine,hirose}@gavou-okyoacjp Absrac This paper proposes localized affine invarian feaures (LAIFs for speaker-independen auomaic speech recogniion The LAIFs can be calculaed direcly from daa sequences As speaker variaions can be approximaed well by affine ransform in a cepsral space, he LAIFs can provide robus feaures wih respec o hose variaions This fac inspires us o expec ha he use of he LAIFs should improve he recogniion performance especially when no raining daa is available for speaker normalizaion or adapaion To verify his expecaion, we apply LAIFs for isolaed word recogniion The experimenal resuls show ha he combinaion of LAIFs wih MFCC or MFCC+ MFCC can lead o higher performances han MFCC or MFCC+ MFCC only Especially in mismached condiions, MFCC+ MFCC+LAIFs can reduce he error raes by 37% when compared o MFCC+ MFCC only Key words Acousic feaures, Speaker independen ASR, Affine ransform, Localized feaures, Speaker invariance Speaker Independen; SI Speaker Dependen; SD [] SI SD Vocal Trac Lengh Normalizaion; VTLN []
2 VTLN Maximum Likelihood Liniear Regression; MLLR [3] x Ax + c MLLR A, c SI / [] [] SI Localized Affine Invarian Feaure; LAIF [] LAIF LAIF MFCC / Merins Irino [7] [9] Warping Invarian Feaure; WIF LAIF WIF LAIF LAIF Invarian Srucure Represenaion; ISR [] [] ISR LAIF HMM LAIF ISR LAIF LAIF LAIF LAIF X = [x, x,, x T ] d x x x = Ax + c ( A d d c d X X = [x, x,, x T ] LAIF X X k :+k = [x k, x k +,, x,, x +k ] X k :+k F (X k :+k = F (X k :+k =,, T ( F F F (X k :+k k k (X k :+k k = k = k k τ= (X k:+k = τ (x +τ x τ k τ (3 τ= (X k:+k = (X k:+k LAIF F (X k :+k LAIF F (X k :+k = (µ b µ a T (Σ a + Σ b (µ b µ a (
3 µ Σ a [ k,, ] b [,, + k ] µ a Σ a µ a = x τ ( k τ= k Σ a = (x τ µ a (x τ µ a T ( k τ= k ML µ b Σ b LAIF µ a Σ a µ a Σ a µ a = Aµ a + c (7 Σ a = AΣ aa T ( F (X k :+k = (µ b µ a T (Σ a + Σ b (µ b µ a = (Aµ b Aµ a T (A(Σ a + Σ b A T (Aµ b Aµ a = (µ b µ a T A T (A T (Σ a + Σ b A A(µ b µ a = F (X k :+k (9 ( LAIF ( LAIF [] X i k :i+k W X i k :i+k W ( LAIF (7, (9 ( LAIF F (X k :+k ( = n F X n k :n+k = m F X m k :m+k X n k :n+k X m k :m+k LAIF LAIF LAIF [] x Ax + c A A [3], [] [] A [] x x (, x ( LAIF LAIF ( ( ( x ( A ( = A ( x ( x ( = A ( x ( + c ( ( x ( = A ( x ( + c ( ( x ( x ( + ( c ( c ( ( A A s s sream : (x (, x (,, x (s sream : (x (, x (3,, x (s+ sream d s + : (x (d s+, x (d s+,, x (d LAIF A s s s LAIF F (X k :+k d s + [F ( (X k :+k,, F (d s+ (X k :+k ] T 3
4 d cepsrum sequence T x ( x ( x (3 x (d muli sream parameerizaion sream s k +k sream s sream d s + s a x ( x (s b k k + (µ b µa T (Σa + Σb (µb µa LAIF F ( (X k:+k F ( (X k:+k F (d s+ (X k:+k Fig LAIF Calculaion of LAIFs wih muli sream parameerizaion LAIF s = LAIF A Ax + c ( LAIF f (X k :+k = (µ a µ b T (σa + σb (µ a µ b = µ a µ b σ a + σ b (3 s µ σ µ a µ b k = k = k k µ a µ b = w τ (x +τ x τ τ= ( w τ τ /k w τ w τ ( LAIF w τ = τ/( k τ= τ ( (3 (3 s = LAIF s LAIF LAIF [] LAIF - (Specro-Temporal Feaures; STF STF Muroi [] LAIF Muroi Ax + c A [] LAIF 3 LAIF LAIF k, k khz k = k + = k + k Hz Kanedera [7] s LAIF MFCC /aiueo/ sraigh [] 7 LAIF s = s = LAIF MFCC MFCC LAIF MFCC k Hidden-Markov- Toolki(HTK []
5 (a /aiueo/ (a /aiueo/ (b MFCC /aiueo/ (b MFCC/aiueo/ (c LAIF s= /aiueo/ (c LAIF s= /aiueo/ (d LAIF s= /aiueo/ (d LAIF s= /aiueo/ MFCC LAIF Fig Comparison beween oupus of mel filer bank, MFCCs and LAIFs LAIF s= MFCC 3 [9] 3 3 bi/khz bi/khz msec msec 97z DCT MFCC MFCC LAIF Table Acousic feaures used for he experimen Feaures(# of dimension MFCC( MFCC( + LAIF s= ( MFCC( + LAIF s= ( MFCC( + MFCC( MFCC( + MFCC( + LAIF s= ( MFCC( + MFCC( + LAIF s= ( 3 HMM LAIF HMM lef-o-righ HMM
6 Table Recogniion resul M,, and L denoe MFCC, dela cofficiens of MFCC, and LAIF, respecively s means block size for muli sream parameerizaion Mehod M M+L s= M+L s= M+ M+ +L s= M+ +L s= Mached condiion 93% 99% 9% 997% 99% 9939% Male raining Female esing 77% 3% 33% 79% 3% 97% Female raining Male esing 79% 3% 3% 3% 9% 97% Mached condiion MFCC MFCC+ MFCC LAIF MFCC LAIF s= % MFCC+ MFCC LAIF s= 37% s s = s = LAIF LAIF MFCC LAIF SI LAIF SI LAIF+MFCC+ MFCC 37% LAIF SI [] S Young eal, The HTK Book (for HTK Version 3 [] EEide and HGish, A parameric approach o vocal rac lengh normalizaion, Proc In Conf Acousics, Speech, and Signal Processing, vol, pp3 3, 99 [3] CJ Leggeer and PC Woodland, Mazimum likelihood speaker adapaion of coninuous densiy hidden Markov models, Compuer Speech and Language, Vol 9, pp 7, 99 [],,, volj7 D II, no, pp37 3, [] R Gomez, T Toda, H Saruwaari, K Shikano, Techniques in rapid unsupervised speaker adapaion based on HMMsufficien saisics, Speech Communicaion, vol, pp 7 9 [] Y Qiao, M Suzuki, N Minemasu, Affine invarian feaures and is applicaion o speech recogniion, Proc In Conf Acousics, Speech, and Signal Processing, 9 (submied [7] A Merins and J Rademacher, Frequency-warping invarian feaures for auomaic speech recogniion, Proc IEEE In Conf Acousics, Speech, and Signal Processing, vol, pp, [] J Rademacher, M Wacher and A Merins, Improved warping-invarian feaures for auomaic speech recogniion, Proc In Conf Acousics, Speech, and Signal Processing, pp 99 [9] T Irino and R D Paerson, Segregaing informaion abou he size and shape of he vocal rac using a imedomain audiory model: The sabilised wavele-mellin ransform, Speech Communicaion, vol, pp 3 [] N Minemasu, Mahmaical evidence of he acousic universal srucure in speech, Proc In Conf Acousics, Speech, and Signal Processing, pp 9 9 [] S Asakawa, N Minemasu, K Hirose, Muli-sream parameerizaion for srucural speech recogniion, Proc In Conf Acousics, Speech, and Signal Processing, pp 97, [],,,,,, SP 3, pp73 7, [3] M Piz and HNey, Vocal rac normalizaion equals linear ransformaion in cepsral space, IEEE Trans Speech and Audio Processing, vol3, pp93 9, [],,,, volj3 D II, no, pp 7, [] T Muroi, T Takiguchi, Y Ariki Speaker Independen Phoneme Recogniion Based on Fisher Weigh Map, Inernaional Journal of Hybrid Informaion Technology, Vol, No 3, 9 [],,,,,, SP7, pp9 9, 7 [7] N Kanedera, T Arai, H Hermansky, and M Pavel, On he relaive imporance of various componens of he modulaion specrum for auomaic speech recogniion, Speech Communicaion, vol, no, pp 3, 999 [] H Kawahara, STRAIGHT, Exploraion of he oher aspec of VOCODER: Percepually isomorphic decomposiion of speech sounds, Acousic Science and Technology, Vol 7, No [9] -,, vol, no, pp99 9, 99
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. E-mail: {ytamura,takai,tkato,tm}@vision.kuee.kyoto-u.ac.jp Abstract Current Wave Pattern Analysis for Anomaly
More informationIPSJ SIG Technical Report Vol.2010-CVIM-172 No /5/ Object Tracking Based on Generative Appearance Model 1. ( 1 ) ( 2 ) ( 3 ) 1 3) T
1 2 2 3 1 Objec Tracking Based on Generaive Appearance Model 1. ( 1 ) ( 2 ) ( 3 ) 1 3) Tasuya YONEKAWA, 1 Kazuhiko KAWAMOTO, 2 Asushi IMIYA 2 and Akihiro SUGIMOTO 3 We propose a mehod for racking objecs
More information(MIRU2008) HOG Histograms of Oriented Gradients (HOG)
(MIRU2008) 2008 7 HOG - - E-mail: katsu0920@me.cs.scitec.kobe-u.ac.jp, {takigu,ariki}@kobe-u.ac.jp Histograms of Oriented Gradients (HOG) HOG Shape Contexts HOG 5.5 Histograms of Oriented Gradients D Human
More information1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.,, 464 8601 470 0393 101 464 8601 E-mail: matsunagah@murase.m.is.nagoya-u.ac.jp, {ide,murase,hirayama}@is.nagoya-u.ac.jp,
More information研究報告用MS-Wordテンプレートファイル
HMM 3 HMM 1 () HMM (PARCOR ) One-model Speech Recogniion and Synhesis Based on Movemen HMMs Tsuneo Nia, Takumi Takei, Masashi Kimura, and Kouichi Kasurada Speech recogniion and synhesis have been designed
More informationIPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and
MIDI 1 2 3 2 1 Modeling Performance Indeterminacies for Polyphonic Midi Score Following and Its Application to Automatic Accompaniment Nakamura Eita 1 Yamamoto Ryuichi 2 Saito Yasuyuki 3 Sako Shinji 2
More information○松本委員
CIRJE-J-100 2003 11 CIRJE hp://www.e.u-okyo.ac.jp/cirje/research/03research02dp_j.hml Credi Risk Modeling Approaches 2003 11 17 Absrac This aricle originaes from a speech given by he auhor in he seminar
More information「霧」や「もや」などをクリアにする高速画像処理技術
Fas Single-Image Defogging 谭志明 白向晖 王炳融 東明浩 あらまし CPU GPU720 48050 fps Absrac Bad weaher condiions such as fog, haze, and dus ofen reduce he performance of oudoor cameras. In order o improve he visibiliy
More informationTF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat
1 1 2 1. TF-IDF TDF-IDF TDF-IDF. 3 18 6 Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Satoshi Date, 1 Teruaki Kitasuka, 1 Tsuyoshi Itokawa 2
More information11 22 33 12 23 1 2 3, 1 2, U2 3 U 1 U b 1 (o t ) b 2 (o t ) b 3 (o t ), 3 b (o t ) MULTI-SPEAKER SPEECH DATABASE Training Speech Analysis Mel-Cepstrum, logf0 /context1/ /context2/... Context Dependent
More information第 1 回バイオメトリクス研究会 ( 早稲田大学 ) THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS Proceedings of Biometrics Workshop,169
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS Proceedings of Biometrics Workshop,169-8555 3-4-1,169-8555 3-4-1 E-mail: s hayashi@kom.comm.waseda.ac.jp, ohki@suou.waseda.jp Wolf
More information経済論集 44‐1(よこ)/2.李
PC PC IT PC IT ! 1 The Archimedes Project 2 1992 TAS Total Access System 3 itaskintelligent Total Access System 4 Ho alauna 5 1 PC IT IT Archimedes at StanfordTASTotal Access System itaskintelligent Total
More information音響モデル triphone 入力音声 音声分析 デコーダ 言語モデル N-gram bigram HMM の状態確率として利用 出力層 triphone: 3003 ノード リスコア trigram 隠れ層 2048 ノード X7 層 1 Structure of recognition syst
1,a) 1 1 1 deep neural netowrk(dnn) (HMM) () GMM-HMM 2 3 (CSJ) 1. DNN [6]. GPGPU HMM DNN HMM () [7]. [8] [1][2][3] GMM-HMM Gaussian mixture HMM(GMM- HMM) MAP MLLR [4] [3] DNN 1 1 triphone bigram [5]. 2
More information1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2
CHLAC 1 2 3 3,. (CHLAC), 1).,.,, CHLAC,.,. Suspicious Behavior Detection based on CHLAC Method Hideaki Imanishi, 1 Toyohiro Hayashi, 2 Shuichi Enokida 3 and Toshiaki Ejima 3 We have proposed a method for
More informationTable 1. Reluctance equalization design. Fig. 2. Voltage vector of LSynRM. Fig. 4. Analytical model. Table 2. Specifications of analytical models. Fig
Mover Design and Performance Analysis of Linear Synchronous Reluctance Motor with Multi-flux Barrier Masayuki Sanada, Member, Mitsutoshi Asano, Student Member, Shigeo Morimoto, Member, Yoji Takeda, Member
More information2007/8 Vol. J90 D No. 8 AdaBoos Haar-like AdaBoos Viola Jones Haar-like [17] (1) Haar-like AdaBoos (2) Suppor Vecor Tracking SVT [1] SVT [6] Okuma [10
a) 3D People Tracking Using he Paricle Filer wih Cascaded Classifiers Yoshinori KOBAYASHI a),daisukesugimura,kousukehirasawa, Naohiko SUZUKI,HiroshiKAGE,YoichiSATO, and Akihiro SUGIMOTO Haar-like AdaBoos
More informationIPSJ SIG Technical Report 1, Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1
1, 2 1 1 1 Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1 Nobutaka ONO 1 and Shigeki SAGAYAMA 1 This paper deals with instrument separation
More information& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro
TV 1,2,a) 1 2 2015 1 26, 2015 5 21 Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Rotation Using Mobile Device Hiroyuki Kawakita 1,2,a) Toshio Nakagawa 1 Makoto Sato
More informationVol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information
Vol.54 No.7 1937 1950 (July 2013) 1,a) 2012 11 1, 2013 4 5 1 Similar Sounds Sentences Generator Based on Morphological Analysis Manner and Low Class Words Masaaki Kanakubo 1,a) Received: November 1, 2012,
More information第62巻 第1号 平成24年4月/石こうを用いた木材ペレット
Bulletin of Japan Association for Fire Science and Engineering Vol. 62. No. 1 (2012) Development of Two-Dimensional Simple Simulation Model and Evaluation of Discharge Ability for Water Discharge of Firefighting
More information1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf
1,a) 2,b) 4,c) 3,d) 4,e) Web A Review Supporting System for Whiteboard Logging Movies Based on Notes Timeline Taniguchi Yoshihide 1,a) Horiguchi Satoshi 2,b) Inoue Akifumi 4,c) Igaki Hiroshi 3,d) Hoshi
More informationKinecV2 2.2 Kinec Kinec [8] Kinec Kinec [9] KinecV1 3D [10] Kisikidis [11] Kinec Kinec Kinec 3 KinecV2 PC 1 KinecV2 Kinec PC Kinec KinecV2 PC KinecV2
Kinec Developmen of Moion Capure Sysem using Muliple Kinecs 1 1 1 Miyaake Jumpei 1 Ohubo Masakazu 1 Yoshida Kaori 1 1 1 Graduae School of Life Science and Sysems Enginnering, Kyushu Insiue of echnology
More information7) 8) 9),10) 11) 18) 11),16) 18) 19) 20) Vocaloid 6) Vocaloid 1 VocaListener1 2 VocaListener1 3 VocaListener VocaListener1 VocaListener1 Voca
VocaListener2: 1 1 VocaListener2 VocaListener VocaListener2 VocaListener2 VocaListener VocaListener2 VocaListener2: A Singing Synthesis System Mimicking Voice Timbre Changes in Addition to Pitch and Dynamics
More informationpaper.dvi
59 6 2003 pp. 1 11 1 43.72.Kb * 1 2 3 1. 2 2 1 1 1 [1] Person Recognition for News Videos through Multimodal Interaction, by Masakiyo Fujimoto, Yasuo Ariki and Shuji Doshita. 1 ATR 2 3 masakiyo.fujimoto@atr.jp
More informationi
24 i 1 1 1.1.................................. 1 1.2....................... 2 1.3........................... 5 2 7 2.1............................... 7 2.2............ 8 2.3.......................... 9
More informationDPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)
1 2 1 3 Experimental Evaluation of Convenient Strain Measurement Using a Magnet for Digital Public Art Junghyun Kim, 1 Makoto Iida, 2 Takeshi Naemura 1 and Hiroyuki Ota 3 We present a basic technology
More information21 Key Exchange method for portable terminal with direct input by user
21 Key Exchange method for portable terminal with direct input by user 1110251 2011 3 17 Diffie-Hellman,..,,,,.,, 2.,.,..,,.,, Diffie-Hellman, i Abstract Key Exchange method for portable terminal with
More informationFig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system
Study of Health Monitoring of Vehicle Structure by Using Feature Extraction based on Discrete Wavelet Transform Akihisa TABATA *4, Yoshio AOKI, Kazutaka ANDO and Masataka KATO Department of Precision Machinery
More informationIPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing
DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing Youhei Namiki 1 and Yutaka Akiyama 1 Pyrosequencing, one of the DNA sequencing technologies, allows us to determine
More informationJFE.dvi
,, Department of Civil Engineering, Chuo University Kasuga 1-13-27, Bunkyo-ku, Tokyo 112 8551, JAPAN E-mail : atsu1005@kc.chuo-u.ac.jp E-mail : kawa@civil.chuo-u.ac.jp SATO KOGYO CO., LTD. 12-20, Nihonbashi-Honcho
More informationTHE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. UWB UWB
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. UWB -1 E-mail: seki@aso.cce.i.koto-u.ac.jp UWB SEABED SEABED SEABED,,, SEABED Application of fast imaging
More information3D UbiCode (Ubiquitous+Code) RFID ResBe (Remote entertainment space Behavior evaluation) 2 UbiCode Fig. 2 UbiCode 2. UbiCode 2. 1 UbiCode UbiCode 2. 2
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS HCG HUMAN COMMUNICATION GROUP SYMPOSIUM. UbiCode 243 0292 1030 E-mail: {ubicode,koide}@shirai.la, {otsuka,shirai}@ic.kanagawa-it.ac.jp
More informationH(ω) = ( G H (ω)g(ω) ) 1 G H (ω) (6) 2 H 11 (ω) H 1N (ω) H(ω)= (2) H M1 (ω) H MN (ω) [ X(ω)= X 1 (ω) X 2 (ω) X N (ω) ] T (3)
72 12 2016 pp. 777 782 777 * 43.60.Pt; 43.38.Md; 43.60.Sx 1. 1 2 [1 8] Flexible acoustic interface based on 3D sound reproduction. Yosuke Tatekura (Shizuoka University, Hamamatsu, 432 8561) 2. 2.1 3 M
More information知能と情報, Vol.30, No.5, pp
1, Adobe Illustrator Photoshop [1] [2] [3] Initital Values Assignment of Parameters Using Onomatopoieas for Interactive Design Tool Tsuyoshi NAKAMURA, Yuki SAWAMURA, Masayoshi KANOH, and Koji YAMADA Graduate
More informationVol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m
Vol.55 No.1 2 15 (Jan. 2014) 1,a) 2,3,b) 4,3,c) 3,d) 2013 3 18, 2013 10 9 saccess 1 1 saccess saccess Design and Implementation of an Online Tool for Database Education Hiroyuki Nagataki 1,a) Yoshiaki
More informationIPSJ SIG Technical Report Vol.2016-CE-137 No /12/ e β /α α β β / α A judgment method of difficulty of task for a learner using simple
1 2 3 4 5 e β /α α β β / α A judgment method of difficulty of task for a learner using simple electroencephalograph Katsuyuki Umezawa 1 Takashi Ishida 2 Tomohiko Saito 3 Makoto Nakazawa 4 Shigeichi Hirasawa
More informationOptical Lenses CCD Camera Laser Sheet Wind Turbine with med Diffuser Pitot Tube PC Fig.1 Experimental facility. Transparent Diffuser Double Pulsed Nd:
*1 *2 *3 PIV Measurement of Field of the Wind Turbine with a med Diffuser Kazuhiko TOSHIMITSU *4, Koutarou NISHIKAWA and Yuji OHYA *4 Department of Mechanical Engineering, Matsue National Collage of Technology,
More informationVol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System
Vol. 43 No. 7 July 2002 ATR-MATRIX,,, ATR ITL ATR-MATRIX ATR-MATRIX 90% ATR-MATRIX Development and Evaluation of ATR-MATRIX Speech Translation System Fumiaki Sugaya,,, Toshiyuki Takezawa, Eiichiro Sumita,
More information1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing
1,a) 2,b) 3 Modeling of Agitation Method in Automatic Mahjong Table using Multi-Agent Simulation Hiroyasu Ide 1,a) Takashi Okuda 2,b) Abstract: Automatic mahjong table refers to mahjong table which automatically
More informationIPSJ SIG Technical Report Vol.2009-SLP-77 No /7/ GOP Improvement of Structure-based Automatic Estimation of Pronunciation Proficiency
GOP Improvement of Structure-based Automatic Estimation of Pronunciation Proficiency Masayuki Suzuki, Dean Luo, Nobuaki Minematsu and Keikichi Hirose Adequacy in controlling the vocal organs is often estimated
More informationNJ-XS10J This appliance is designed for use in Japan only and can not be used in any other country. No servicing is available outside of Japan. a a a a a ba a a a a a a a a 1 2 1 2 3 4 1 2 3 3 4 1
More informationMicrosoft Word - toyoshima-deim2011.doc
DEIM Forum 2011 E9-4 252-0882 5322 252-0882 5322 E-mail: t09651yt, sashiori, kiyoki @sfc.keio.ac.jp CBIR A Meaning Recognition System for Sign-Logo by Color-Shape-Based Similarity Computations for Images
More information本文6(599) (Page 601)
(MIRU2008) 2008 7 525 8577 1 1 1 E-mail: matsuzaki@i.ci.ritsumei.ac.jp, shimada@ci.ritsumei.ac.jp Object Recognition by Observing Grasping Scene from Image Sequence Hironori KASAHARA, Jun MATSUZAKI, Nobutaka
More informationIPSJ SIG Technical Report Vol.2011-EC-19 No /3/ ,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-
1 3 5 4 1 2 1,.,., Peg-Scope Viewer,,.,,,,. Utilization of Watching Logs for Support of Multi-View Video Contents Kosuke Niwa, 1 Shogo Tokai, 3 Tetsuya Kawamoto, 5 Toshiaki Fujii, 4 Marutani Takafumi,
More informationNP-DB NP-DB10 0570-011874 This appliance was designed for use in Japan only where the local voltage supply is AC100V and should not be used in other countries where the voltage and frequency
More informationDEIM Forum 2009 E
DEIM Forum 2009 E5-3 464-8601 1 606-8501 464 8601 1 E-mail: lifushi@arch.itc.nagoya-u.ac.jp, mayumi@mm.media.kyoto-u.ac.jp, {hirano,kajita,mase}@itc.nagoya-u.ac.jp Abstract Study on a Recipe Recommendation
More information23 Fig. 2: hwmodulev2 3. Reconfigurable HPC 3.1 hw/sw hw/sw hw/sw FPGA PC FPGA PC FPGA HPC FPGA FPGA hw/sw hw/sw hw- Module FPGA hwmodule hw/sw FPGA h
23 FPGA CUDA Performance Comparison of FPGA Array with CUDA on Poisson Equation (lijiang@sekine-lab.ei.tuat.ac.jp), (kazuki@sekine-lab.ei.tuat.ac.jp), (takahashi@sekine-lab.ei.tuat.ac.jp), (tamukoh@cc.tuat.ac.jp),
More informationBulletin of JSSAC(2014) Vol. 20, No. 2, pp (Received 2013/11/27 Revised 2014/3/27 Accepted 2014/5/26) It is known that some of number puzzles ca
Bulletin of JSSAC(2014) Vol. 20, No. 2, pp. 3-22 (Received 2013/11/27 Revised 2014/3/27 Accepted 2014/5/26) It is known that some of number puzzles can be solved by using Gröbner bases. In this paper,
More informationa) b) c) Speech Recognition of Short Time Utterance Based on Speaker Clustering Hiroshi SEKI a), Daisuke ENAMI, Faqiang ZHU, Kazumasa YAMAMOTO b), and
a) b) c) Speech Recognition of Short Time Utterance Based on Speaker Clustering Hiroshi SEKI a), Daisuke ENAMI, Faqiang ZHU, Kazumasa YAMAMOTO b), and Seiichi NAKAGAWA c) 0.5 DNN (Deep Neural Network)
More information2. CABAC CABAC CABAC 1 1 CABAC Figure 1 Overview of CABAC 2 DCT 2 0/ /1 CABAC [3] 3. 2 値化部 コンテキスト計算部 2 値算術符号化部 CABAC CABAC
H.264 CABAC 1 1 1 1 1 2, CABAC(Context-based Adaptive Binary Arithmetic Coding) H.264, CABAC, A Parallelization Technology of H.264 CABAC For Real Time Encoder of Moving Picture YUSUKE YATABE 1 HIRONORI
More informationNP-HV10 NP-HV18 NP-HV E D C 1 1 2 3 2 3 1 2 1 3 4 5 6 7 8 1 1 2 2 2 3 1 2 1 3 4 1 5 6 2 3 4 4 1 1 2 3 4 5 0570-011874 This appliance was designed for use in Japan only where the local voltage supply
More information10_08.dvi
476 67 10 2011 pp. 476 481 * 43.72.+q 1. MOS Mean Opinion Score ITU-T P.835 [1] [2] [3] Subjective and objective quality evaluation of noisereduced speech. Takeshi Yamada, Shoji Makino and Nobuhiko Kitawaki
More informationIPSJ SIG Technical Report Vol.2009-DPS-141 No.23 Vol.2009-GN-73 No.23 Vol.2009-EIP-46 No /11/27 t-room t-room 2 Development of
t-room 1 2 2 2 2 1 1 2 t-room 2 Development of Assistant System for Ensemble in t-room Yosuke Irie, 1 Shigemi Aoyagi, 2 Toshihiro Takada, 2 Keiji Hirata, 2 Katsuhiko Kaji, 2 Shigeru Katagiri 1 and Miho
More informationxx/xx Vol. Jxx A No. xx 1 Fig. 1 PAL(Panoramic Annular Lens) PAL(Panoramic Annular Lens) PAL (2) PAL PAL 2 PAL 3 2 PAL 1 PAL 3 PAL PAL 2. 1 PAL
PAL On the Precision of 3D Measurement by Stereo PAL Images Hiroyuki HASE,HirofumiKAWAI,FrankEKPAR, Masaaki YONEDA,andJien KATO PAL 3 PAL Panoramic Annular Lens 1985 Greguss PAL 1 PAL PAL 2 3 2 PAL DP
More informationIPSJ-SLP
F0 MFCC 1 2 3 1 1 1 1 MFCCF0 1 86.7% 90.2% A System for Automatic Discrimination between Singing and Speaking Voices on the Basis of Peak Interval of Spectral Change, F0, and MFCC Shimpei Aso, 1 Takeshi
More information本文.indd
35 1 2013 4 Reprinted From THE RESEARCH BULLETIN OF THE FACULTY OF EDUCATION AND WELFARE SCIENCE, OITA UNIVERSITY Vol. 35, No. 1April 2013 OITA, JAPAN Res. Bull. Fac. Educ.Welf. Sci., Oita Univ. 17 7 3
More informationa
IH NJ-KH10 NJ-KH18 This appliance is designed for use in Japan only and can not be used in any other country. No servicing is available outside of Japan. a 12 13 23 a 180150 170147 1011 10 21 124 1 0
More information08-特集04.indd
5 2 Journal of Multimedia Aided Education Research 2008, Vol. 5, No. 2, 3543 ICT ICT ICT 2 ICT ICT 1100 2008 ICT ICT 2007 ICT ICT ICT ICT IPtalk2008 2006 LAN TCP/IP 1 35 5 22008 1 Enter 1 IPtalk 2 2 2IPtalk
More informationTHE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE k
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 565 0871 2 1 606 8501 606 8501 651 2103 3 1 E-mail: k-nakamura@comm.eng.osaka-u.ac.jp ARToolKit 1. 1 1 2.
More information258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS 2 3 4 5 2. 2.1 3 1) GPS Global Positioning System
Vol. 52 No. 1 257 268 (Jan. 2011) 1 2, 1 1 measurement. In this paper, a dynamic road map making system is proposed. The proposition system uses probe-cars which has an in-vehicle camera and a GPS receiver.
More informationTHE INSTITUTE OF ELECTRONICS, TECHNICAL REPORT OF IEICE. INFORMATION AND COMMUNICATION ENGINEERS
Title とメルケプストラムを用いた音響モデルに基づく騒音環境下叫び声検出の性能評価 Author(s) 福森, 隆寛 ; 中山, 雅人 ; 西浦, 敬信 ; 南條, 浩輝 Citation 電子情報通信学会技術研究報告 = IEICE technical re 信学技報 (217), 116(477): 283-286 Issue Date 217-3 URL http://hdl.handle.net/2433/228957
More informationTCP/IP IEEE Bluetooth LAN TCP TCP BEC FEC M T M R M T 2. 2 [5] AODV [4]DSR [3] 1 MS 100m 5 /100m 2 MD 2 c 2009 Information Processing Society of
IEEE802.11 [1]Bluetooth [2] 1 1 (1) [6] Ack (Ack) BEC FEC (BEC) BEC FEC 100 20 BEC FEC 6.19% 14.1% High Throughput and Highly Reliable Transmission in MANET Masaaki Kosugi 1 and Hiroaki Higaki 1 1. LAN
More informationn-jas09.dvi
Vol. 9 (2009 12 ), No. 03-091211 JASCOME CREEP ANALYSIS DISCONTINUOUS ROCK MASS AROUND UNDERGROUND CAVERN 1) 2) 3) Takakuni TATSUMI, Hidenori YOSHIDA and Masumi FUJIWARA 1) ( 761-0396 2217-20, E-mail:
More informationEQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju
EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Jun Motohashi, Member, Takashi Ichinose, Member (Tokyo
More informationTHE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 56 8531 1 3 E-mail: morisaka@ec.ee.es.osaka-u.ac.jp, {shiomi,okamura}@ee.es.osaka-u.ac.jp 2.665GHz 29 1.2,
More information2007/8 Vol. J90 D No. 8 Stauffer [7] 2 2 I 1 I 2 2 (I 1(x),I 2(x)) 2 [13] I 2 = CI 1 (C >0) (I 1,I 2) (I 1,I 2) Field Monitoring Server
a) Change Detection Using Joint Intensity Histogram Yasuyo KITA a) 2 (0 255) (I 1 (x),i 2 (x)) I 2 = CI 1 (C>0) (I 1,I 2 ) (I 1,I 2 ) 2 1. [1] 2 [2] [3] [5] [6] [8] Intelligent Systems Research Institute,
More information1_26.dvi
C3PV 1,a) 2,b) 2,c) 3,d) 1,e) 2012 4 20, 2012 10 10 C3PV C3PV C3PV 1 Java C3PV 45 38 84% Programming Process Visualization for Supporting Students in Programming Exercise Hiroshi Igaki 1,a) Shun Saito
More informationIPSJ-JNL
Vol. 52 No. 12 3853 3867 (Dec. 2011) VocaListener 1 1 VocaListener VocaListener 2 VocaListener: A Singing Synthesis System by Mimicking Pitch and Dynamics of User s Singing Tomoyasu Nakano 1 and Masataka
More informationP2P P2P peer peer P2P peer P2P peer P2P i
26 P2P Proposed a system for the purpose of idle resource utilization of the computer using the P2P 1150373 2015 2 27 P2P P2P peer peer P2P peer P2P peer P2P i Abstract Proposed a system for the purpose
More informationA Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member
A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member (University of Tsukuba), Yasuharu Ohsawa, Member (Kobe
More information大学野球の期分けにおける一般的準備期のランニング トレーニングが試合期の大学生投手の実戦状況下 パフォーマンスに与える影響
The Effect of Pre-Season Running Training for Game Performance of University Baseball Pitcher AKAIKE, Kohei This paper provides useful information for university baseball players and coaches as well as
More informationSG79F095HO2
MSZ-J227-W MSZ-J257J287-W This appliance is designed for use in Japan only and can not be used in any other country. No servicing is available outside of Japan. 1 2 3 1 2 3 1 2 1 2 0120-56-8634
More informationGPGPU
GPGPU 2013 1008 2015 1 23 Abstract In recent years, with the advance of microscope technology, the alive cells have been able to observe. On the other hand, from the standpoint of image processing, the
More informationIPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe
1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Speech Visualization System Based on Augmented Reality Yuichiro Nagano 1 and Takashi Yoshino 2 As the spread of the Augmented Reality(AR) technology and service,
More information1. HNS [1] HNS HNS HNS [2] HNS [3] [4] [5] HNS 16ch SNR [6] 1 16ch 1 3 SNR [4] [5] 2. 2 HNS API HNS CS27-HNS [1] (SOA) [7] API Web 2
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 657 8531 1 1 E-mail: {soda,matsubara}@ws.cs.kobe-u.ac.jp, {masa-n,shinsuke,shin,yosimoto}@cs.kobe-u.ac.jp,
More informationIPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp
1. 1 1 1 2 treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corpus Management Tool: ChaKi Yuji Matsumoto, 1 Masayuki Asahara, 1 Masakazu Iwatate 1 and Toshio Morita 2 This paper
More informationID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR
Vol. 51 No. 11 2081 2088 (Nov. 2010) 2 1 1 1 which appended specific characters to the information such as identification to avoid parity check errors, before QR Code encoding with the structured append
More information4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q
x-means 1 2 2 x-means, x-means k-means Bayesian Information Criterion BIC Watershed x-means Moving Object Extraction Using the Number of Clusters Determined by X-means Clustering Naoki Kubo, 1 Kousuke
More information20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow
20 Method for Recognizing Expression Considering Fuzzy Based on Optical Flow 1115084 2009 3 5 3.,,,.., HCI(Human Computer Interaction),.,,.,,.,.,,..,. i Abstract Method for Recognizing Expression Considering
More informationVol.53 No (Mar. 2012) 1, 1,a) 1, 2 1 1, , Musical Interaction System Based on Stage Metaphor Seiko Myojin 1, 1,a
1, 1,a) 1, 2 1 1, 3 2 1 2011 6 17, 2011 12 16 Musical Interaction System Based on Stage Metaphor Seiko Myojin 1, 1,a) Kazuki Kanamori 1, 2 Mie Nakatani 1 Hirokazu Kato 1, 3 Sanae H. Wake 2 Shogo Nishida
More informationVol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka
Vol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka, Masataka Goto,, Hideki Asoh and Nobuyuki Otsu, This
More informationTable 1 Type of polymeric coating materials Fig. 2 Results of suppressive effects of polymeric coating materials on the progress of neutralization of concrete. Table 2 Evaluation of the suppressive effects
More informationThe Japanese Journal of Psychology 2004, Vol. 75, No. 5, 397-406 Effects of fundamental frequency and speech rate on impression formation Teruhisa Uchida and Naoko Nakaune (Research Division, The National
More information3_23.dvi
Vol. 52 No. 3 1234 1244 (Mar. 2011) 1 1 mixi 1 Casual Scheduling Management and Shared System Using Avatar Takashi Yoshino 1 and Takayuki Yamano 1 Conventional scheduling management and shared systems
More information金融政策の波及経路と政策手段
Krugman(988) Woodford(999) (2000) (2000) 4 rae-of-reurn dominance 405 4 406 (i) a 2 cash good credi good b King and Wolman(999) (ii) 407 3 4 90 (iii) (iv) 408 λ κ (2.8) π x π λ = x κ Svensson 999 sric
More informationTHE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE {s-kasihr, wakamiya,
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 565-0871 1 5 E-mail: {s-kasihr, wakamiya, murata}@ist.osaka-u.ac.jp PC 70% Design, implementation, and evaluation
More information取扱説明書
ER-LD530 STEP 1 STEP 2 STEP 3 STEP 4 STEP 5 1 5 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 1 2 3 22 23 1 2 24 25 26 27 1 2 3 28 29 30 31 32 33 34 1 2 3 35 1 2 3 36 37 1 2 3 4 38 39 1 2 3 4 40
More information. VOCA (Voce Output Communcaton Ads) [], [] [] [] GloveTalkII [] F/F [] [] [8] [9], [0] HMM ) ).. m a m n b n. Stylanou [] a bz=[a, b] (Gaussan Mxture
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 8 00 E-mal: {kunkosh,qao,mne,hrose}@gavo.t.u-tokyo.ac.jp Nasal sound generaton and ptch control for the
More information7,, i
23 Research of the authentication method on the two dimensional code 1145111 2012 2 13 7,, i Abstract Research of the authentication method on the two dimensional code Karita Koichiro Recently, the two
More informationlog F0 意識 しゃべり 葉の log F0 Fig. 1 1 An example of classification of substyles of rap. ' & 2. 4) m.o.v.e 5) motsu motsu (1) (2) (3) (4) (1) (2) mot
1. 1 2 1 3 2 HMM Rap-style Singing Voice Synthesis Keijiro Saino, 1 Keiichiro Oura, 2 Makoto Tachibana, 1 Hieki Kenmochi 3 an Keiichi Tokua 2 This paper aresses rap-style singing voice synthesis. Since
More informationgengo.dvi
4 97.52% tri-gram 92.76% 98.49% : Japanese word segmentation by Adaboost using the decision list as the weak learner Hiroyuki Shinnou In this paper, we propose the new method of Japanese word segmentation
More informationTable 1. Assumed performance of a water electrol ysis plant. Fig. 1. Structure of a proposed power generation system utilizing waste heat from factori
Proposal and Characteristics Evaluation of a Power Generation System Utilizing Waste Heat from Factories for Load Leveling Pyong Sik Pak, Member, Takashi Arima, Non-member (Osaka University) In this paper,
More informationSocial Intelligence []... [] ( ) ( ) 一 般 の 情 報 他 人 の 情 報 人 コンテキスト 付 与 ソーシャル メディアの 普 及 により 受 け 手 は 自 分 の 認 識 を 発 信 機 械 コンテキスト 分 析 私 の 情 報 神 沼 靖 子, 内 木
THE 一 INSTITUTE 般 社 団 法 人 OF ELECTRONICS, 電 子 情 報 通 信 学 会 信 IEICE 学 技 Technical 報 Report INFORMATION THE INSTITUTE ANDOF COMMUNICATION ELECTRONICS, ENGINEERS IEICE Technical Report INFORMATION AND COMMUNICATION
More informationkut-paper-template.dvi
14 Application of Automatic Text Summarization for Question Answering System 1030260 2003 2 12 Prassie Posum Prassie Prassie i Abstract Application of Automatic Text Summarization for Question Answering
More information2.2 (a) = 1, M = 9, p i 1 = p i = p i+1 = 0 (b) = 1, M = 9, p i 1 = 0, p i = 1, p i+1 = 1 1: M 2 M 2 w i [j] w i [j] = 1 j= w i w i = (w i [ ],, w i [
RI-002 Encoding-oriented video generation algorithm based on control with high temporal resolution Yukihiro BANDOH, Seishi TAKAMURA, Atsushi SHIMIZU 1 1T / CMOS [1] 4K (4096 2160 /) 900 Hz 50Hz,60Hz 240Hz
More informationVol.8 No (July 2015) 2/ [3] stratification / *1 2 J-REIT *2 *1 *2 J-REIT % J-REIT J-REIT 6 J-REIT J-REIT 10 J-REIT *3 J-
Vol.8 No.2 1 9 (July 2015) 1,a) 2 3 2012 1 5 2012 3 24, 2013 12 12 2 1 2 A Factor Model for Measuring Market Risk in Real Estate Investment Hiroshi Ishijima 1,a) Akira Maeda 2 Tomohiko Taniyama 3 Received:
More information3_39.dvi
Vol. 49 No. 3 Mar. 2008 Web 1 2 PC Web Web Windows Web Access Watchdog Systems for Children Protection Tatsumi Ueda 1 and Yoshiaki Takai 2 For today s children, the Internet is one of the most familiar
More information