IPSJ SIG Technical Report Vol.2014-MUS-104 No /8/27 F0 1,a) 1,b) 1,c) 2,d) (F0) F0 F0 Graphical User Interface (GUI) F0 1. [1] CD MIDI [2] [3,

Size: px
Start display at page:

Download "IPSJ SIG Technical Report Vol.2014-MUS-104 No /8/27 F0 1,a) 1,b) 1,c) 2,d) (F0) F0 F0 Graphical User Interface (GUI) F0 1. [1] CD MIDI [2] [3,"

Transcription

1 F,a),b),c) 2,d) (F) F F Graphical User Interface (GUI) F. [] CD MIDI [2] [3, 4] [5] 2 a) ikemiya@kuis.kyoto-u.ac.jp b) itoyama@kuis.kyoto-u.ac.jp c) yoshii@kuis.kyoto-u.ac.jp d) okuno@aoni.waseda.jp TANDEM-STRAIGHT [6] (F) 3 [7] F F [8] [9] F F [] F c 24 Information Processing Society of Japan

2 Time [] F ( ) F GUI 2 F 2. F F ( 2) Q [2] F Q 2. Q x(n) Q [2] X(n, k) = n+ N k /2 N k { j=n N k /2 x(j)a k(j n + N k /2) () a k (n) = w(n/n k ) exp( i2πnf k /f s ) N k = Q fs f k, Q = (2 /fratio ) qrate k f k k [Hz] f s w(t) [, ] fratio qrate n [msec] Q t f X(t, f) 2.2 Robust PCA Robust PCA (RPCA) [3] 2 minimize L + λ S (subjectto L + S = M) (2) M L S L λ RPCA [4] [4] Q c 24 Information Processing Society of Japan 2

3 Q 4. F GUI F F 2.3 F F F 2.3. F F F F 3 GUI F F F c [cent] F L t (c) F c l c h [cent] F [cent] F (t) = arg max c l c c h L t (c) (3) L t (c) [5, 6] 4 X(t, f) M b (t, f) M h (t, f) X s (t, f) Original spectrum RPCA mask Harmonic mask Masked spectrum [cent] RPCA F Subharmonic Summation (SHS) [7] SHS L t (c) L t (c) = N λ n S t (c + 2 log 2 n) (4) n= S t (c) N λ 5, F F ( 4) [ Ht h w 2 < C(f) < Hh t + w 2 M h (t, f) = Ht h = F t + 2 log 2 h, h H otherwise (5) F t t F [cent] C(f) f [cent] H w [cent] RPCA M b (t, f) X s (t, f) X m (t, f) X s (t, f) = M b (t, f)m h (t, f)x(t, f), X m (t, f) = ( M b (t, f)m h (t, f))x(t, f) (6) 2.4 [Hz] n [cent] 2 log 2 n [cent] c 24 Information Processing Society of Japan 3

4 X s (t, f) E(t, f) X shift (t, f) Original spectrum (vocal) Estimated spectrum envelope Simply-shifted spectrum Corrected spectrum [cent] X s (t, f) X m (t, f) [6] ( 5) (DAP) [8] F F E(t, f) m E(t, f) X shift (t, f) = A t X s (t, f m) (7) E(t, f m) A t X new (t, f) = X m (t, f m) + X shift (t, f) (8) 2.5 Q Q [2] 2.4 Q [9] 3. [], F,, 3. F F F 4. F 4. 6kHz 6bit Q fratio.5 (2 bins per octave) qrate.2 [msec] RPCA k [4]. 2.3 w 2 [cent] 4.2 F 2.3. F F F c 24 Information Processing Society of Japan 4

5 [%] [cent] c: [cent] (a) F [msec] (b) F (c = 4 RWC-MDB-P-2: No.7) ±4 [cent] F (b) F F ±4 [cent] RWC Music Database: Popular Music (RWC-MDB-P-2) [2] 94 F ±c [cent] F c F 6 (a) 5 [cent] c = [cent] % c c = 4 [cent] 9 F 4 F ( 6 (b)) [cent] 6 [Hz] No correction With correction TANDEM-STRAIGHT [cent] TANDEM-STRAIGHT TANDEM-STRAIGHT [6] TANDEM-STRAIGHT [cent] TANDEM- STRAIGHT 2 [cent] TANDEM-STRAIGHT DAP 4.4 F 8 Q c 24 Information Processing Society of Japan 5

6 [cent] Original spectrogram Vocal expression Modified spectrogram [msec] 5. F F F GUI F GUI JSPS JST CREST OngaCREST [] Goto, M.: Active Music Listening Interfaces Based on Signal Processing, Proc. ICASSP (27). [2] Yoshii, K., Goto, M., Komatani, K., Ogata, T. and Okuno, H. G.: Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening, IPSJ Journal (27). [3] Itoyama, K., Goto, M., Komatani, K., Ogata, T. and Okuno, H. G.: Instrument Equalizer for Query-by- Example Retrieval: Improving Sound Source Separation based on Integrated Harmonic and Inharmonic Models, Proc. ISMIR (28). [4] Fritsch, J. and Plumbley, M. D.: Score Informed Audio Source Separation using Constrained Nonnegative Matrix Factorization and Score Synthesis, Proc. ICASSP (23). [5] Rafii, Z., Germain, F. G., Sun, D. L. and Mysore, G. J.: Combining Modeling of Singing Voice and Background Music for Automatic Separation of Musical Mixtures, Proc. ISMIR (23). [6] Kawahara, H., Morise, M., Takahashi, T., Nisimura, R., Irino, T. and Banno, H.: Tandem-STRAIGHT: A Temporally Stable Power Spectral Representation for Periodic Signals and Applications to Interference-free Spectrum, F, and Aperiodicity Estimation, Proc. ICASSP (28). [7] Ohishi, Y., Mochihashi, D., Kameoka, H. and Kashino, K.: Mixture of Gaussian Process Experts for Predicting Sung Melodic Contour with Expressive Dynamic Fluctuations, Proc. ICASSP (24). [8] (23). [9] Fujihara, H. and Goto, M.: Concurrent Estimation of Singing Voice F and Phonemes by Using Spectral Envelopes Estimated from Polyphonic Music, Proc. ICASSP, pp (2). [] Saito, T. and Goto, M.: Acoustic and Perceptual Effects of Vocal Training in Amateur Male Singing, Proc. INTERSPEECH (29). [] Ikemiya, Y., Itoyama, K. and Okuno, H. G.: Transcribing Vocal Expression from Polyphonic Music, Proc. ICASSP (24). [2] Schorkhuber, C. and Klapuri, A.: Constant-Q Transform Toolbox for Music Processing, SMC Conference (2). [3] Candes, E. J., Li, X., Ma, Y. and Wright, J.: Robust Principal Component Analysis?, J. ACM (2). [4] Huang, P.-S., Chen, S. D., Smaragdis, P. and Hasegawa- Johnson, M.: Singing-Voice Separation from Monaural Recordings Using Robust Principal Component Analysis, Proc. ICASSP (22). [5] Goto, M.: PreFEst: A Predominant-F Estimation Method for Polyphonic Musical Audio Signals, Proc. MIREX (25). [6] Saito, S., Kameoka, H., Takahashi, K., Nishimoto, T. and Sagayama, S.: Specmurt Analysis of Polyphonic Music Signals, IEEE Trans. on Audio, Speech, and Language Process (28). [7] Hermes, D. J.: Measurement of pitch by subharmonic summation, J. Acoust. Soc. Am., Vol. 83, No., pp (online), DOI:.2/ (988). [8] El-Jaroudi, A. and Makhoul, J.: Discrete All-Pole Modeling, IEEE Trans. on Signal Proc. (99). [9] Irino, T. and Kawahara, H.: Signal Reconstruction from Modified Auditory Wavelet Transform, IEEE Trans. on Signal Proc. (993). [2] Goto, M., Hashiguchi, H., Nishimura, T. and Oka, R.: RWC Music Database: Popular, Classical, and Jazz Music Databases, Proc. ISMIR, pp (22). c 24 Information Processing Society of Japan 6

情報処理学会インタラクション 2015 IPSJ Interaction INT /3/7 1,a) 1,b) 1,c) CD Robust PCA Subharmonic Summation MIREX2014 GUI GUI A Vocal Expression Ed

情報処理学会インタラクション 2015 IPSJ Interaction INT /3/7 1,a) 1,b) 1,c) CD Robust PCA Subharmonic Summation MIREX2014 GUI GUI A Vocal Expression Ed 情報処理学会インタラクション 215 IPSJ Interaction 215 15INT15 215/3/7 1,a) 1,b) 1,c) CD Robust PCA Subharmonic Summation MIREX214 GUI GUI A Vocal Expression Editing System based on Singing Voice Separation and F Estimation

More information

音楽音響信号の音源分離と能動的音楽鑑賞への応用 Sound source separation for music audio signals and its application to active music listening 援にとどまらず 一種の創作支援と見ることもできる 例えば ドラム

音楽音響信号の音源分離と能動的音楽鑑賞への応用 Sound source separation for music audio signals and its application to active music listening 援にとどまらず 一種の創作支援と見ることもできる 例えば ドラム 援にとどまらず 一種の創作支援と見ることもできる 例えば ドラムパートの音量や音色 パターンを MIDI ファイルを扱うかのごとく編集する [1] 楽器パートの音量バランスを個別に調整する [2] あるいは歌声と伴奏を分離する [3] といったことが可能である 音楽 CD や MP3 を再生するだけの受動的な音楽鑑賞体験を超 糸山克寿 (Katsutoshi ITOYAMA, Ph. D.) 京都大学大学院情報学研究科助教

More information

力 出力 ÝÒ 源分離 f å 2 š ž 伸縮率 f g å ² f œå 1 ( F0) audio-to-audio 3 2 RNMF [2] DTW audio-to-audio [3] [4] MIDI 2.2 [5 10] Dannenberg [5] Verc

力 出力 ÝÒ 源分離 f å 2 š ž 伸縮率 f g å ² f œå 1 ( F0) audio-to-audio 3 2 RNMF [2] DTW audio-to-audio [3] [4] MIDI 2.2 [5 10] Dannenberg [5] Verc 1,a) 1,b) 1,c) 1,d) 2,e) (MIDI ) audio-to-audio (RNMF) (DTW) DTW 1., (MIDI ) MIDI CD 2 1 1 MIDI CGM (Consumer Generated Music) Web Songrium [1] 2007 7 120 Web 1 2 / AIP a) wada@sap.ist.i.kyoto-u.ac.jp

More information

pp d 2 * Hz Hz 3 10 db Wind-induced noise, Noise reduction, Microphone array, Beamforming 1

pp d 2 * Hz Hz 3 10 db Wind-induced noise, Noise reduction, Microphone array, Beamforming 1 72 12 2016 pp. 739 748 739 43.60.+d 2 * 1 2 2 3 2 125 Hz 0.3 0.8 2 125 Hz 3 10 db Wind-induced noise, Noise reduction, Microphone array, Beamforming 1. 1.1 PSS [1] [2 4] 2 Wind-induced noise reduction

More information

sigmus201007_fujihara.dvi

sigmus201007_fujihara.dvi 1 1 1) W-PST W-PST W-PST W-PST Singing voice conversion method by using spectral envelope of singing voice estimated from polyphonic music Hiromasa Fujihara 1 and Masataka Goto 1 This paper describes a

More information

OngaCREST [10] A 3. Latent Dirichlet Allocation: LDA [11] Songle [12] Pitman-Yor (VPYLM) [13] [14,15] n n n 3.1 [16 18] PreFEst [19] F

OngaCREST [10] A 3. Latent Dirichlet Allocation: LDA [11] Songle [12] Pitman-Yor (VPYLM) [13] [14,15] n n n 3.1 [16 18] PreFEst [19] F 1,a) 2,b) 1,c) LPMCC MFCC Fluctuation Pattern (LDA) Songle Pitman-Yor (VPYLM) 3278 1. (MIR: Music Information Retrieval) [1 5] [6 8] 1 National Institute of Advanced Industrial Science and Technology (AIST)

More information

sigmusdemo.dvi

sigmusdemo.dvi V IT Demonstrations: Introduction of Research by Young Researchers V Masatoshi Hamanaka Akira Nishimura Hiroshi Takaesu Shigeyuki Hirai Katsutoshi Itoyama Akiyuki Yoshino Shohei Kajiwara Nozomi Kigimoto

More information

IPSJ-MUS

IPSJ-MUS Vol.29-MUS-81 No.2 29/7/29 1 2 1 ground-truth RWC 22 16 Method for Calculating the Subjective-based Music Similarity Measure Yusuke Hiraga, 1 Yasunori Ohishi 2 and Kazuya Takeda 1 In this paper, we propose

More information

IPSJ-SLP

IPSJ-SLP F0 MFCC 1 2 3 1 1 1 1 MFCCF0 1 86.7% 90.2% A System for Automatic Discrimination between Singing and Speaking Voices on the Basis of Peak Interval of Spectral Change, F0, and MFCC Shimpei Aso, 1 Takeshi

More information

IPSJ SIG Technical Report Vol.2017-MUS-115 No /6/17 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListene

IPSJ SIG Technical Report Vol.2017-MUS-115 No /6/17 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListene 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListener [2], [3] Vocaloid *2 VocaListener Vocaloid 1 University of Yamanashi a) g16tk018@yamanashi.ac.jp *1 http://utau2008.web.fc2.com/

More information

H(ω) = ( G H (ω)g(ω) ) 1 G H (ω) (6) 2 H 11 (ω) H 1N (ω) H(ω)= (2) H M1 (ω) H MN (ω) [ X(ω)= X 1 (ω) X 2 (ω) X N (ω) ] T (3)

H(ω) = ( G H (ω)g(ω) ) 1 G H (ω) (6) 2 H 11 (ω) H 1N (ω) H(ω)= (2) H M1 (ω) H MN (ω) [ X(ω)= X 1 (ω) X 2 (ω) X N (ω) ] T (3) 72 12 2016 pp. 777 782 777 * 43.60.Pt; 43.38.Md; 43.60.Sx 1. 1 2 [1 8] Flexible acoustic interface based on 3D sound reproduction. Yosuke Tatekura (Shizuoka University, Hamamatsu, 432 8561) 2. 2.1 3 M

More information

ホットスポット 1 音リアクションイベント BIC GMM 2 3 BIC GMM HMM 10) SVM 11) 12) 13) Bayesian Information Criterion BIC 14) BIC M = M 1, M 2,,

ホットスポット 1 音リアクションイベント BIC GMM 2 3 BIC GMM HMM 10) SVM 11) 12) 13) Bayesian Information Criterion BIC 14) BIC M = M 1, M 2,, 1 1 2 2 BIC GMM Acoustic Event Detection for Finding Hot Spots in Podcasts Kouhei Sumi, 1 Tatsuya Kawahara, 1 Jun Ogata 2 and Masataka Goto 2 This paper presents a method to detect acoustic events that

More information

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and MIDI 1 2 3 2 1 Modeling Performance Indeterminacies for Polyphonic Midi Score Following and Its Application to Automatic Accompaniment Nakamura Eita 1 Yamamoto Ryuichi 2 Saito Yasuyuki 3 Sako Shinji 2

More information

WISS 2018 [2 4] [5,6] Query-by-Dancing Query-by- Dancing Cao [1] OpenPose 2 Ghias [7] Query by humming Chen [8] Query by rhythm Jang [9] Query-by-tapp

WISS 2018 [2 4] [5,6] Query-by-Dancing Query-by- Dancing Cao [1] OpenPose 2 Ghias [7] Query by humming Chen [8] Query by rhythm Jang [9] Query-by-tapp Query-by-Dancing: WISS 2018. Query-by-Dancing Query-by-Dancing 1 OpenPose [1] Copyright is held by the author(s). DJ DJ DJ WISS 2018 [2 4] [5,6] Query-by-Dancing Query-by- Dancing Cao [1] OpenPose 2 Ghias

More information

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2011-MUS-89 No /2/12 NMF NMF NMF NMF NMF NMF Matrix Generation Using Probabilistic Spectrum Enve

情報処理学会研究報告 IPSJ SIG Technical Report Vol.2011-MUS-89 No /2/12 NMF NMF NMF NMF NMF NMF Matrix Generation Using Probabilistic Spectrum Enve NMF 1 2 2 NMF NMF NMF NMF NMF Matrix Generation Using Probabilistic Spectrum Envelope for Mixed Music Analysis Toru Nakashika, 1 Tetsuya Takiguchi 2 and Yasuo Ariki 2 NMF (Non-negative Matrix Factorization)

More information

IPSJ SIG Technical Report Vol.2012-MUS-94 No.27 Vol.2012-SLP-90 No /2/4 1 2 J K L 3 ( ) GUI Musical Audio Signal Modeling for Joint Estimation

IPSJ SIG Technical Report Vol.2012-MUS-94 No.27 Vol.2012-SLP-90 No /2/4 1 2 J K L 3 ( ) GUI Musical Audio Signal Modeling for Joint Estimation 2 J K L 3 GUI Musical Audio Signal Modeling or Joint Estiation o Haronic, Inharonic, and Tibral Structure and its Application to Source Sepatation NAOKI YASURAOKA and HIROSHI G. OKUNO 2 This paper presents

More information

IPSJ SIG Technical Report Vol.2017-MUS-116 No /8/24 MachineDancing: 1,a) 1,b) 3 MachineDancing MachineDancing MachineDancing 1 MachineDan

IPSJ SIG Technical Report Vol.2017-MUS-116 No /8/24 MachineDancing: 1,a) 1,b) 3 MachineDancing MachineDancing MachineDancing 1 MachineDan MachineDancing: 1,a) 1,b) 3 MachineDancing 2 1. 3 MachineDancing MachineDancing 1 MachineDancing MachineDancing [1] 1 305 0058 1-1-1 a) s.fukayama@aist.go.jp b) m.goto@aist.go.jp 1 MachineDancing 3 CG

More information

3 3) 6) 1) MPEG-7 2) MPEG-7 (A) (B) 2 9) Zils 10) (1) (2) 2.1 2

3 3) 6) 1) MPEG-7 2) MPEG-7 (A) (B) 2 9) Zils 10) (1) (2) 2.1 2 yoshii@kuis.kyoto-u.ac.jp m.goto@aist.go.jp okuno@i.kyoto-u.ac.jp 48% 82% Identification of Hihat Cymbals for Musical Audio Signals Using the Single Template Adaptation Method KAZUYOSHI YOSHII,MASATAKA

More information

2014 3

2014 3 1 3 113 : 1 Copyright c 1 by Kobayashi Keisuke Desktop Music (DTM) DAW (Digital Audio Workstation) YAMAHA Vocaloid DTM MIDI (Musical Instruments Digital Interface) Lee (Non-negative Matrix Factorization;

More information

7) 8) 9),10) 11) 18) 11),16) 18) 19) 20) Vocaloid 6) Vocaloid 1 VocaListener1 2 VocaListener1 3 VocaListener VocaListener1 VocaListener1 Voca

7) 8) 9),10) 11) 18) 11),16) 18) 19) 20) Vocaloid 6) Vocaloid 1 VocaListener1 2 VocaListener1 3 VocaListener VocaListener1 VocaListener1 Voca VocaListener2: 1 1 VocaListener2 VocaListener VocaListener2 VocaListener2 VocaListener VocaListener2 VocaListener2: A Singing Synthesis System Mimicking Voice Timbre Changes in Addition to Pitch and Dynamics

More information

動画コンテンツ 動画 1 動画 2 動画 3 生成中の映像 入力音楽 選択された素片 テンポによる伸縮 音楽的構造 A B B B B B A C C : 4) 6) Web Web 2 2 c 2009 Information Processing S

動画コンテンツ 動画 1 動画 2 動画 3 生成中の映像 入力音楽 選択された素片 テンポによる伸縮 音楽的構造 A B B B B B A C C : 4) 6) Web Web 2 2 c 2009 Information Processing S 1 2 2 1 Web An Automatic Music Video Creation System by Reusing Dance Video Content Sora Murofushi, 1 Tomoyasu Nakano, 2 Masataka Goto 2 and Shigeo Morishima 1 This paper presents a system that automatically

More information

2 DS SS (SS+DS) Fig. 2 Separation algorithm for motorcycle sound by combining DS and SS (SS+DS). 3. [3] DS SS 2 SS+DS 1 1 B SS SS 4. NMF 4. 1 (NMF) Y

2 DS SS (SS+DS) Fig. 2 Separation algorithm for motorcycle sound by combining DS and SS (SS+DS). 3. [3] DS SS 2 SS+DS 1 1 B SS SS 4. NMF 4. 1 (NMF) Y a) Separation of Motorcycle Sound by Near Field Microphone Array and Nonnegative Matrix Factorization Chisaki YOSHINAGA, Nonmember, Yosuke TATEKURA a), Member, Kazuaki HAMADA, and Tetsuya KIMURA, Nonmembers

More information

IPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1.

IPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1. HARK-Binaural Raspberry Pi 2 1,a) 1 1 1 2 3 () HARK 2 HARK-Binaural A/D Raspberry Pi 2 1. [1,2] [2 5] () HARK (Honda Research Institute Japan audition for robots with Kyoto University) *1 GUI ( 1) Python

More information

IPSJ SIG Technical Report 1, Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1

IPSJ SIG Technical Report 1, Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1 1, 2 1 1 1 Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1 Nobutaka ONO 1 and Shigeki SAGAYAMA 1 This paper deals with instrument separation

More information

IPSJ SIG Technical Report Vol.2012-MUS-94 No.3 Vol.2012-SLP-90 No /2/ DTM 200 GUIN-Resonator: A system synthesizing voice with the styl

IPSJ SIG Technical Report Vol.2012-MUS-94 No.3 Vol.2012-SLP-90 No /2/ DTM 200 GUIN-Resonator: A system synthesizing voice with the styl 1 1 2 1 DTM 200 GUIN-Resonator: A system synthesizing voice with the style of Amami folk songs Daisuke Suguru, 1 Takashi Baba, 1 Masanori Morise 2 and Haruhiro Katayose 1 The recent spread of Karaoke and

More information

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF

Duplicate Near Duplicate Intact Partial Copy Original Image Near Partial Copy Near Partial Copy with a background (a) (b) 2 1 [6] SIFT SIFT SIF Partial Copy Detection of Line Drawings from a Large-Scale Database Weihan Sun, Koichi Kise Graduate School of Engineering, Osaka Prefecture University E-mail: sunweihan@m.cs.osakafu-u.ac.jp, kise@cs.osakafu-u.ac.jp

More information

27 5) STRAIGHT ) STRAIGHT 8) 3 STRAIGHT ),6),2) 7) 7),9) 5) 2. 2. STRAIGHT 5),7) 2.. spline 2..2 6) ms 2..3 4) STRAIGHT (db) ERB N(Effective Rectangul

27 5) STRAIGHT ) STRAIGHT 8) 3 STRAIGHT ),6),2) 7) 7),9) 5) 2. 2. STRAIGHT 5),7) 2.. spline 2..2 6) ms 2..3 4) STRAIGHT (db) ERB N(Effective Rectangul 2 2 4 3 STRAIGHT 3 5 2 Perceptual study on design reuse of voice identity and singing style based on singing voice morphing HIDEKI KAWAHARA, TAICHI IKOMA, MASANORI MORISE, TORU TAKAHASHI, KEN ICHI TOYODA

More information

YANGsaf [] 3. πn, (n Z) Z [16 18] 3.1 Flanagan [19] A.1 TANDEM-STRAIGHT [1] 1/ [0] A. TANDEM-STRAIGHT [] 3. [3,6] F0 [14] F0 [10] [10] 3.3 [] Vol.017-

YANGsaf [] 3. πn, (n Z) Z [16 18] 3.1 Flanagan [19] A.1 TANDEM-STRAIGHT [1] 1/ [0] A. TANDEM-STRAIGHT [] 3. [3,6] F0 [14] F0 [10] [10] 3.3 [] Vol.017- Vol.017-MUS-114 No.6 017//7 1,a),b) 3,c) 4,d) YANGsaf [1] (1) () (3) YANGsaf [, 3] FFT bin STRAIGHT TANDEM Revisiting aperiodicity estimation based on instantaneous frequency and group delay Kawahara Hideki

More information

2_05.dvi

2_05.dvi 74 68 2 2012 pp. 74 85 43.60. c * 1, 2 1 2, 3 1 2 1 4 BM CSS CSS CSM BM CSM CSS CSS CSM Blind source separation, Sparseness, Binary mas, Musical noise, Cepstral smoothing, Separated speech signals 1. BSS

More information

impulse_response.dvi

impulse_response.dvi 5 Time Time Level Level Frequency Frequency Fig. 5.1: [1] 2004. [2] P. A. Nelson, S. J. Elliott, Active Noise Control, Academic Press, 1992. [3] M. R. Schroeder, Integrated-impulse method measuring sound

More information

IPSJ SIG Technical Report Vol.2019-MUS-123 No.23 Vol.2019-SLP-127 No /6/22 Bidirectional Gated Recurrent Units Singing Voice Synthesi

IPSJ SIG Technical Report Vol.2019-MUS-123 No.23 Vol.2019-SLP-127 No /6/22 Bidirectional Gated Recurrent Units Singing Voice Synthesi Bidirectional Gated Recurrent Units Singing Voice Synthesis Using Bidirectional Gated Recurrent Units. [] (HMM) [] [3], [4] Kobe University MEC Company Ltd. (Text to Speech: TTS) [5].. 3Hz Hz c 9 Information

More information

log F0 意識 しゃべり 葉の log F0 Fig. 1 1 An example of classification of substyles of rap. ' & 2. 4) m.o.v.e 5) motsu motsu (1) (2) (3) (4) (1) (2) mot

log F0 意識 しゃべり 葉の log F0 Fig. 1 1 An example of classification of substyles of rap. ' & 2. 4) m.o.v.e 5) motsu motsu (1) (2) (3) (4) (1) (2) mot 1. 1 2 1 3 2 HMM Rap-style Singing Voice Synthesis Keijiro Saino, 1 Keiichiro Oura, 2 Makoto Tachibana, 1 Hieki Kenmochi 3 an Keiichi Tokua 2 This paper aresses rap-style singing voice synthesis. Since

More information

Vol. 48 No. 3 Mar Evaluation of Music-noise Assimilation Playback for Portable Audio Players Akifumi Inoue, Shohei Bise, Satoshi Ichimura and

Vol. 48 No. 3 Mar Evaluation of Music-noise Assimilation Playback for Portable Audio Players Akifumi Inoue, Shohei Bise, Satoshi Ichimura and Vol. 48 No. 3 Mar. 2007 1 Evaluation of Music-noise Assimilation Playback for Portable Audio Players Akifumi Inoue, Shohei Bise, Satoshi Ichimura and Yutaka Matsushita Though the population of portable

More information

IPSJ-JNL

IPSJ-JNL Vol. 52 No. 12 3853 3867 (Dec. 2011) VocaListener 1 1 VocaListener VocaListener 2 VocaListener: A Singing Synthesis System by Mimicking Pitch and Dynamics of User s Singing Tomoyasu Nakano 1 and Masataka

More information

(1970) 17) V. Kucera: A Contribution to Matrix Ouadratic Equations, IEEE Trans. on Automatic Control, AC- 17-3, 344/347 (1972) 18) V. Kucera: On Nonnegative Definite Solutions to Matrix Ouadratic Equations,

More information

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c CodeDrummer: 1 2 3 1 CodeDrummer: Sonification Methods of Function Calls in Program Execution Kazuya Sato, 1 Shigeyuki Hirai, 2 Kazutaka Maruyama 3 and Minoru Terada 1 We propose a program sonification

More information

11 22 33 12 23 1 2 3, 1 2, U2 3 U 1 U b 1 (o t ) b 2 (o t ) b 3 (o t ), 3 b (o t ) MULTI-SPEAKER SPEECH DATABASE Training Speech Analysis Mel-Cepstrum, logf0 /context1/ /context2/... Context Dependent

More information

2013 M

2013 M 2013 M0110453 2013 : M0110453 20 1 1 1.1............................ 1 1.2.............................. 4 2 5 2.1................................. 6 2.2................................. 8 2.3.................................

More information

IPSJ SIG Technical Report Vol.2012-MUS-95 No /6/2 1,a) 2,b) 1,c) 1,d) TANDEM-STRAIGHT 70 Hz 20 db Manipulation of temporal fine structures on ex

IPSJ SIG Technical Report Vol.2012-MUS-95 No /6/2 1,a) 2,b) 1,c) 1,d) TANDEM-STRAIGHT 70 Hz 20 db Manipulation of temporal fine structures on ex 1,a) 2,b) 1,c) 1,d) TANDEM-STRAIGHT 70 Hz 20 db Manipulation of temporal fine structures on excitation source and spectral envelope of singing voices and their effects on perceived impression Kawahara

More information

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF   a m Vol.55 No.1 2 15 (Jan. 2014) 1,a) 2,3,b) 4,3,c) 3,d) 2013 3 18, 2013 10 9 saccess 1 1 saccess saccess Design and Implementation of an Online Tool for Database Education Hiroyuki Nagataki 1,a) Yoshiaki

More information

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website by the author(s) under the agreement with the IPSJ.

More information

PreFEst Predominant- F0 Estimation Method EM Expectation-Maximization [20] CD 10 2. D m(t) D b (t) t F0 F i(t) (i =m, b) A i(t) D m(t) ={F m(t),a m(t)

PreFEst Predominant- F0 Estimation Method EM Expectation-Maximization [20] CD 10 2. D m(t) D b (t) t F0 F i(t) (i =m, b) A i(t) D m(t) ={F m(t),a m(t) F0 Estimation of Melody and Bass Lines in Musical Audio Signals Masataka GOTO CD EM Expectation-Maximization CD EM 1. 1 [1] [5] [4], [5] 2 CD compact disc Electrotechnical Laboratory, Tukuba-shi, 305 8568

More information

IPSJ SIG Technical Report Vol.2015-MUS-106 No.25 Vol.2015-EC-35 No /3/3 1,a) 1,b) 1,c) 1,d),,, Improving voice attractiveness by speech paramet

IPSJ SIG Technical Report Vol.2015-MUS-106 No.25 Vol.2015-EC-35 No /3/3 1,a) 1,b) 1,c) 1,d),,, Improving voice attractiveness by speech paramet ,a),b),c),d),,, Improving voice attractiveness by speech parameter modification for interactive voice training applications Yoshimoto Shoki,a) Ryuichi Nisimura,b) Toshio Irino,c) Hideki Kawahara,d) Abstract:

More information

[2][3][4][5] 4 ( 1 ) ( 2 ) ( 3 ) ( 4 ) 2. Shiratori [2] Shiratori [3] [4] GP [5] [6] [7] [8][9] Kinect Choi [10] 3. 1 c 2016 Information Processing So

[2][3][4][5] 4 ( 1 ) ( 2 ) ( 3 ) ( 4 ) 2. Shiratori [2] Shiratori [3] [4] GP [5] [6] [7] [8][9] Kinect Choi [10] 3. 1 c 2016 Information Processing So 1,a) 2 2 1 2,b) 3,c) A choreographic authoring system reflecting a user s preference Ryo Kakitsuka 1,a) Kosetsu Tsukuda 2 Satoru Fukayama 2 Naoya Iwamoto 1 Masataka Goto 2,b) Shigeo Morishima 3,c) Abstract:

More information

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q

4. C i k = 2 k-means C 1 i, C 2 i 5. C i x i p [ f(θ i ; x) = (2π) p 2 Vi 1 2 exp (x µ ] i) t V 1 i (x µ i ) 2 BIC BIC = 2 log L( ˆθ i ; x i C i ) + q x-means 1 2 2 x-means, x-means k-means Bayesian Information Criterion BIC Watershed x-means Moving Object Extraction Using the Number of Clusters Determined by X-means Clustering Naoki Kubo, 1 Kousuke

More information

IPSJ SIG Technical Report Vol.2013-GN-87 No /3/ Research of a surround-sound field adjustmen system based on loudspeakers arrangement Ak

IPSJ SIG Technical Report Vol.2013-GN-87 No /3/ Research of a surround-sound field adjustmen system based on loudspeakers arrangement Ak 1 1 3 Research of a surround-sound field adjustmen system based on loudspeakers arrangement Akiyama Daichi 1 Kanai Hideaki 1 Abstract: In this paper, we propose a presentation method that does not depend

More information

paper.dvi

paper.dvi 59 6 2003 pp. 1 11 1 43.72.Kb * 1 2 3 1. 2 2 1 1 1 [1] Person Recognition for News Videos through Multimodal Interaction, by Masakiyo Fujimoto, Yasuo Ariki and Shuji Doshita. 1 ATR 2 3 masakiyo.fujimoto@atr.jp

More information

音楽とOR(片寄)

音楽とOR(片寄) Directability 1957 1. 1950 Meyer [1] 1970 Higgins [2] 1980 Deutsch [3] 1980 [4] Lerdahl Jackendoff Generative Theory of Tonal Music (GTTM)[5] Narmour Impication - Realization Model (IRM) [6] 1950 1957

More information

Hz

Hz ( ) 2006 1 3 3 3 4 10 Hz 1 1 1.1.................................... 1 1.2.................................... 1 2 2 2.1.................................... 2 2.2.................................... 3

More information

1911 F0 5) SingBySpeaking F0 F0 F0 4 F0 2. F0 4) 5) rate extent 6) rate 5.6 [Hz] extent 87 [cent] F0 5.2 [%] F0 SingBySpeaking 7) F0 Fig. 1 1 F0 F0 co

1911 F0 5) SingBySpeaking F0 F0 F0 4 F0 2. F0 4) 5) rate extent 6) rate 5.6 [Hz] extent 87 [cent] F0 5.2 [%] F0 SingBySpeaking 7) F0 Fig. 1 1 F0 F0 co Vol. 52 No. 5 1910 1922 (May 2011) This paper describes the details of singing database for analyzing the differences of musical expressions ( and portamento) among professional singers and the effective

More information

Vol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka

Vol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka Vol. 43 No. 2 Feb. 2002,, MIDI A Probabilistic-model-based Quantization Method for Estimating the Position of Onset Time in a Score Masatoshi Hamanaka, Masataka Goto,, Hideki Asoh and Nobuyuki Otsu, This

More information

(a) F 0 (b) F 0 図 1 歌声 F0 の時間的制約 ラリを作る仕組みを構築するなどの応用が考えられる. 実 験では, 実際に市販楽曲から歌い方要素を抽出できることを 確認する. 2. 問題設定 本稿で扱う問題をまとめると以下のようになる. 入力 : 伴奏付き歌唱 / 歌唱音高列 出力

(a) F 0 (b) F 0 図 1 歌声 F0 の時間的制約 ラリを作る仕組みを構築するなどの応用が考えられる. 実 験では, 実際に市販楽曲から歌い方要素を抽出できることを 確認する. 2. 問題設定 本稿で扱う問題をまとめると以下のようになる. 入力 : 伴奏付き歌唱 / 歌唱音高列 出力 伴奏付き歌唱に含まれる歌い方要素の個別抽出 池宮由楽 1,a) 糸山克寿 1,b) 奥乃博 1,c) 概要 : 本稿では, 伴奏付き歌唱に含まれるビブラートやこぶしといった歌い方要素を個別に抽出する手法について述べる. 歌い方要素は歌唱者の個人性を強く反映し, それらを個別に検出しパラメータ化することで,CGM や MIR への多様な応用が可能となる. 本手法では, ユーザが簡易に取得できる歌唱の音高列を事前知識として用いる.

More information

TADM-STRAIGHT [7], [8] 3 (1) (2) (3) [9] 0.9% [10] [11] 2. [12] [13] glottal formant [14], [15] 3 [16] [11] (dcgcfb) [10] X 284 ( ) P

TADM-STRAIGHT [7], [8] 3 (1) (2) (3) [9] 0.9% [10] [11] 2. [12] [13] glottal formant [14], [15] 3 [16] [11] (dcgcfb) [10] X 284 ( ) P Vol.2013-MUS-99 o.47 1,a) 1,b) 1,c) 1,d) 6 56 284 Voice tells your body information Kobayashi Mayuko 1,a) isimura Ryuichi 1,b) Irino Toshio 1,c) Kawahara Hideki 1,d) Abstract: When we hear a voice, we

More information

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St 1 2 1, 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical Structures based on Phrase Similarity Yuma Ito, 1 Yoshinari Takegawa, 2 Tsutomu Terada 1, 3 and Masahiko Tsukamoto

More information

Voice-to-MIDI A Method of Note Counting and Pitch Extraction by Using Melody Rhythm Taps for Voice-to-MIDI System Naoki ITOU and Kazushi NISHIMOTO MID

Voice-to-MIDI A Method of Note Counting and Pitch Extraction by Using Melody Rhythm Taps for Voice-to-MIDI System Naoki ITOU and Kazushi NISHIMOTO MID JAIST Reposi https://dspace.j Title Voice-to-MIDIのためのメロディリズムタップを 用 いた 音 数 音 高 の 判 定 手 法 の 提 案 Author(s) 伊 藤, 直 樹 ; 西 本, 一 志 Citation 電 子 情 報 通 信 学 会 論 文 誌 D, J96-D(4): 965-977 Issue Date 2013-04-01 Type

More information

1. HNS [1] HNS HNS HNS [2] HNS [3] [4] [5] HNS 16ch SNR [6] 1 16ch 1 3 SNR [4] [5] 2. 2 HNS API HNS CS27-HNS [1] (SOA) [7] API Web 2

1. HNS [1] HNS HNS HNS [2] HNS [3] [4] [5] HNS 16ch SNR [6] 1 16ch 1 3 SNR [4] [5] 2. 2 HNS API HNS CS27-HNS [1] (SOA) [7] API Web 2 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 657 8531 1 1 E-mail: {soda,matsubara}@ws.cs.kobe-u.ac.jp, {masa-n,shinsuke,shin,yosimoto}@cs.kobe-u.ac.jp,

More information

10_08.dvi

10_08.dvi 476 67 10 2011 pp. 476 481 * 43.72.+q 1. MOS Mean Opinion Score ITU-T P.835 [1] [2] [3] Subjective and objective quality evaluation of noisereduced speech. Takeshi Yamada, Shoji Makino and Nobuhiko Kitawaki

More information

untitled

untitled ,a,b (F0 (NMF (AR F0 (VB (MU. (Nonnegative Matrix Factorization: NMF [ 3] NMF [4,5] NMF (Multiplicative Update: MU NMF ( (F0 Umezono --, Tsuuba, Ibarai 305 8568, Japan a.yoshii(ataist.go.jp b m.goto(ataist.go.jp

More information

IPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki

IPSJ SIG Technical Report Pitman-Yor 1 1 Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Aki Pitman-Yor Pitman-Yor n-gram A proposal of the melody generation method using hierarchical pitman-yor language model Akira Shirai and Tadahiro Taniguchi Although a lot of melody generation method has been

More information

5) 2. Geminoid HI-1 6) Telenoid 7) Geminoid HI-1 Geminoid HI-1 Telenoid Robot- PHONE 8) RobotPHONE 11 InterRobot 9) InterRobot InterRobot irt( ) 10) 4

5) 2. Geminoid HI-1 6) Telenoid 7) Geminoid HI-1 Geminoid HI-1 Telenoid Robot- PHONE 8) RobotPHONE 11 InterRobot 9) InterRobot InterRobot irt( ) 10) 4 Remote Hand Clapping Transmission Using Hand Clapping Machines on Live Video Streaming Masato Takahashi, Yuto Kumon,ShuheyTakeda and Masahiko Inami Abstract We propose a remote transmission system of hand

More information

Fig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system

Fig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system Study of Health Monitoring of Vehicle Structure by Using Feature Extraction based on Discrete Wavelet Transform Akihisa TABATA *4, Yoshio AOKI, Kazutaka ANDO and Masataka KATO Department of Precision Machinery

More information

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

(MIRU2008) HOG Histograms of Oriented Gradients (HOG) (MIRU2008) 2008 7 HOG - - E-mail: katsu0920@me.cs.scitec.kobe-u.ac.jp, {takigu,ariki}@kobe-u.ac.jp Histograms of Oriented Gradients (HOG) HOG Shape Contexts HOG 5.5 Histograms of Oriented Gradients D Human

More information

Gaze Head Eye (a) deg (b) 45 deg (c) 9 deg 1: - 1(b) - [5], [6] [7] Stahl [8], [9] Fang [1], [11] Itti [12] Itti [13] [7] Fang [1],

Gaze Head Eye (a) deg (b) 45 deg (c) 9 deg 1: - 1(b) - [5], [6] [7] Stahl [8], [9] Fang [1], [11] Itti [12] Itti [13] [7] Fang [1], 1 1 1 Structure from Motion - 1 Ville [1] NAC EMR-9 [2] 1 Osaka University [3], [4] 1 1(a) 1(c) 9 9 9 c 216 Information Processing Society of Japan 1 Gaze Head Eye (a) deg (b) 45 deg (c) 9 deg 1: - 1(b)

More information

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 :

149 (Newell [5]) Newell [5], [1], [1], [11] Li,Ryu, and Song [2], [11] Li,Ryu, and Song [2], [1] 1) 2) ( ) ( ) 3) T : 2 a : 3 a 1 : Transactions of the Operations Research Society of Japan Vol. 58, 215, pp. 148 165 c ( 215 1 2 ; 215 9 3 ) 1) 2) :,,,,, 1. [9] 3 12 Darroch,Newell, and Morris [1] Mcneil [3] Miller [4] Newell [5, 6], [1]

More information

2 3, 4, 5 6 2. [1] [2] [3]., [4], () [3], [5]. Mel Frequency Cepstral Coefficients (MFCC) [9] Logan [4] MFCC MFCC Flexer [10] Bogdanov2010 [3] [14],,,

2 3, 4, 5 6 2. [1] [2] [3]., [4], () [3], [5]. Mel Frequency Cepstral Coefficients (MFCC) [9] Logan [4] MFCC MFCC Flexer [10] Bogdanov2010 [3] [14],,, DEIM Forum 2016 E1-4 525-8577 1 1-1 E-mail: is0111rs@ed.ritsumei.ac.jp, oku@fc.ritsumei.ac.jp, kawagoe@is.ritsumei.ac.jp 373 1.,, itunes Store 1, Web,., 4,300., [1], [2] [3],,, [4], ( ) [3], [5].,,.,,,,

More information

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6) 1 2 1 3 Experimental Evaluation of Convenient Strain Measurement Using a Magnet for Digital Public Art Junghyun Kim, 1 Makoto Iida, 2 Takeshi Naemura 1 and Hiroyuki Ota 3 We present a basic technology

More information

3.1 Thalmic Lab Myo * Bluetooth PC Myo 8 RMS RMS t RMS(t) i (i = 1, 2,, 8) 8 SVM libsvm *2 ν-svm 1 Myo 2 8 RMS 3.2 Myo (Root

3.1 Thalmic Lab Myo * Bluetooth PC Myo 8 RMS RMS t RMS(t) i (i = 1, 2,, 8) 8 SVM libsvm *2 ν-svm 1 Myo 2 8 RMS 3.2 Myo (Root 1,a) 2 2 1. 1 College of Information Science, School of Informatics, University of Tsukuba 2 Faculty of Engineering, Information and Systems, University of Tsukuba a) oharada@iplab.cs.tsukuba.ac.jp 2.

More information

Songrium: 多様な関係性に基づく音楽視聴支援サービス

Songrium: 多様な関係性に基づく音楽視聴支援サービス Songrium: 1,a) 1,b) Web Songrium Songrium 1. [1] 1, 305-8568 1-1-1 National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Umezono, Tsukuba, Ibaraki 305-8568, Japan a) masahiro.hamasaki(at)aist.go.jp

More information

IPSJ SIG Technical Report 1,a) 1,b) 1,c) 1,d) 2,e) 2,f) 2,g) 1. [1] [2] 2 [3] Osaka Prefecture University 1 1, Gakuencho, Naka, Sakai,

IPSJ SIG Technical Report 1,a) 1,b) 1,c) 1,d) 2,e) 2,f) 2,g) 1. [1] [2] 2 [3] Osaka Prefecture University 1 1, Gakuencho, Naka, Sakai, 1,a) 1,b) 1,c) 1,d) 2,e) 2,f) 2,g) 1. [1] [2] 2 [3] 1 599 8531 1 1 Osaka Prefecture University 1 1, Gakuencho, Naka, Sakai, Osaka 599 8531, Japan 2 565 0871 Osaka University 1 1, Yamadaoka, Suita, Osaka

More information

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing Youhei Namiki 1 and Yutaka Akiyama 1 Pyrosequencing, one of the DNA sequencing technologies, allows us to determine

More information

IPSJ SIG Technical Report Vol.2014-MUS-103 No /5/25 GUI 1,a) 1,b) 1,c) 1,d).., GUI.,FIR, 3 3.,.GUI. GUI,. A GUI for manipulating growl-like tas

IPSJ SIG Technical Report Vol.2014-MUS-103 No /5/25 GUI 1,a) 1,b) 1,c) 1,d).., GUI.,FIR, 3 3.,.GUI. GUI,. A GUI for manipulating growl-like tas GUI 1,a) 1,b) 1,c) 1,d).., GUI.,FIR, 3 3.,.GUI. GUI,. A GUI for manipulating growl-like taste in singing voice Mizobuchi Shohei 1,a) Nisimura Ryuichi 1,b) Irino Toshio 1,c) Kawahara Hideki 1,d) Abstract:

More information

NTT 465 図 1.,,..,, 1980,.,, [Hori 12]..,, [Kinoshita 09]. REVERB Challange, 30,, [Delcorix 14].,,.,,,,.,.., [ 13]. 2 4 会話シーンを捉える リアルタイム会話分析 2,. 360,,,

NTT 465 図 1.,,..,, 1980,.,, [Hori 12]..,, [Kinoshita 09]. REVERB Challange, 30,, [Delcorix 14].,,.,,,,.,.., [ 13]. 2 4 会話シーンを捉える リアルタイム会話分析 2,. 360,,, 464 29 5 2014 9 企業における AI 研究の最前線 コミュニケーション科学と人工知能研究 NTT コミュニケーション科学基礎研究所の取組み Communication Science and Artificial Intelligence Research Activities at NTT Communication Science Laboratories 柏野邦夫 Kunio Kashino

More information

AUTOMATIC MEASUREMENTS OF STREAM FLOW USING FLUVIAL ACOUSTIC TOMOGRAPHY SYSTEM Kiyosi KAWANISI, Arata, KANEKO Noriaki GOHDA and Shinya

AUTOMATIC MEASUREMENTS OF STREAM FLOW USING FLUVIAL ACOUSTIC TOMOGRAPHY SYSTEM Kiyosi KAWANISI, Arata, KANEKO Noriaki GOHDA and Shinya 2010 9 AUTOMATIC MEASUREMENTS OF STREAM FLOW USING FLUVIAL ACOUSTIC TOMOGRAPHY SYSTEM 1 2 3 4 Kiyosi KAWANISI, Arata, KANEKO Noriaki GOHDA and Shinya NIGO 1 739-8527 1-4-1 2 739-8527 1-4-1 3 723-0047 12-2

More information

1(a) (b),(c) - [5], [6] Itti [12] [13] gaze eyeball head 2: [time] [7] Stahl [8], [9] Fang [1], [11] 3 -

1(a) (b),(c) - [5], [6] Itti [12] [13] gaze eyeball head 2: [time] [7] Stahl [8], [9] Fang [1], [11] 3 - Vol216-CVIM-22 No18 216/5/12 1 1 1 Structure from Motion - 1 8% Tobii Pro TX3 NAC EMR ACTUS Eye Tribe Tobii Pro Glass NAC EMR-9 Pupil Headset Ville [1] EMR-9 [2] 1 Osaka University Gaze Head Eye (a) deg

More information

CVaR

CVaR CVaR 20 4 24 3 24 1 31 ,.,.,. Markowitz,., (Value-at-Risk, VaR) (Conditional Value-at-Risk, CVaR). VaR, CVaR VaR. CVaR, CVaR. CVaR,,.,.,,,.,,. 1 5 2 VaR CVaR 6 2.1................................................

More information

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s

(a) 1 (b) 3. Gilbert Pernicka[2] Treibitz Schechner[3] Narasimhan [4] Kim [5] Nayar [6] [7][8][9] 2. X X X [10] [11] L L t L s L = L t + L s 1 1 1, Extraction of Transmitted Light using Parallel High-frequency Illumination Kenichiro Tanaka 1 Yasuhiro Mukaigawa 1 Yasushi Yagi 1 Abstract: We propose a new sharpening method of transmitted scene

More information

IPSJ SIG Technical Report Vol.2009-DPS-141 No.23 Vol.2009-GN-73 No.23 Vol.2009-EIP-46 No /11/27 t-room t-room 2 Development of

IPSJ SIG Technical Report Vol.2009-DPS-141 No.23 Vol.2009-GN-73 No.23 Vol.2009-EIP-46 No /11/27 t-room t-room 2 Development of t-room 1 2 2 2 2 1 1 2 t-room 2 Development of Assistant System for Ensemble in t-room Yosuke Irie, 1 Shigemi Aoyagi, 2 Toshihiro Takada, 2 Keiji Hirata, 2 Katsuhiko Kaji, 2 Shigeru Katagiri 1 and Miho

More information

PowerPoint Presentation

PowerPoint Presentation 2017 年 9 月 21 日日本心理学会第 81 回大会 @ 久留米シティプラザ WORLD チュートリアル 山梨大学大学院総合研究部 准教授森勢将雅 mmorise@yamanashi.ac.jp @m_morise (Twitter) 本日の概要 音声分析合成システム WORLD の紹介 基礎的な説明と利用例について WORLD の導入と簡単な使い方 ( デモ ) Windows 環境での導入例

More information

IPSJ SIG Technical Report Vol.2015-MUS-106 No.10 Vol.2015-EC-35 No /3/2 BGM 1,4,a) ,4 BGM. BGM. BGM BGM. BGM. BGM. BGM. 1.,. YouTube 201

IPSJ SIG Technical Report Vol.2015-MUS-106 No.10 Vol.2015-EC-35 No /3/2 BGM 1,4,a) ,4 BGM. BGM. BGM BGM. BGM. BGM. BGM. 1.,. YouTube 201 BGM 1,4,a) 1 2 2 3,4 BGM. BGM. BGM BGM. BGM. BGM. BGM. 1.,. YouTube 2015 1 100.. Web.. BGM.BGM [1]. BGM BGM 1 Waseda University, Shinjuku, Tokyo 169-8555, Japan 2 3 4 JST CREST a) ha-ru-ki@asagi.waseda.jp.

More information

DEIM Forum 2017 E Netflix (Video on Demand) IP 4K [1] Video on D

DEIM Forum 2017 E Netflix (Video on Demand) IP 4K [1] Video on D DEIM Forum 2017 E1-1 700-8530 3-1-1 E-mail: inoue-y@mis.cs.okayama-u.ac.jp, gotoh@cs.okayama-u.ac.jp 1. Netflix (Video on Demand) IP 4K [1] Video on Demand ( VoD) () 2. 2. 1 VoD VoD 2. 2 AbemaTV VoD VoD

More information

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing 1,a) 1,b) 1,c) 2012 11 8 2012 12 18, 2013 1 27 WEB Ruby Removal Filters Using Genetic Programming for Early-modern Japanese Printed Books Taeka Awazu 1,a) Masami Takata 1,b) Kazuki Joe 1,c) Received: November

More information

DT pdf

DT pdf 131 71 71 71 71 71 7 1 71 71 71 71 71 71 71 7 1 71 71 71 71 71 71 71 71 71 71 7 1 71 71 71 71 7 1 71 71 71 71 71 71 71 71 71 71 71 7 1 71 71 71 71 71 71 71 71 7 1 71 71 7 1 71 71 71 71 71 71 71 71 7 1

More information

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1,

2.2 6).,.,.,. Yang, 7).,,.,,. 2.3 SIFT SIFT (Scale-Invariant Feature Transform) 8).,. SIFT,,. SIFT, Mean-Shift 9)., SIFT,., SIFT,. 3.,.,,,,,.,,,., 1, 1 1 2,,.,.,,, SIFT.,,. Pitching Motion Analysis Using Image Processing Shinya Kasahara, 1 Issei Fujishiro 1 and Yoshio Ohno 2 At present, analysis of pitching motion from baseball videos is timeconsuming

More information

28 Horizontal angle correction using straight line detection in an equirectangular image

28 Horizontal angle correction using straight line detection in an equirectangular image 28 Horizontal angle correction using straight line detection in an equirectangular image 1170283 2017 3 1 2 i Abstract Horizontal angle correction using straight line detection in an equirectangular image

More information

¥ì¥·¥Ô¤Î¸À¸ì½èÍý¤Î¸½¾õ

¥ì¥·¥Ô¤Î¸À¸ì½èÍý¤Î¸½¾õ 2013 8 18 Table of Contents = + 1. 2. 3. 4. 5. etc. 1. ( + + ( )) 2. :,,,,,, (MUC 1 ) 3. 4. (subj: person, i-obj: org. ) 1 Message Understanding Conference ( ) UGC 2 ( ) : : 2 User-Generated Content [

More information

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L 1,a) 1,b) 1/f β Generation Method of Animation from Pictures with Natural Flicker Abstract: Some methods to create animation automatically from one picture have been proposed. There is a method that gives

More information

untitled

untitled The Impact of Digitization on Music Production: From a Perspective of Modularity 51 2 pp. 87-108 2003 12 I 21 3 Information and Communication Technology, ICT 0 1 1 20 1 199820012000 1 MP3 CD 2 3 II CD

More information

xx/xx Vol. Jxx A No. xx 1 Fig. 1 PAL(Panoramic Annular Lens) PAL(Panoramic Annular Lens) PAL (2) PAL PAL 2 PAL 3 2 PAL 1 PAL 3 PAL PAL 2. 1 PAL

xx/xx Vol. Jxx A No. xx 1 Fig. 1 PAL(Panoramic Annular Lens) PAL(Panoramic Annular Lens) PAL (2) PAL PAL 2 PAL 3 2 PAL 1 PAL 3 PAL PAL 2. 1 PAL PAL On the Precision of 3D Measurement by Stereo PAL Images Hiroyuki HASE,HirofumiKAWAI,FrankEKPAR, Masaaki YONEDA,andJien KATO PAL 3 PAL Panoramic Annular Lens 1985 Greguss PAL 1 PAL PAL 2 3 2 PAL DP

More information

2. [2], [3], [4] [5] [6], [7], [8] Agnihotri [6] Xu [7] [8] [9] Nakamura [10] TRECVID (TREC Video Retrieval Evaluation) [11] TRECVID TRECVID Singing s

2. [2], [3], [4] [5] [6], [7], [8] Agnihotri [6] Xu [7] [8] [9] Nakamura [10] TRECVID (TREC Video Retrieval Evaluation) [11] TRECVID TRECVID Singing s 1,a) 2,b) 2,c) 3,d) PV Audio-visual 1. Videotrine[1] YouTube 30 29 PSY GANGNAM STYLE Music clip 2014 4 19.5 29 26 Music clip 3 Music clip 1 Waseda University 2 National Institute of Advanced Industrial

More information

2.2 (a) = 1, M = 9, p i 1 = p i = p i+1 = 0 (b) = 1, M = 9, p i 1 = 0, p i = 1, p i+1 = 1 1: M 2 M 2 w i [j] w i [j] = 1 j= w i w i = (w i [ ],, w i [

2.2 (a) = 1, M = 9, p i 1 = p i = p i+1 = 0 (b) = 1, M = 9, p i 1 = 0, p i = 1, p i+1 = 1 1: M 2 M 2 w i [j] w i [j] = 1 j= w i w i = (w i [ ],, w i [ RI-002 Encoding-oriented video generation algorithm based on control with high temporal resolution Yukihiro BANDOH, Seishi TAKAMURA, Atsushi SHIMIZU 1 1T / CMOS [1] 4K (4096 2160 /) 900 Hz 50Hz,60Hz 240Hz

More information

14 2 5

14 2 5 14 2 5 i ii Surface Reconstruction from Point Cloud of Human Body in Arbitrary Postures Isao MORO Abstract We propose a method for surface reconstruction from point cloud of human body in arbitrary postures.

More information

SEISMIC HAZARD ESTIMATION BASED ON ACTIVE FAULT DATA AND HISTORICAL EARTHQUAKE DATA By Hiroyuki KAMEDA and Toshihiko OKUMURA A method is presented for using historical earthquake data and active fault

More information

2

2 Copyright 2008 Nara Institute of Science and Technology / Osaka University 2 Copyright 2008 Nara Institute of Science and Technology / Osaka University CHAOS Report in US 1994 http://www.standishgroup.com/sample_research/

More information

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR

ID 3) 9 4) 5) ID 2 ID 2 ID 2 Bluetooth ID 2 SRCid1 DSTid2 2 id1 id2 ID SRC DST SRC 2 2 ID 2 2 QR 6) 8) 6) QR QR QR QR Vol. 51 No. 11 2081 2088 (Nov. 2010) 2 1 1 1 which appended specific characters to the information such as identification to avoid parity check errors, before QR Code encoding with the structured append

More information

IPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa

IPSJ SIG Technical Report Vol.2012-CG-149 No.13 Vol.2012-CVIM-184 No /12/4 3 1,a) ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransa 3,a) 3 3 ( ) DB 3D DB 2D,,,, PnP(Perspective n-point), Ransac. DB [] [2] 3 DB Web Web DB Web NTT NTT Media Intelligence Laboratories, - Hikarinooka Yokosuka-Shi, Kanagawa 239-0847 Japan a) yabushita.hiroko@lab.ntt.co.jp

More information

の さ ま ざ ま な 要 素 技 術 と の イ ン テ グ レー シ ョ ン が 必 要 で あ り,ト ー タ ル で み た 場 合 に,研 だ い ろ い ろ あ る よ う に 思 う.今 究 に 加 え,こ 究 開 発 す べ き課 題 は,ま 後 は,個 れ ら を 融 合 す る 技 術 の 研 究 開 発 が 望 ま れ る だ ろ う. (2003年12月4日 参 図9 応 用

More information

untitled

untitled N N X=[ ] R IJK R X R ABC A=[a ] R B=[b ] R C=[c ] R ABC X =[ ] R = a b c X X X X X D( ) D(X X )= log + D( ) a a b b c c b c b c a c a c a b a b R X X A a t =a b c a = t a R i i = a =. a I R = a = b =

More information

, (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,, i

, (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,, i 25 Estimation scheme of indoor positioning using difference of times which chirp signals arrive 114348 214 3 6 , (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,,

More information