
Rap-style Singing Voice Synthesis

Keijiro Saino,1 Keiichiro Oura,2 Makoto Tachibana,1 Hideki Kenmochi3 and Keiichi Tokuda2

This paper addresses rap-style singing voice synthesis. Because it has not been clear how to write a musical score for rap-style songs, existing singing voice synthesis systems based on musical scores are not suited to synthesizing such songs from an intuitive input. Here, a new type of musical score specialized for the rap style is defined. An HMM-based singing voice synthesis system is used to realize automatic synthesis of realistic rap-style singing. The glissando phenomenon, which is characteristic of the style, could be found in the synthesis results. We also tried applying pitch parameters generated from the HMMs to a sample-concatenation-based singing voice synthesis system.

1 Corporate Research and Development Center, Yamaha Corporation
2 Department of Computer Science and Engineering, Nagoya Institute of Technology
3 yamaha+ Division, Yamaha Corporation

© 2012 Information Processing Society of Japan

1. [...] 1) [...] HMM [...] 2) [...] VOCALOID [...] 3) [...] 2 [...] 3 [...] HMM [...] 4 [...] 5 [...] HMM [...] 6 [...]

Fig. 1  An example of classification of substyles of rap.

2. [...] 4) [...] m.o.v.e 5) [...] motsu [...]

2.1 [...] (1) (2) (3) (4) [...] (1) (2) [...] motsu [...] (1), (2), (4) [...] motsu [...] 2 [...] (1), (2) [...] motsu [...] (1) (2) [...]

Fig. 2  Examples of log F0 series of rap-style singing voice by motsu. (Axis label: log F0. Annotation: intonation like spoken words, with no awareness of melodic structure.)

[...] motsu [...] (4) [...] (1) [...] (4) [...] (1), (2) [...]

2.2 [...] (1) (2) [...]

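Fig. 3  The defined musical notation rules. (Surviving labels: pitch levels C#(+5), B(+3), G#(root), F#(-2), D#(-5); panels (A) and (B); "descending"; "notes from the [...] onward"; "Let's"; "A glissando note uses the same units as a normal note and can have an arbitrary length.")

[...] motsu [...] 2.1 [...] motsu [...] 3 [...] 16 [...] 8 [...] 3 [...] 2 [...] 5 [...] -5, -2, 0, +3, +5 [...] 1 [...] VOCALOID 3) [...] HMM 2) [...]

3. HMM [...] 2.2 [...] HMM [...] HMM [...] 2 [...] HMM [...] HMM 6) [...]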
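The five pitch levels -5, -2, 0, +3, +5 and the Fig. 3 labels C#(+5), B(+3), G#(root), F#(-2), D#(-5) agree if the numbers are read as semitone offsets from the root. The sketch below is a minimal illustration under that reading, not the authors' implementation. It assumes 12-tone equal temperament, A4 = 440 Hz, the MIDI convention C4 = 60, and an arbitrary octave for the G# root; the function names are hypothetical.

```python
import math

# Semitone offsets of the five pitch levels listed in the text.
PITCH_LEVELS = (-5, -2, 0, 3, 5)

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def midi_number(name: str, octave: int) -> int:
    """MIDI note number, assuming the convention C4 = 60."""
    return 12 * (octave + 1) + NOTE_NAMES.index(name)


def level_name(root_midi: int, offset: int) -> str:
    """Pitch-class name of the level `offset` semitones from the root."""
    return NOTE_NAMES[(root_midi + offset) % 12]


def level_to_log_f0(root_midi: int, offset: int, a4_hz: float = 440.0) -> float:
    """Natural log of F0 in Hz for a pitch level (equal temperament assumed)."""
    midi = root_midi + offset
    f0_hz = a4_hz * 2.0 ** ((midi - 69) / 12.0)
    return math.log(f0_hz)


if __name__ == "__main__":
    root = midi_number("G#", 3)  # the octave is an arbitrary choice for this demo
    for offset in sorted(PITCH_LEVELS, reverse=True):
        name = level_name(root, offset)
        log_f0 = level_to_log_f0(root, offset)
        print(f"{name:>2} ({offset:+d}): log F0 = {log_f0:.4f}")
```

On a log F0 axis each semitone is the same vertical step (ln 2 / 12), so a score written in these levels maps linearly onto contours such as those in Fig. 2.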

Table 1  Singing voice data used for model training. (Surviving entries: motsu; 11; 21; 6; BPM 92 130; motsu; 48 kHz/16 bit; 49; STRAIGHT, 5 ms; SWIPE 7), 5 ms; HMM; HMM; MLSA.)

4. [...] 2.2 [...]

4.1 HMM [...] motsu [...] 2.2 [...] 13 [...] 13 [...] motsu [...] motsu [...] 13 [...] motsu [...] 1 [...] (1) (4) [...] (1), (2) [...] motsu [...] motsu [...] 13 [...] 11 [...]

Fig. 4  A part of the input rap score and the contour of generated log F0 (BPM 128). (Surviving label: Root = C#3.)

[...] 2 [...] 1 [...]

4.2 HMM [...] (Hidden Semi-Markov Models; HSMM) 8) [...] left-to-right [...] 5 [...] HMM [...] 4 [...] HMM [...]

4.3 [...] 2 [...] 4 [...] 2 [...] 2 [...]
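Note lengths in a score are tied to the tempo, while the acoustic analysis above works in 5 ms steps. As a rough illustration of how the two relate, the sketch below converts a note value at a given BPM into milliseconds and into 5 ms frames. It assumes that one beat is a quarter note and that frame counts are rounded to the nearest integer; neither assumption is confirmed by the surviving text, and the function names are hypothetical.

```python
FRAME_SHIFT_MS = 5.0  # analysis shift stated in the text


def note_duration_ms(bpm: float, note_division: int) -> float:
    """Duration of a 1/note_division note in ms, assuming a quarter-note beat."""
    if bpm <= 0 or note_division <= 0:
        raise ValueError("bpm and note_division must be positive")
    beat_ms = 60000.0 / bpm  # one quarter note
    return beat_ms * 4.0 / note_division


def note_duration_frames(bpm: float, note_division: int,
                         frame_shift_ms: float = FRAME_SHIFT_MS) -> int:
    """Nearest whole number of analysis frames covered by the note."""
    return round(note_duration_ms(bpm, note_division) / frame_shift_ms)


if __name__ == "__main__":
    for division in (4, 8, 16):
        ms = note_duration_ms(128, division)
        frames = note_duration_frames(128, division)
        print(f"1/{division} note at BPM 128: {ms:.2f} ms, about {frames} frames")
```

At BPM 128, the tempo of Fig. 4, a sixteenth note lasts about 117 ms under these assumptions, so any pitch movement inside one note spans only a couple of dozen frames.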

Table 2  Subjective evaluation methods. (Surviving entries: A, B, C, D.)

Fig. 5  Subjective evaluation results. (Surviving labels: DMOS; confidence interval.)

Fig. 6  Generated log F0 contour on each experimental condition (BPM 100). (Surviving labels: Root = C#3, Root = C#3.)

[...] 4 [...] 19 [...] 1 [...] 5 [...] 5 [...] (Degradation Mean Opinion Score; DMOS) [...] 10 [...] 5 [...] 6 [...] A [...] D [...] 6 [...] 5 [...]

4.4 [...] /a/ [...] /o/ [...] /a/ [...]

5. HMM [...] VOCALOID 3) [...] 4 [...]
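Fig. 5 reports DMOS values with confidence intervals. The sketch below shows one common way to compute such a summary from raw listener ratings. It is not taken from the paper: the 95% level, the normal approximation (z = 1.96) and the example ratings are all assumptions made for illustration, and the function name is hypothetical.

```python
import math
import statistics


def dmos_with_ci(scores, z: float = 1.96):
    """Return (mean, lower, upper) for a list of opinion scores.

    Uses a normal-approximation interval: mean +/- z * s / sqrt(n).
    z = 1.96 corresponds to a 95% level, which is an assumption here.
    """
    if len(scores) < 2:
        raise ValueError("need at least two scores")
    mean = statistics.fmean(scores)
    standard_error = statistics.stdev(scores) / math.sqrt(len(scores))
    half_width = z * standard_error
    return mean, mean - half_width, mean + half_width


if __name__ == "__main__":
    # Hypothetical ratings on a 5-point scale, for demonstration only.
    ratings = [3, 4, 5, 4, 3, 4, 5, 4]
    mean, lower, upper = dmos_with_ci(ratings)
    print(f"DMOS = {mean:.2f}, interval = [{lower:.2f}, {upper:.2f}]")
```

With few listeners, a Student t quantile in place of z would give a wider and more cautious interval.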

Fig. 7  An example of VOCALOID pitch parameters converted from the parameters generated from the HMMs.

[...] VOCALOID [...] VOCALOID [...] VOCALOID [...] 7 [...] HMM [...] VOCALOID [...] HMM [...] VocaListener 9) [...]

6. [...] HMM [...] 2 [...] VOCALOID [...] HMM [...]

7. [...] motsu [...]

1) H. Kenmochi, VOCALOID and Hatsune Miku phenomenon in Japan, Proc. InterSinging 2010, pp. 1-4, 2010.
2) K. Oura, A. Mase, T. Yamada, S. Muto, Y. Nankaku, and K. Tokuda, Recent Development of the HMM-based Singing Voice Synthesis System - Sinsy, Proc. SSW7, pp. 211-216, 2010.
3) H. Kenmochi and H. Ohshita, VOCALOID - Commercial Singing Synthesizer Based on Sample Concatenation, Proc. INTERSPEECH 2007, pp. 4011-4010, 2007.
4) [...] [DVD BOOK], [...] (2005).
5) M.O.V.E Official Website, http://electropica.com/index.html.
6) [...], HMM [...], [...], vol. I, 1-8-20, pp. 283-284, 2010.
7) A. Camacho, SWIPE: A Sawtooth Waveform Inspired Pitch Estimator for Speech and Music, Ph.D. Thesis, University of Florida, 2007.
8) H. Zen, T. Masuko, K. Tokuda, T. Kobayashi, and T. Kitamura, A Hidden Semi-Markov Model-Based Speech Synthesis System, IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, 2007.
9) T. Nakano and M. Goto, VocaListener: A Singing-to-Singing Synthesis System Based on Iterative Parameter Estimation, Proc. SMC 2009, pp. 343-348, 2009.