log F0 意識 しゃべり 葉の log F0 Fig. 1 1 An example of classification of substyles of rap. ' & 2. 4) m.o.v.e 5) motsu motsu (1) (2) (3) (4) (1) (2) mot

Similar documents
Vol.53 No (Mar. 2012) 1, 1,a) 1, 2 1 1, , Musical Interaction System Based on Stage Metaphor Seiko Myojin 1, 1,a

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

IPSJ SIG Technical Report Vol.2012-MUS-94 No.3 Vol.2012-SLP-90 No /2/ DTM 200 GUIN-Resonator: A system synthesizing voice with the styl

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS ) GPS Global Positioning System

IPSJ SIG Technical Report Vol.2009-DPS-141 No.23 Vol.2009-GN-73 No.23 Vol.2009-EIP-46 No /11/27 t-room t-room 2 Development of

IPSJ SIG Technical Report Secret Tap Secret Tap Secret Flick 1 An Examination of Icon-based User Authentication Method Using Flick Input for

06’ÓŠ¹/ŒØŒì

1: A/B/C/D Fig. 1 Modeling Based on Difference in Agitation Method artisoc[7] A D 2017 Information Processing

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

1 Web [2] Web [3] [4] [5], [6] [7] [8] S.W. [9] 3. MeetingShelf Web MeetingShelf MeetingShelf (1) (2) (3) (4) (5) Web MeetingShelf


Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

2006 [3] Scratch Squeak PEN [4] PenFlowchart 2 3 PenFlowchart 4 PenFlowchart PEN xdncl PEN [5] PEN xdncl DNCL 1 1 [6] 1 PEN Fig. 1 The PEN

IPSJ SIG Technical Report Vol.2009-HCI-134 No /7/17 1. RDB Wiki Wiki RDB SQL Wiki Wiki RDB Wiki RDB Wiki A Wiki System Enhanced by Visibl

DPA,, ShareLog 3) 4) 2.2 Strino Strino STRain-based user Interface with tacticle of elastic Natural ObjectsStrino 1 Strino ) PC Log-Log (2007 6)

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

Wikipedia YahooQA MAD 4)5) MAD Web 6) 3. YAMAHA 7) 8) Vocaloid PV YouTube 1 minato minato ussy 3D MAD F EDis ussy

A Study of Effective Application of CG Multimedia Contents for Help of Understandings of the Working Principles of the Internal Combustion Engine (The

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

7) 8) 9),10) 11) 18) 11),16) 18) 19) 20) Vocaloid 6) Vocaloid 1 VocaListener1 2 VocaListener1 3 VocaListener VocaListener1 VocaListener1 Voca

The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). The material has been made available on the website

Table 1. Assumed performance of a water electrol ysis plant. Fig. 1. Structure of a proposed power generation system utilizing waste heat from factori

1 UD Fig. 1 Concept of UD tourist information system. 1 ()KDDI UD 7) ) UD c 2010 Information Processing S

1911 F0 5) SingBySpeaking F0 F0 F0 4 F0 2. F0 4) 5) rate extent 6) rate 5.6 [Hz] extent 87 [cent] F0 5.2 [%] F0 SingBySpeaking 7) F0 Fig. 1 1 F0 F0 co

1 7.35% 74.0% linefeed point c 200 Information Processing Society of Japan

IPSJ SIG Technical Report Vol.2017-MUS-115 No /6/17 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListene

Vol. 42 No. SIG 8(TOD 10) July HTML 100 Development of Authoring and Delivery System for Synchronized Contents and Experiment on High Spe

IPSJ SIG Technical Report Vol.2011-DBS-153 No /11/3 Wikipedia Wikipedia Wikipedia Extracting Difference Information from Multilingual Wiki

Tf dvi

IPSJ-JNL

, PDD ASD p.,.,..,..,.,..,.,..,.,.,.,, 146

Steel Construction Vol. 6 No. 22(June 1999) Engineering

IPSJ SIG Technical Report Vol.2010-NL-199 No /11/ treebank ( ) KWIC /MeCab / Morphological and Dependency Structure Annotated Corp

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

1 p.27 Fig. 1 Example of a koto score. [1] 1 1 [1] A 2. Rogers [4] Zhang [5] [6] [7] Löchtefeld [8] Xiao [

Frequently Asked Questions (FAQ) About Sunsetting the SW-CMMR

_先端融合開発専攻_観音0314PDF用

Q [4] 2. [3] [5] ϵ- Q Q CO CO [4] Q Q [1] i = X ln n i + C (1) n i i n n i i i n i = n X i i C exploration exploitation [4] Q Q Q ϵ 1 ϵ 3. [3] [5] [4]

論文9.indd

3_39.dvi

0801297,繊維学会ファイバ11月号/報文-01-青山

1_26.dvi

2 DS SS (SS+DS) Fig. 2 Separation algorithm for motorcycle sound by combining DS and SS (SS+DS). 3. [3] DS SS 2 SS+DS 1 1 B SS SS 4. NMF 4. 1 (NMF) Y

1 4 4 [3] SNS 5 SNS , ,000 [2] c 2013 Information Processing Society of Japan

29 jjencode JavaScript

08-特集04.indd

1 1 CodeDrummer CodeMusician CodeDrummer Fig. 1 Overview of proposal system c

MDD PBL ET 9) 2) ET ET 2.2 2), 1 2 5) MDD PBL PBL MDD MDD MDD 10) MDD Executable UML 11) Executable UML MDD Executable UML

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

A Nutritional Study of Anemia in Pregnancy Hematologic Characteristics in Pregnancy (Part 1) Keizo Shiraki, Fumiko Hisaoka Department of Nutrition, Sc

I 1) 2) 51 (1976) 6.9 ha 9 (1934) 2km 15, (1955) 6 (1620)

DOUSHISYA-sports_R12339(高解像度).pdf

3_23.dvi

ICT a) Caption Presentation Method with Speech Expression Utilizing Speech Bubble Shapes for Video Content Yuko KONYA a) and Itiro SIIO 1. Graduate Sc

6_27.dvi

IPSJ SIG Technical Report Vol.2014-IOT-27 No.14 Vol.2014-SPT-11 No /10/10 1,a) 2 zabbix Consideration of a system to support understanding of f

[2] , [3] 2. 2 [4] 2. 3 BABOK BABOK(Business Analysis Body of Knowledge) BABOK IIBA(International Institute of Business Analysis) BABOK 7

FUJII, M. and KOSAKA, M. 2. J J [7] Fig. 1 J Fig. 2: Motivation and Skill improvement Model of J Orchestra Fig. 1: Motivating factors for a

TCP/IP IEEE Bluetooth LAN TCP TCP BEC FEC M T M R M T 2. 2 [5] AODV [4]DSR [3] 1 MS 100m 5 /100m 2 MD 2 c 2009 Information Processing Society of

A Feasibility Study of Direct-Mapping-Type Parallel Processing Method to Solve Linear Equations in Load Flow Calculations Hiroaki Inayoshi, Non-member

013858,繊維学会誌ファイバー1月/報文-02-古金谷

第62巻 第1号 平成24年4月/石こうを用いた木材ペレット

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

Vol. 48 No. 4 Apr LAN TCP/IP LAN TCP/IP 1 PC TCP/IP 1 PC User-mode Linux 12 Development of a System to Visualize Computer Network Behavior for L

IPSJ SIG Technical Report An Evaluation Method for the Degree of Strain of an Action Scene Mao Kuroda, 1 Takeshi Takai 1 and Takashi Matsuyama 1

IPSJ SIG Technical Report Vol.2011-MUS-91 No /7/ , 3 1 Design and Implementation on a System for Learning Songs by Presenting Musical St

Vol. 48 No. 3 Mar PM PM PMBOK PM PM PM PM PM A Proposal and Its Demonstration of Developing System for Project Managers through University-Indus

News_Letter_No35(Ver.2).p65

udc-2.dvi

JOURNAL OF THE JAPANESE ASSOCIATION FOR PETROLEUM TECHNOLOGY VOL. 66, NO. 6 (Nov., 2001) (Received August 10, 2001; accepted November 9, 2001) Alterna

untitled

2. Twitter Twitter 2.1 Twitter Twitter( ) Twitter Twitter ( 1 ) RT ReTweet RT ReTweet RT ( 2 ) URL Twitter Twitter 140 URL URL URL 140 URL URL

A Study on Throw Simulation for Baseball Pitching Machine with Rollers and Its Optimization Shinobu SAKAI*5, Yuichiro KITAGAWA, Ryo KANAI and Juhachi

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth

IPSJ SIG Technical Report Vol.2017-CLE-21 No /3/21 e 1,2 1,2 1 1,2 1 Sakai e e e Sakai e Current Status and Challenges on e-learning T


04-“²†XŒØ‘�“_-6.01

97-00


DEIM Forum 2009 E

IT,, i

OJT Planned Happenstance

IPSJ SIG Technical Report Vol.2017-ARC-225 No.12 Vol.2017-SLDM-179 No.12 Vol.2017-EMB-44 No /3/9 1 1 RTOS DefensiveZone DefensiveZone MPU RTOS

Fig, 1. Waveform of the short-circuit current peculiar to a metal. Fig. 2. Waveform of arc short-circuit current. 398 T. IEE Japan, Vol. 113-B, No. 4,

APU win-win

( ) [1] [4] ( ) 2. [5] [6] Piano Tutor[7] [1], [2], [8], [9] Radiobaton[10] Two Finger Piano[11] Coloring-in Piano[12] ism[13] MIDI MIDI 1 Fig. 1 Syst

e-learning station 1) 2) 1) 3) 2) 2) 1) 4) e-learning Station 16 e-learning e-learning key words: e-learning LMS CMS A Trial and Prospect of Kumamoto

2). 3) 4) 1.2 NICTNICT DCRA Dihedral Corner Reflector micro-arraysdcra DCRA DCRA DCRA 3D DCRA PC USB PC PC ON / OFF Velleman K8055 K8055 K8055

1 2. Nippon Cataloging Rules NCR [6] (1) 5 (2) 4 3 (3) 4 (4) 3 (5) ISSN 7 International Standard Serial Number ISSN (6) (7) 7 16 (8) ISBN ISSN I

MOTIF XF 取扱説明書

TF-IDF TDF-IDF TDF-IDF Extracting Impression of Sightseeing Spots from Blogs for Supporting Selection of Spots to Visit in Travel Sat

Instability of Aerostatic Journal Bearings with Porous Floating Bush at High Speeds Masaaki MIYATAKE *4, Shigeka YOSHIMOTO, Tomoaki CHIBA and Akira CH

IPSJ SIG Technical Report Vol.2016-MUS-111 No /5/21 1, 1 2,a) HMM A study on an implementation of semiautomatic composition of music which matc

1., 1 COOKPAD 2, Web.,,,,,,.,, [1]., 5.,, [2].,,.,.,, 5, [3].,,,.,, [4], 33,.,,.,,.. 2.,, 3.., 4., 5., ,. 1.,,., 2.,. 1,,

17 Proposal of an Algorithm of Image Extraction and Research on Improvement of a Man-machine Interface of Food Intake Measuring System

コンピューターとつなぐ

Virtual Window System Virtual Window System Virtual Window System Virtual Window System Virtual Window System Virtual Window System Social Networking

<332D985F95B62D8FAC93638BA795DB90E690B62E706466>

Vol.55 No (Jan. 2014) saccess 6 saccess 7 saccess 2. [3] p.33 * B (A) (B) (C) (D) (E) (F) *1 [3], [4] Web PDF a m

Transcription:

1. 1 2 1 3 2 HMM Rap-style Singing Voice Synthesis Keijiro Saino, 1 Keiichiro Oura, 2 Makoto Tachibana, 1 Hieki Kenmochi 3 an Keiichi Tokua 2 This paper aresses rap-style singing voice synthesis. Since it has not been very clear how to write a musical score for rap-style songs, existing singing voice synthesis systems base on musical scores are not suitable for synthesizing them with an intuitive input. Here a new type of musical score specialize for a rap-style is efine. An HMM-base singing voice synthesis system is use to realize an automatic synthesis of realistic rap-style singing. Glissano phenomenon which is special for the style coul be foun in synthesis results. It was also trie to apply pitch parameters generate from the HMMs to a sample-concatenation-base singing voice synthesis system. 1) HMM 2) VOCALOID 3) 2 3 HMM 4 5 HMM 6 1 Corporate Research an Development Center, Yamaha Corporation 2 Department of Computer Science an Engineering, Nagoya Institute of Technology 3 yamaha+ yamaha+ Division, Yamaha Corporation 1 c 2012 Information Processing Society of Japan

log F0 意識 しゃべり 葉の log F0 Fig. 1 1 An example of classification of substyles of rap. ' & 2. 4) m.o.v.e 5) motsu motsu 2.1 1 (1) (2) (3) (4) (1) (2) motsu (1), (2), (4) motsu 2 (1), (2) motsu (1) (2) メロディ構造の意識のない, しゃべり 葉のようなイントネーション Fig. 2 2 motsu Examples of log F 0 series of rap-style singing voice by motsu. motsu (4) (1) (4) (1), (2) 2.2 (1) (2) 2 c 2012 Information Processing Society of Japan

C#(+5) B(+3) G#(root) F#(-2) D#(-5) ぶし ʼ (A) (B) 2 下降 向 3. HMM つめ以降の 符 ( レッツ ʼ グリッサンド 符は通常 符と同様の単位で, 任意の さをもちうる 3 Fig. 3 The efine musical notation rules. motsu 2.1 motsu 3 16 8 3 2 5-5, -2, 0, +3, +5 1 VOCALOID 3) HMM 2) HMM 2.2 HMM HMM 2 HMM HMM 6) 3 c 2012 Information Processing Society of Japan

1 Table 1 Singing voice ata use for moel training. / motsu 11 21 6 BPM 92 130 motsu 48kHz/16bit 49 STRAIGHT 5 ms SWIPE 7) 5 ms & ' Root = C#3 HMM HMM MLSA 4. 2.2 4.1 HMM motsu 2.2 13 13 motsu motsu 13 motsu 1 (1) (4) (1), (2) motsu motsu 13 11 4 (BPM 128) Fig. 4 A part of input rap score an contour of generate log F 0. (BPM 128) 2 1 4.2 HMM (Hien Semi-Markov Moels; HSMM) 8) left-to-right 5 HMM 4 HMM 4.3 2 4 2 2 4 c 2012 Information Processing Society of Japan

Table 2 2 Subjective evaluation methos. Root = C#3 Root = C#3 A B C D ' el 信頼度区間 DMOS ' Fig. 5 5 Subjective evaluation results. 6 (BPM 100) Fig. 6 Generate log F 0 contour on each experimental conition (BPM 100). 4 19 1 5 5 (Degraation Mean Opinion Score; DMOS) 10 5 6 A D 6 5 4.4 /a/ /o/ /a/ 5. HMM VOCALOID 3) 4 5 c 2012 Information Processing Society of Japan

7 HMM VOCALOID Fig. 7 An example of VOCALOID pitch parameters converte from the parameters generate from the HMMs. VOCALOID VOCALOID VOCALOID 7 HMM VOCALOID HMM VocaListener 9) 6. HMM 2 VOCALOID HMM 7. motsu 1) H. Kenmochi, VOCALOID an Hatsune Miku phenomenon in Japan, Proc.InterSinging 2010, pp.1 4, 2010. 2) K.Oura, A.Mase, T.Yamaa, S.Muto, Y.Nankaku, an K.Tokua, Recent Development of the HMM-base Singing Voice Synthesis System - Sinsy, Proc.SSW7, pp.211 216, 2010. 3) H.Kenmochi an H.Ohshita, VOCALOID-Commercial Singing Synthesizer Base on Sample Concatenation, Proc.INTERSPEECH 2007, pp.4011 4010, 2007. 4) [DVD BOOK], (2005). 5) M.O.V.E Official Website, http://electropica.com/inex.html. 6),,,, HMM,, vol.i, 1-8-20, pp.283 284, 2010. 7) A.Camacho, SWIPE: A Sawtooth Waveform Inspire Pitch Estimator for Speech an Music, Ph.D.Thesis, University of Floria, 2007. 8) H.Zen, T.Masuko, K.Tokua, T.Kobayashi, an T.Kitamura, A Hien Semi- Markov Moel-Base Speech Synthesis System, Proc.IEICE Trans., vol.90-d, no.5, pp.825 834, 2007. 9) T. Nakano, an M. Goto, VocaListener: A Singing-to-Singing Synthesis System Base on Iterative Parameter Estimation, Proc.SMC 2009. pp.343 348, 2009. 6 c 2012 Information Processing Society of Japan