STRAIGHT_Tutorial_Tianjin2016.key

Size: px
Start display at page:

Download "STRAIGHT_Tutorial_Tianjin2016.key"

Transcription

1 Lecture 1 Introduction to speech signal processing in STRAIGHT vocoder Hideki Kawahara Emeritus Professor: Wakayama University, Japan Tianjin University, China, 9 December, 2016

2 Collaborators Roy D. Patterson Masanori Morise Hideki Banno Toshio Irino Ryuichi Nisimura Verena G. Skuk Stefan Schweinberger Parham Zolfaghari Ken-Ichi Sakakibara Ikuyo Masuda-Katsuse Alain de Cheveigne Josh McDermott Osamu Fujimura Toru Takahashi Tomoki Toda and many others. 2

3 Matlab

4 Link to STRAIGHT resources TANDEM- STRAIGHT and morphing Username: Tianjin-Lecture Password: STRAIGHT (Valid on 9 December, 2016) Link 4

5 Link to STRAIGHT resources legacy- STRAIGHT Username: Tianjin-Lecture Password: STRAIGHT (Valid on 9 December, 2016) Link 5

6 Lecture 1 Introduction to speech signal processing in STRAIGHT vocoder Hideki Kawahara Emeritus Professor: Wakayama University, Japan Tianjin University, China, 9 December, 2016

7 Topic Application STRAIGHT Background 7

8 Summary Application STRAIGHT Background Interference-free representations play important roles Periodic excitation is an efficient and robust strategy for sampling and transmitting relevant information for communications using voice STRAIGHT is a collection of functions and applications Extended morphing provides a unique research strategy useful for para- and non-linguistic aspects of speech 8

9 Topic Application STRAIGHT Background 9

10 Interference Vocal tract SHAPE information is mixed with interfering structure caused by repetitive structure in voiced sounds Linear predictive analysis still suffers from estimation bias caused by repetitive structure in voiced sounds Interference-free representations 10

11 Visualization: spectrogram S(ω,t) = w(τ t)s(τ )e jωτ dτ 2 wide-band narrow-band 11

12 Movie 12

13 Visualization: spectrogram S(ω,t) = w(τ t)s(τ )e jωτ dτ 2 wide-band narrow-band 13

14 Matlab

15 Matlab

16 Matlab

17

18

19 Interference Vocal tract SHAPE information is mixed with interfering structure caused by repetitive structure in voiced sounds Linear predictive analysis still suffers from estimation bias caused by repetitive structure in voiced sounds Interference-free representations 19

20 In case of speech repetition Articulator Voicing organ filter transfer function mixing nightmare of signal processing voice source fundamental frequency source wave

21 Topic Application STRAIGHT Background 21

22 STRAIGHT is a VOCODER analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 22

23 STRAIGHT: legacy to TANDEM spectrum instantaneous frequency group delay 23

24 STRAIGHT: legacy to TANDEM spectrum instantaneous frequency group delay 24

25 Interference-free representation of power spectrum Complementary set of pitch synchronized time windows and spline-based spectral smoothing and inverse filtering legacy-straight [kawahara et.al. 1999] - original idea is by Kawahara (1997) F0-adaptive set of time windows separated a half pitch period and F0-adaptive smoothing followed by digital filter compensation based on consistent sampling TANDEM-STRAIGHT [kawahara et.al. 2008]???? 25

26 Interference: power spectrum DVD 26

27 Interference-free representation of power spectrum Complementary set of pitch synchronized time windows and spline-based spectral smoothing and inverse filtering legacy-straight [kawahara et.al. 1999] F0-adaptive set of time windows separated a half pitch period and F0-adaptive smoothing followed by digital filter compensation based on consistent sampling TANDEM-STRAIGHT [kawahara et.al. 2008] - original time window idea is by Morise et.al. (2007)???? 27

28 TANDEM-STRAIGHT: periodic pulse power spectrum Movie 28

29 29

30 30

31 31

32 TANDEM: principle signal model power spectrum arbitrary real numbers windowing function TANDEM spectrum temporally varying term fundamental period 32

33 log-power spectrum TANDEM-STRAIGHT: synthetic vowel /a/ Movie 33

34 Selection of window How to select windowing function averaged spectrum temporal variation normalized duration 34

35 temporal variation single window normalized duration (re. T0)

36 * Nuttall windows Nuttall, A. H. (1981). Some windows with very good sidelobe behavior. Acoustics, Speech and Signal Processing, IEEE Transactions on, 29(1), Hann Blackman Nuttall gain (db) frequency (Hz) 36

37 Nuttall #12 in Table II Note: nuttallwin in Matlab is different.

38 temporal variation TANDEM window normalized duration (re.t0)

39 temporal variation TANDEM window normalized duration (re.t0)

40 Interference-free representation of power spectrum Complementary set of pitch synchronized time windows and spline-based spectral smoothing and inverse filtering legacy-straight [kawahara et.al. 1999] F0-adaptive set of time windows separated a half pitch period and F0-adaptive smoothing followed by digital filter compensation based on consistent sampling TANDEM-STRAIGHT [kawahara et.al. 2008, 2011]???? 40

41 Consistent sampling simple filtering Consistent sampling: recovery only at sampled points [Unser 2000] Sampling theory: whole waveform recovery 41

42 Consistent sampling recovered spectrum smoothed spectrum smoothing function frequency domain representation 42

43 Consistent sampling 1 correlation 1 filter coefficient [Kawahara & Morise 2011b] 43

44 Implementation cepstrum representation truncated and of TANDEM spectrum adjusted coefficients lifter form of the rectangular frequency smoother [Kawahara & Morise 2011b] 44

45 Test example STRAIGHT spectrum TANDEM spectrum [Kawahara & Morise 2011b] 45

46 TANDEM-STRAIGHT: natural speech Movie 46

47 STRAIGHT is a VOCODER analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 47

48 STRAIGHT: legacy to TANDEM spectrum instantaneous frequency group delay 48

49 Other cause of interference Phase spectrogram and instantaneous frequency DVD 49

50 F0 extractors using instantaneous frequency Fundamental component selection using a constant-q filter bank and AM-FM magnitude legacy STRAIGHT [kawahara et.al.1999a] Fixed point of frequency to instantaneous frequency mapping [kawahara et.al.1999b] Refinement of initial estimates of F0s using instantaneous frequency Multi-source F0 extractor with intensive manual optimization of parameters [kawahara et.al. 2005] XSX: excitation source extractor based on interference-free representation of power spectra [kawahara et.al. 2008][Fujimura et.al. 2009] YANGsaf [Kawahara et.al., 2016] 50

51 Interference-free representation of instantaneous frequency Interferences in instantaneous frequency of periodic signals Interference-free representation of instantaneous frequency (animation) Derivation of Interference-free representation of instantaneous frequency 51

52 Movie 52

53 waveform and time windows time and frequency resolution Movie phase spectrogram viewer of target representation 53

54 Movie 54

55 Movie 55

56 Interference-free representation of instantaneous frequency Interferences in instantaneous frequency of periodic signals Interference-free representation of instantaneous frequency (animation) Derivation of Interference-free representation of instantaneous frequency 56

57 Movie 57

58 Movie 58

59 Interference-free representation of instantaneous frequency Interferences in instantaneous frequency of periodic signals Interference-free representation of instantaneous frequency (animation) Derivation of Interference-free representation of instantaneous frequency 59

60 Instantaneous frequency: problem Definition: Time derivative of phase where singularity 60

61 Flanagan s equation Derivation-1 No need of inverse function 61

62 Flanagan s equation Derivation-2 Simplification and notation = 62

63 Averaged instantaneous frequency Power weighted average Derivation-3 Note! Denominator is TANDEM spectrum 63

64 Averaged instantaneous frequency Power weighted average Derivation-4 Numerator: sum of each numerator 64

65 Numerators Derivation-5 Substitution and simplification Squared terms vanish 65

66 TANDEM trick Derivation-6 Independent on time This term should be eliminated = TANDEM trick 66

67 F0 extractors using instantaneous frequency Fundamental component selection using a constant-q filter bank and AM-FM magnitude legacy STRAIGHT [kawahara et.al.1999a] Fixed point of frequency to instantaneous frequency mapping [kawahara et.al.1999b] Refinement of initial estimates of F0s using instantaneous frequency NDF: Multi-source F0 extractor with intensive manual optimization of parameters [kawahara et.al. 2005] XSX: excitation source extractor based on interference-free representation of power spectra [kawahara et.al. 2008][Fujimura et.al. 2009] YANGsaf [Kawahara et.al., 2016] 67

68 Periodicity detection by spectral division Movie 68

69 TANDEM STRAIGHT F0 adaptive processing sp. division shaping

70 F0 adaptive processing Contradiction: no F0 information multiple hypothesis and integration

71 Multiple hypothesis and integration detector-1 signal detector-2 detector-3 detector-4 integration F0 salience blackman 2.5 npo:3 std: response detector-n normalized lag in semitone (re. T0)

72 Spectral division TANDEM spectrum: <- envelope and periodic structure F0-adaptive smoothed spectrum <- envelope TANDEM spectrum spectrum only with periodic component smoothed spectrum 72

73 Selecting base-band sp. division shaping 73

74 Integration of individual detectors shaping individual response shaped response integrated response 74

75 Integration of individual detectors shaped response integrated response 75

76 Integration of individual detectors Movie 76

77 Alternating amplitude Movie 77

78 Displacement of pulse timing Movie 78

79 Analysis of Noh voice Fujimura, O., Honda, K., Kawahara, H., Konparu, Y., Morise, M., & Williams, J. C. (2009). Noh voice quality. Logopedics Phoniatrics Vocology, 34(4),

80 F0 extractors using instantaneous frequency Fundamental component selection using a constant-q filter bank and AM-FM magnitude legacy STRAIGHT [kawahara et.al.1999a] Fixed point of frequency to instantaneous frequency mapping [kawahara et.al.1999b] Refinement of initial estimates of F0s using instantaneous frequency NDF: Multi-source F0 extractor with intensive manual optimization of parameters [kawahara et.al. 2005] XSX: excitation source extractor based on interference-free representation of power spectra [kawahara et.al. 2008][Fujimura et.al. 2009] YANGsaf:[Kawahara et.al., 2016] SSW9 80

81 Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen Wakayama University, Japan ISCA SSW9, Sunnyvale, CA, USA, September, 2016

82 STRAIGHT: legacy to TANDEM spectrum instantaneous frequency group delay 82

83 STRAIGHT is a VOCODER analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 83

84 Non-periodic component Distance between lower and upper power spectrum envelope and calibration based on simulation [Kawahara et.al. 2001] Residuals of pitch scale linear prediction of frequency sub-bands and sigmoidal spectral modeling [Kawahara et.al. 2010a] Interference-free group delay representation for estimating deviation from periodicity [Kawahara et.al. 2014] Event extraction based on running kurtosis [Kawahara et.al. 2010b] the least successful component 84

85 EGG is not almighty EGG speech time (s) 85

86 glottal closure instance GCI no GCI gender:f talkerid:14 sentenceid:28 reldscrpncy:5.8 % differentiated EGG signal speech signal mismatch EGG signal close open time (s)

87 amount of mismatch relative mismatch Mismatch is not rare 10% 1% 840 utterances were tested (30 sentences 28 speakers) only 77 (in 840) utterances do not have mismatch record count sorted utterance ID

88 F0 modulation in Noh voice 6*F frequency (Hz) x 10 4 time (ms)

89 F0 modulation in Noh voice coupling 1:3 chaos? 1:2 1:3 1:2 6*F frequency (Hz) x time (ms)

90 F0 modulation power spectrum vibrato modulation relative rms level (db, semitone) modulation frequency (Hz) 90

91 Power spectrum of periodic impulse train using previous windows zeroes noise component develops MAVEBA'2001

92 Envelope calculation cepstrum original and smoothed spectrum upper and lower envelope lifter MAVEBA'2001

93 Multi resolution analysis: example estimated excitation speech waveform MAVEBA'2001

94 Non-periodic component Distance between lower and upper power spectrum envelope and calibration based on simulation [Kawahara et.al. 2001] Residuals of pitch scale linear prediction of frequency sub-bands and sigmoidal spectral modeling [Kawahara et.al. 2010a] Interference-free group delay representation for estimating deviation from periodicity [Kawahara et.al. 2014] Event extraction based on running kurtosis [Kawahara et.al. 2010b] the least successful component YANGsaf may be the answer 94

95

96 Broad-band colored noise turbulence: random boundary frequency slope converted pulse to noise ratio square root of pulse to noise ratio

97 Parameter estimation logit conversion weighted least square solution weight update

98 Fitting example

99 Non-periodic component Distance between lower and upper power spectrum envelope and calibration based on simulation [Kawahara et.al. 2001] Residuals of pitch scale linear prediction of frequency sub-bands and sigmoidal spectral modeling [Kawahara et.al. 2010a] Interference-free group delay representation for estimating deviation from periodicity [Kawahara et.al. 2014] Event extraction based on running kurtosis [Kawahara et.al. 2010b] the least successful component 99

100

101 Event detection Examples original natural speech synthesis without events synthesis with event detection

102 Event detection Examples original natural speech synthesis without events synthesis with event detection

103 Event detection Examples original natural speech synthesis without events synthesis with event detection

104 Outline Introduction: TANDEM-STRAIGHT Non-periodic component in speech sounds Wide-band noise Acoustic events Discussion

105 Outline Introduction: TANDEM-STRAIGHT Non-periodic component in speech sounds Wide-band noise Acoustic events Conclusion

106 Acoustic event detection strongly distorted distribution 4th moment 2nd moment kurtosis (non-negative) r =2, 4 implementation as filtering

107 Acoustic event detection local peak of running kurtosis closest centroid of 4th power of the windowed wave peak picking (initial estimate) practical solution event location adjustment theoretical solution filtered signal

108 Event detection example

109 Peak kurtosis distribution for 112 utterances highly non-gaussian

110 Non-periodic component Distance between lower and upper power spectrum envelope and calibration based on simulation [Kawahara et.al. 2001] Residuals of pitch scale linear prediction of frequency sub-bands and sigmoidal spectral modeling [Kawahara et.al. 2010a] Interference-free group delay representation for estimating deviation from periodicity [Kawahara et.al. 2014] Event extraction based on running kurtosis [Kawahara et.al. 2010b] the least successful component 110

111 Other cause of interference Phase spectrogram and group delay DVD 111

112 Movie

113 113 Movie

114 Movie

115 group delay Flanagan-like equation di [log(x)] τ g = dω [ ] 1 dx = I X dω = R[X]I [ dx dω ] I[X]R [ dx dω ] X 2, 115

116 Flanagan-like equation X(ω,t)= X d (ω,t)= dx(ω,t) dω = = j w(τ)x(τ t)e jωτ dτ w(τ)x(τ t) de jωτ dω dτ τw(τ)x(τ t)e jωτ dτ, τ g (ω,t)= R[X(ω,t)]I[X d(ω,t)] I[X(ω,t)]R[X d (ω,t)] X(ω,t) 2, 116

117 Interference model x(t) =δ ( t T 0 2 ) + αδ ( t + T 0 2 ) time window covers temporally repeating events with amplitude modification 117

118 ( ) ( ) Interference in power spectrum P (ω,t)= ( w t T 0 ) 2 2 ( +2αw t T α ) ( w t + T 0 ( w t + T ) ) 2 cos (2π ff0 ) periodic variation on the frequency axis 118

119 ( ) ( ) ( Interference in power spectrum P (ω,t)= ( w t T 0 ) 2 2 ( +2αw t T α ) ( w t + T 0 ( w t + T ) ) 2 cos (2π ff0 ) TANDEM trick cancels the interference P F (ω,t)= P ( ω ω 0 4,t ) + P ( ω + ω 0 4,t ) 2 119

120 Power weighted average τ ga (ω,t)= τ g1(ω,t) S 1 (ω,t) 2 + τ g2 (ω,t) S 2 (ω,t) 2 S 1 (ω,t) 2 + S 2 (ω,t) 2 P F (ω,t) Using T0/2 separation makes this sum interference-free 120

121 Numerators R[S(ω,t)]I[S d (ω,t)] = w d (t 1 )w(t 1 ) cos 2 ( ωt 1 ) α 2 w d (t 2 )w(t 2 ) cos 2 ( ωt 2 ) αw d (t 2 )w(t 1 ) cos( ωt 1 ) cos( ωt 2 ) αw d (t 1 )w(t 2 ) cos( ωt 1 ) cos( ωt 2 ) I[S(ω,t)]R[S d (ω,t)] = w d (t 1 )w(t 1 ) sin 2 ( ωt 1 ) + α 2 w d (t 2 )w(t 2 ) sin 2 ( ωt 2 ) + αw d (t 2 )w(t 1 ) sin( ωt 1 ) sin( ωt 2 ) + αw d (t 1 )w(t 2 ) sin( ωt 1 ) sin( ωt 2 ), 121

122 d d α 2 w d (t 2 )w(t 2 ) sin 2 ( ωt 2 ) Numerators: simplification + αw d (t 2 )w(t 1 ) sin( ωt 1 ) sin( ωt 2 ) + αw d (t 1 )w(t 2 ) sin( ωt 1 ) sin( ωt 2 ), sin 2 θ + cos 2 θ =1 sin cos A cos 2 θ + B cos + sin 2 θ =1 cos A sin B = cos(a A cos B B) + sin A sin B cos(a B) R[S(ω,t)]I[S d (ω,t)] I[S(ω,t)]R[S d (ω,t)] = (27) w d (t 1 )w(t 1 ) α 2 w d (t 2 )w(t 2 ) α (w d (t 1 )w(t 2 )+w d (t 2 )w(t 1 )) cos (2π ff0 ), (28) periodic component 122 P F (ω,t)

123 Interference-free group τ df (ω,t)= 1 P F (ω,t) delay [R[S(ω 1,t)]I[S d (ω 1,t)] I[S(ω 1,t)]R[S d (ω 1,t)] + R[S(ω 2,t)]I[S d (ω 2,t)] I[S(ω 2,t)]R[S d (ω 2,t)]] R I I R ω 1 = ω ω 0 4, ω 2 = ω + ω 0 where 4 Interference on the frequency axis is removed but 123

124 Interference-free group delay on the frequency axis 124

125 Interference-free group delay both in the time and frequency 125

126 waveform and time windows modulation window phase spectrogram power spectrum group delay Movie

127 Movie

128 Movie

129 Movie

130 Synthesis Minimum phase impulse response Pitch synchronized overlap and add of periodic and aperiodic components Mixed mode excitation and approximate time varying filter 130

131 STRAIGHT is a VOCODER analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 131

132 Synthesis: OLA analysis physical attributes synthesis spectral envelope analysis spectral envelope filter-1 + output signal input signal-1 F0 analysis F0 modification periodic pulse generator filter-2 signal parameter non-periodicity analysis nonperiodicity noise generator data process 132

133 Synthesis: approximate TVF analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 133

134 x[n] = FFT based convolution k x[k]δ 0,n k ( y[n] = M k=0 x[k]h[n k] cyclic version of signal with length N N 1 ( ) 2πjkn x[n] = X[k] exp N k=0 DFT coefficient N 1 ỹ[n] = h[n k] x[k] Y [k] =H[k]X[k] k=0 cyclic convolution 134

135 original signal x[n] = FFT based convolution k= s (k) [n] = k= x (k) 1 [n]w[n kl] constraint on w[n kl] =1 subdivision window k= subdivided signal implementation w 1 [n] =1,n=0, 1,...,K 1 ( ) w 2nπ 2 [n] = cos 2L K =2L +1 FFT length limit ( ) 135

136 Time varying filter in STRAIGHT x[n] = M k=0 x[k]h[n k; k] minimum phase impulse response h min (t) = 1 2π H min (ω)e jωt dt h min (t) =R [ 1 2π ] H min (ω)e jωt dt R[ln(H min (ω))] + ji[ln(h min (ω))] = c(q) = 1 2π ln(p (ω))e jωq dω c min (q)e jωq dq c min (q) = c(q) (q>0) c(0)/2 (q = 0) 0 (q <0) 136

137 Numerical examples 137

138 STRAIGHT spectrogram: section 138

139 Minimum phase responses 139

140 Time invariant and variant responses: spectral views rectangular subdivision raised cosine subdivision time invariant filter time varying filter 140

141 STRAIGHT is a VOCODER enabling flexible manipulation analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 141

142 STRAIGHT: summary STRAIGHT decomposes input speech into Interference-free spectrum Fundamental frequency: F0 Aperiodic component Virtually perfect removal of interferences Flexible manipulation of speech without introducing quality degradations STRAIGHT component procedures provide building blocks for various applications such as TTS systems STRAIGHT serves as a test-bed for new component algorithms 142

143 Topic Application STRAIGHT Background 143

144 Lecture 2 Hands on tutorial of generalized speech morphing based on STRAIGHT Hideki Kawahara Emeritus Professor: Wakayama University, Japan Tianjin University, China, 9 December, 2016

145 Topic Application STRAIGHT Background 145

146 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 146

147 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 147

148 STRAIGHT is a VOCODER enabling flexible manipulation analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal input signal-1 F0 analysis F0 modification periodic pulse generator shaper and mixer signal parameter non-periodicity analysis nonperiodicity non-periodic component generator data process 148

149 STRAIGHT GUIs Matlab APSIPA DL talk 149

150 Snapshot: F0 extraction Matlab customizable F0 extractor 150

151 Snapshot: modification Matlab duration size F0 amplitude 151

152 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 152

153 Manipulation by function calls Analysis functions Source analysis Fundamental frequency Aperiodic component Filter analysis Synthesis function Default OLA synthesis Optional: approximate time varying filter Optional: sinusoidal synthesis 153

154 Manipulation by function calls Analysis functions Source analysis Fundamental frequency Aperiodic component Filter analysis Synthesis function Default OLA synthesis Optional: approximate time varying filter Optional: sinusoidal synthesis 154

155 Fundamental frequency r = exf0candidateststraightgb(x,fs,paramsin) samplingfrequency: f0: [159x1 double] periodicitylevel: [159x1 double].. post processing rc = autof0tracking(r,x); rc.vuv = refinevoicingdecision(x,rc); 155

156 Aperiodicity parameter q = aperiodicityratiosigmoid(x,rc,sidemargin,exponent,displayon) samplingfrequency: f0: [159x1 double] vuv: [159x1 double] sigmoidparameter: [2x159 double]

157 Manipulation by function calls Analysis functions Source analysis Fundamental frequency Aperiodic component Filter analysis Synthesis function Default OLA synthesis Optional: approximate time varying filter Optional: sinusoidal synthesis 157

158 Filter analysis exspectrumtstraightgb(x,fs,sourceobj,paramsin) ElapsedTimeForSpectrum: temporalpositions: [1x159 double] spectrogramstraight: [1025x159 double] samplingfrequency: TANDEMSTRAIGHTconditions: [1x1 struct] spectrogramtandem: [1025x159 double] dateofspectrumestimation: 'DD-MM :00:58' 158

159 Manipulation by function calls Analysis functions Source analysis Fundamental frequency Aperiodic component Filter analysis Synthesis function Default OLA synthesis Optional: approximate time varying filter Optional: sinusoidal synthesis 159

160 Default OLA synthesis exgeneralstraightsynthesisr2(sourcestructure,filterstructure) synthesisout: [17106x1 double] samplingfrequency: elapsedtime: generalized framework generalstraightsynthesisframeworkr2(feedinghandle, responsehandle, deterministichandle, randomhandle, shifterhandle, datasubstrate,optionalparameters) 160

161 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 161

162 Modification by function calls Fundamental frequency manipulation (example) making fundamental frequency 1.2 times higher rm = r; rm.f0 = r.f0*1.2; s = exgeneralstraightsynthesisr2(rm,f); making fundamental frequency 50 Hz higher rm = r; rm.f0 = r.f0+50; s = exgeneralstraightsynthesisr2(rm,f); 162

163 Modification by function calls Speaking rate manipulation (example) making total duration 2 times longer rm = r; rm.temporalpositions = r.temporalpositions*2; s = exgeneralstraightsynthesisr2(rm,f); 163

164 Modification by function calls Vocal tract length manipulation (example) making vocal tract length 1.2 times longer fftl = (size(f.spectrogramstraight,1)-1)*2; fxoriginal = (0:fftl/2)/fftl*f.samplingFrequency; fxtarget = fxoriginal*1.2; fxtarget = min(f.samplingfrequency/2, fxtarget); fm = f; fm.f.spectrogramstraight = interp1(fxoriginal,f.spectrogramstraight,fxtarget); s = exgeneralstraightsynthesisr2(r,fm); 164

165 Modification by function calls Vocal tract length manipulation (example) making vocal tract length 0.8 times of the original fftl = (size(f.spectrogramstraight,1)-1)*2; fxoriginal = (0:fftl/2)/fftl*f.samplingFrequency; fxtarget = fxoriginal*0.8; fm = f; fm.f.spectrogramstraight = interp1(fxoriginal,f.spectrogramstraight,fxtarget); s = exgeneralstraightsynthesisr2(r,fm); nonlinear frequency axis modification is possible by designing fxtarget 165

166 166

167 167

168 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 168

169 Matlab

170 170

171 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 171

172 voices Temporally variable multi-aspect N-way morphing attribute 172

173 Temporally variable multi-aspect N-way morphing analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal F0 analysis F0 periodic pulse generator shaper and mixer input signal-1 non-periodicity analysis nonperiodicity morphing non-periodic component generator time axis alignment time axis mapping time axis alignment input signal-k frequency axis alignment frequency axis mapping frequency axis alignment signal analysis parameter physical attributes data input signal-n analysis a set of indexed weights of physical attributes process 173

174 STRAIGHT analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal F0 analysis F0 periodic pulse generator shaper and mixer input signal-1 non-periodicity analysis nonperiodicity morphing non-periodic component generator time axis alignment time axis mapping time axis alignment input signal-k frequency axis alignment frequency axis mapping frequency axis alignment signal analysis parameter physical attributes data input signal-n analysis a set of indexed weights of physical attributes process 174

175 Temporally variable multi-aspect N-way morphing analysis physical attributes synthesis spectral envelope analysis spectral envelope filter output signal F0 analysis F0 periodic pulse generator shaper and mixer input signal-1 non-periodicity analysis nonperiodicity morphing non-periodic component generator time axis alignment time axis mapping time axis alignment input signal-k frequency axis alignment frequency axis mapping frequency axis alignment signal analysis parameter physical attributes data input signal-n analysis a set of indexed weights of physical attributes process 175

176 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 176

177 Generalized morphing enabling extrapolation location, speed... no constraint F0, power... positivity time axis, frequency axis... monotonicity w.sum(function) exponent(w.sum(log(function))) integration(exponent(w.sum(log(function )))) derivative of function 177

178 What is the problem? interpolation 178

179 What is the problem? interpolation Break down extrapolation Non-monotonic mapping 179

180 Speech parameter constraints ( 1 ) time increases monotonically ( 2 ) frequency increases monotonically ( 3 ) time-frequency spectral representation is positive ( 4 ) fundamental frequency is positive abstract time Θ ( ) { ) ) Θ (k) (ν, τ) = f (k) 0 ( t (k) (τ), a (k) (t (k) (τ) P (f (k) (k) (ν),t (k) (τ),f (k) (ν),t (k) (τ), (1) morphing entity 180 abstract frequency ), }

181 Speech parameter constraints ( 1 ) time increases monotonically ( 2 ) frequency increases monotonically ( 3 ) time-frequency spectral representation is positive ( 4 ) fundamental frequency is positive abstract time Θ ( ) { ) ) Θ (k) (ν, τ) = f (k) 0 ( t (k) (τ), a (k) (t (k) (τ) P (f (k) (k) (ν),t (k) (τ),f (k) (ν),t (k) (τ), (1) morphing entity 181 abstract frequency ), }

182 Speech parameter constraints ( 1 ) time increases monotonically ( 2 ) frequency increases monotonically ( 3 ) time-frequency spectral representation is positive ( 4 ) fundamental frequency is positive abstract time Θ ( ) { ) ) Θ (k) (ν, τ) = f (k) 0 ( t (k) (τ), a (k) (t (k) (τ) P (f (k) (k) (ν),t (k) (τ),f (k) (ν),t (k) (τ), (1) morphing entity 182 abstract frequency ), }

183 Speech parameter constraints ( 1 ) time increases monotonically ( 2 ) frequency increases monotonically ( 3 ) time-frequency spectral representation is positive ( 4 ) fundamental frequency is positive abstract time Θ ( ) { ) ) Θ (k) (ν, τ) = f (k) 0 ( t (k) (τ), a (k) (t (k) (τ) P (f (k) (k) (ν),t (k) (τ),f (k) (ν),t (k) (τ), (1) morphing entity 183 abstract frequency ), }

184 Speech parameter constraints ( 1 ) time increases monotonically ( 2 ) frequency increases monotonically ( 3 ) time-frequency spectral representation is positive ( 4 ) fundamental frequency is positive abstract time Θ ( ) { ) ) Θ (k) (ν, τ) = f (k) 0 ( t (k) (τ), a (k) (t (k) (τ) P (f (k) (k) (ν),t (k) (τ),f (k) (ν),t (k) (τ), (1) morphing entity 184 abstract frequency ), }

185 Generalized morphing enabling extrapolation location, speed... no constraint F0, power... positivity time axis, frequency axis... monotonicity w.sum(function) exponent(w.sum(log(function))) integration(exponent(w.sum(log(function )))) derivative of function 185

186 No constraint case morphed parameter: function number of cases weight N g m1 (t m3 (τ)) = w (k) (t (k) (τ))g (k) (t (k) (τ)), (2) k=1 speech parameter index of case N w (k) (t (k) (τ)) = 1. k=1 not always necessary 186

187 Generalized morphing enabling extrapolation location, speed... no constraint F0, power... positivity time axis, frequency axis... monotonicity w.sum(function) exponent(w.sum(log(function))) integration(exponent(w.sum(log(function )))) derivative of function 187

188 positivity constraint ( N g m2 (t m3 (τ)) = exp w (k) (t (k) (τ)) log ( g (k) (t (k) (τ)) )) k=1 ( k=1 ( N ( = g (k) (t (k) (τ)) ) w (k) (t (k) (τ)), (4) g m2 (t m3 (τ)) > 0 188

189 Generalized morphing enabling extrapolation location, speed... no constraint F0, power... positivity time axis, frequency axis... monotonicity w.sum(function) exponent(w.sum(log(function))) integration(exponent(w.sum(log(function )))) derivative of function 189

190 monotonicity constraint morphed attribute: function number of cases weight ( ( τ N ( ) ) dg g m3 (τ) = exp w (k) (k) (ξ) (ξ) log dξ 0 dξ k=1 index of case τ N ( ( ) dg (k) w (ξ) (k) (ξ) = dξ, (5) dξ 0 k=1 speech attribute abstract parameter dg m3 (τ) > 0 dτ 190

191 Generalized morphing ( ( ) ) morphing entity ( examplar ( Θ m (ν, τ)=t Θ (1) (ν, τ), Θ (2) (ν, τ),...,θ (K) (ν, τ); W ), (6) W ={w F0 (τ), w A (τ), w P (τ), w Fx (τ), w Tx } (τ)}, (7) w X (τ) =[w (1) X (τ),w(2) X (τ),...,w(k) X (τ)]t } X {F 0, A, P, F x,t x } F0 aperiodicity time-frequency rep. frequency c. time c. 191

192 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 192

193 ( Implementation: ) piece-wise linear function ( ) time axis of an example ID of the example ( ) t (k) (τ) =(p (k) (τ n+1 ) p (k) (τ n ))(τ τ n )+p (k) (τ n ). (8) morphed time axis value at an anchor anchor location ID of the anchor t m3 (τ) =(p m (τ n+1 ) p m (τ n ))(τ τ n )+p m (τ n ), (11) p m (τ n )= K ( p (k) (τ n ) p (k) (τ n 1 ) ) w (k) Tx (τ n) k=1 value at morphed location + p m (τ n 1 ), (12) 193

194 Matlab implementation of function inversion yi = interp1(x,y,xi, linear, extrap ); xi = interp1(y,x,yi, linear, extrap ); 194

195 Temporally variable multi-aspect N-way morphing voices attribute 195

196 Movie

197 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 197

198 GUI for generalized morphing preparation Matlab

199 Matlab November, 2013, APSIPA, Taiwan

200 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 200

201 Morphing by scripting Matlab function for temporally variable multi-aspects arbitrary many voices morphing morphedobject = tvariablenwaymorphingraw(objectbundle,contributionstructure,dispon); synthstructure = generatemorphedsound(morphedobject); objectbundless = STRAIGHTobject: {1x8 cell} contributionstructure = timeaxis: [47x8 double] fundamentalfrequency: [47x8 double] frequencyaxis: [47x8 double] aperiodicity: [47x8 double] spectrum: [47x8 double] 201

202 Morphing by scripting Matlab function for temporally variable multi-aspects arbitrary many voices morphing morphedobject = tvariablenwaymorphingraw(objectbundle,contributionstructure,dispon); synthstructure = generatemorphedsound(morphedobject); morphedobject = morphedtimeanchors: [49x1 double] timemorphedframe: [522x8 double] morphedtargetf0: morphedf0: [1x522 double] f0listonmorphedtime: [522x8 double] frequencymappingatanchor: [1x1 struct] frameonmorphing: [522x1 double] morphedvuv: [1x522 double] contributionstructure: [1x1 struct] morphedspectrogram: [2049x522 double] morphedaperiodicity: [2x522 double] elapsedtime: cutofflistfix: [5x1 double] samplingfrequency: procedurename: 'tvariablenwaymorphing' tmpobj: [1x1 struct] 202

203 Morphing by scripting Matlab function for temporally variable multi-aspects arbitrary many voices morphing morphedobject = tvariablenwaymorphingraw(objectbundle,contributionstructure,dispon); synthstructure = generatemorphedsound(morphedobject); contributionstructure = timeaxis: [47x8 double] fundamentalfrequency: [47x8 double] frequencyaxis: [47x8 double] aperiodicity: [47x8 double] spectrum: [47x8 double] flexible manipulation can be implemented by assigning relevant weights for contributionstructure 203

204 Application Speech modification using STRAIGHT Modification using GUIs Modification by function calls Extended morphing for two voices Temporally variable multi-aspect arbitrary many voices morphing Formulation Implementation Morphing using GUIs Morphing by scripting Morphing as a research tool 204

205

206 Topic Application STRAIGHT Background 206

207 Topic Application STRAIGHT Background 207

208 Summary Application STRAIGHT Background Interference-free representations play important roles Periodic excitation is an efficient and robust strategy for sampling and transmitting relevant information for communications using voice STRAIGHT is a collection of functions and applications Extended morphing provides a unique research strategy useful for para- and non-linguistic aspects of speech 208

209 Thank you! Roy D. Patterson Masanori Morise Hideki Banno Toshio Irino Ryuichi Nisimura Verena G. Skuk Stefan Schweinberger Parham Zolfaghari Ken-Ichi Sakakibara Ikuyo Masuda-Katsuse Alain de Cheveigne Josh McDermott Osamu Fujimura Toru Takahashi Tomoki Toda and many others. 209

210 References 210

211 Reference: STRAIGHT Kawahara, H., Morise, M., Toda, T., Banno, H., Nisimura, R., & Irino, T. (2014). Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation. In Fifteenth Annual Conference of the International Speech Communication Association. Kawahara, H., Morise, M., & Sakakibara, K. I. (2013d). Temporally fine F0 extractor applied for frequency modulation power spectral analysis of singing voices. Proc. MAVEBA, Kawahara, H., Morise, M., Banno, H., & Skuk, V. G. (2013c). Temporally variable multi-aspect N-way morphing based on interference-free speech representations. In Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific (pp. 1-10). IEEE. Kawahara, H., Morise, M., Toda, T., Nisimura, R., & Irino, T. (2013b). Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds. In INTERSPEECH (pp ). Kawahara, H., Morise, M., Nisimura, R., & Irino, T. (2013a). Higher order waveform symmetry measure and its application to periodicity detectors for speech and singing with fine temporal resolution. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp ). IEEE. Kawahara, H., Morise, M., Nisimura, R., & Irino, T. (2012b). Deviation measure of waveform symmetry and its application to high-speed and temporally-fine F0 extraction for vocal sound texture manipulation. In Interspeech. Kawahara, H., & Morise, M. (2012a). Analysis and synthesis of strong vocal expressions: extension and application of audio texture features to singing voice. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp ). IEEE. Kawahara, H., & Morise, M. (2011b). Technical foundations of TANDEM-STRAIGHT, a speech analysis, modification and synthesis framework. Sadhana, 36(5), Kawahara, H., Irino, T., & Morise, M. (2011a). An interference-free representation of instantaneous frequency of periodic signals and its application to F0 extraction. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp ). IEEE. 211

212 Reference: STRAIGHT Kawahara, H., Morise, M., Takahashi, T., Banno, H., Nisimura, R. & Irino, T. (2010b). Kurtosis-based acoustic event detection and its application to speech analysis, modification and synthesis systems, Spring Annual Meeting of the Acoustical Society of Japan, [in Japanese] Kawahara, H., Morise, M., Takahashi, T., Banno, H., Nisimura, R., & Irino, T. (2010a). Simplification and extension of non-periodic excitation source representations for high-quality speech manipulation systems. In Interspeech 2010, Fujimura, O., Honda, K., Kawahara, H., Konparu, Y., Morise, M., & Williams, J. C. (2009). Noh voice quality. Logopedics Phoniatrics Vocology, 34(4), Kawahara, H., Takahashi, T., Morise, M., & Banno, H. (2009b). Development of exploratory research tools based on TANDEM-STRAIGHT. In Proceedings: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference (pp ). Asia-Pacific Signal and Information Processing Association, 2009 Annual Summit and Conference, International Organizing Committee. Kawahara, H., Nisimura, R., Irino, T., Morise, M., Takahashi, T., & Banno, H. (2009a). Temporally variable multiaspect auditory morphing enabling extrapolation without objective and perceptual breakdown. In Acoustics, Speech and Signal Processing, ICASSP IEEE International Conference on (pp ). IEEE. Kawahara, H., Morise, M., Takahashi, T., Nisimura, R., Irino, T., & Banno, H. (2008, March). TANDEM-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation. In Acoustics, Speech and Signal Processing, ICASSP IEEE International Conference on (pp ). IEEE. Banno, H., Hata, H., Morise, M., Takahashi, T., Irino, T., & Kawahara, H. (2007). Implementation of realtime STRAIGHT speech manipulation system: Report on its first implementation. Acoustical science and technology, 28(3), Kawahara, H. (2006). STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds. Acoustical science and technology, 27(6),

213 Reference: STRAIGHT Kawahara, H., de Cheveigné, A., Banno, H., Takahashi, T., & Irino, T. (2005, September). Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT. In Interspeech (pp ). Matsui, H., & Kawahara, H. (2003b). Investigation of emotionally morphed speech perception and its structure using a high quality speech manipulation system. In INTERSPEECH. Kawahara, H., & Matsui, H. (2003a). Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation. In Acoustics, Speech, and Signal Processing, Proceedings.(ICASSP'03) IEEE International Conference on (Vol. 1, pp. I-256). IEEE. Kawahara, H., Estill, J., & Fujimura, O. (2001, September). Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT. In MAVEBA (pp ). Kawahara, H., Atake, Y., & Zolfaghari, P. (2000). Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay. In INTERSPEECH (pp ). Kawahara, H., Katayose, H., de Cheveigné, A., & Patterson, R. D. (1999b). Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity. In EuroSpeech (Vol. 99, No. 6, pp ). Kawahara, H., Masuda-Katsuse, I., & de Cheveigné, A. (1999a). Restructuring speech representations using a pitch-adaptive time frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech communication, 27(3), Kawahara, H. (1997). Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited. In Acoustics, Speech, and Signal Processing, ICASSP-97., 1997 IEEE International Conference on (Vol. 2, pp ). IEEE. 213

214 Reference: using STRAIGHT Assmann, P. F., & T. M. Nearey (2008). Identification of frequency-shifted vowels. The Journal of the Acoustical Society of America, 124(5), Athanasios, T., Zañartu, M., Little, M.A., Fox, C., Ramig, L.O., & Clifford, G.D. (2014). Robust fundamental frequency estimation in sustained vowels: Detailed algorithmic comparisons and information fusion with adaptive Kalman filtering, The Journal of the Acoustical Society of America, 135(5), Bruckert, L., Bestelmeyer, P., Latinus, M., Rouger, J., Charest, I., Rousselet, G. A.,... & Belin, P. (2010). Vocal attractiveness increases by averaging. Current Biology, 20(2), d' Alessandro, C., Rilliard, A., & Le Beux, S. (2011). Chironomic stylization of intonationa). The Journal of the Acoustical Society of America, 129(3), Humes, L. E., Kewley-Port, D., Fogerty, D., & Kinney, D. (2010). Measures of hearing threshold and temporal processing across the adult lifespan. Hearing research, 264(1), Ives, D. T., Smith, D. R., & Patterson, R. D. (2005). Discrimination of speaker size from syllable phrasesa). The Journal of the Acoustical Society of America, 118(6), Kawahara, H., Kitamura, T., Takemoto, H., Nisimura, R., & Irino, T. (2014). Vocal tract length estimation based on vowels using a database consisting of 385 speakers and a database with MRI-based vocal tract shape information. In Fifteenth Annual Conference of the International Speech Communication Association. Kawahara, H., Mizobuchi, S., Morise, M., Nisimura, R., & Irino, T. (2014). Realtime conversion of growl-type voice qualities based on modulation and approximate time-varying filtering driven by a non-linear oscillator: Formulation. IPSJ SIG Technidal report, 2014-MUS-102(14), 1-6. Liu, C., & Kewley-Port, D. (2004). Vowel formant discrimination for high-fidelity speech. The Journal of the Acoustical Society of America, 116(2), Nguyen, P. C., Takao, O., & Akagi, M. (2003). Modified restricted temporal decomposition and its application to low rate speech coding. IEICE TRANSACTIONS on Information and Systems, 86(3),

215 Reference: using STRAIGHT Saitou, T., Goto, M., Unoki, M., & Akagi, M. (2007, October). Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voices. In Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on (pp ). IEEE. chweinberger, S. R., Walther, C., Zäske, R., & Kovács, G. (2011). Neural correlates of adaptation to voice identity. British Journal of psychology, 102(4), Schweinberger, S. R., Casper, C., Hauthal, N., Kaufmann, J. M., Kawahara, H., Kloth, N.,... & Zäske, R. (2008). Auditory adaptation in voice perception. Current Biology, 18(9), Skuk, V. G., & Schweinberger, S. R. (2014). Influences of Fundamental Frequency, Formant Frequencies, Aperiodicity, and Spectrum Level on the Perception of Voice Gender. Journal of Speech, Language, and Hearing Research, 57(1), Skuk, V. G., & Schweinberger, S. R. (2013). Adaptation aftereffects in vocal emotion perception elicited by expressive faces and voices. PloS one, 8(11), e Smith, D. R., Patterson, R. D., Turner, R., Kawahara, H., & Irino, T. (2005). The processing and perception of size information in speech sounds. The Journal of the Acoustical Society of America, 117(1), 305. Toda, T., & Tokuda, K. (2007). A speech parameter generation algorithm considering global variance for HMMbased speech synthesis. IEICE TRANSACTIONS on Information and Systems, 90(5), Toda, T., Saruwatari, H., & Shikano, K. (2001). Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum. In Acoustics, Speech, and Signal Processing, Proceedings.(ICASSP'01) IEEE International Conference on (Vol. 2, pp ). IEEE. Tsanas, A., Zañartu, M., Little, M.A., Fox, C., Ramig, L.O., & Clifford, G.D. (2014). Robust fundamental frequency estimation in sustained vowels: detailed algorithmic comparisons and information fusion with adaptive Kalman filtering, The Journal of the Acoustical Society of America, XXX(X), XXX. von Kriegstein, K., Smith, D. R., Patterson, R. D., Kiebel, S. J., & Griffiths, T. D. (2010). How the human brain recognizes speech in the context of changing speakers. The Journal of Neuroscience, 30(2),

216 Reference: using STRAIGHT von Kriegstein, K., Smith, D. R., Patterson, R. D., Ives, D. T., & Griffiths, T. D. (2007). Neural representation of auditory size in the human voice and in sounds from other resonant sources. Current Biology, 17(13), von Kriegstein, K., Warren, J. D., Ives, D. T., Patterson, R. D., & Griffiths, T. D. (2006). Processing the acoustic effect of size in speech sounds. Neuroimage, 32(1), Yonezawa, T., Suzuki, N., Abe, S., Mase, K., & Kogure, K. (2007). Perceptual continuity and naturalness of expressive strength in singing voices based on speech morphing. EURASIP Journal on Audio, Speech, and Music Processing, 2007(3), 2. Yu, K., & Young, S. (2011). Continuous F0 modeling for HMM based statistical parametric speech synthesis. Audio, Speech, and Language Processing, IEEE Transactions on, 19(5), Zäske, R., Schweinberger, S. R., & Kawahara, H. (2010). Voice aftereffects of adaptation to speaker identity. Hearing research, 268(1), Zäske, R., Schweinberger, S. R., Kaufmann, J. M., & Kawahara, H. (2009). In the ear of the beholder: neural correlates of adaptation to voice gender. European Journal of Neuroscience, 30(3), Zen, H., Nose, T., Yamagishi, J., Sako, S., Masuko, T., Black, A. W., & Tokuda, K. (2007a). The HMM-based speech synthesis system (HTS) version 2.0. In SSW (pp ). Zen, H., Toda, T., Nakamura, M., & Tokuda, K. (2007b). Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge IEICE transactions on information and systems, 90(1), Zen, H., Tokuda, K., & Black, A. W. (2009). Statistical parametric speech synthesis. Speech Communication, 51(11),

27 5) STRAIGHT ) STRAIGHT 8) 3 STRAIGHT ),6),2) 7) 7),9) 5) 2. 2. STRAIGHT 5),7) 2.. spline 2..2 6) ms 2..3 4) STRAIGHT (db) ERB N(Effective Rectangul

27 5) STRAIGHT ) STRAIGHT 8) 3 STRAIGHT ),6),2) 7) 7),9) 5) 2. 2. STRAIGHT 5),7) 2.. spline 2..2 6) ms 2..3 4) STRAIGHT (db) ERB N(Effective Rectangul 2 2 4 3 STRAIGHT 3 5 2 Perceptual study on design reuse of voice identity and singing style based on singing voice morphing HIDEKI KAWAHARA, TAICHI IKOMA, MASANORI MORISE, TORU TAKAHASHI, KEN ICHI TOYODA

More information

YANGsaf [] 3. πn, (n Z) Z [16 18] 3.1 Flanagan [19] A.1 TANDEM-STRAIGHT [1] 1/ [0] A. TANDEM-STRAIGHT [] 3. [3,6] F0 [14] F0 [10] [10] 3.3 [] Vol.017-

YANGsaf [] 3. πn, (n Z) Z [16 18] 3.1 Flanagan [19] A.1 TANDEM-STRAIGHT [1] 1/ [0] A. TANDEM-STRAIGHT [] 3. [3,6] F0 [14] F0 [10] [10] 3.3 [] Vol.017- Vol.017-MUS-114 No.6 017//7 1,a),b) 3,c) 4,d) YANGsaf [1] (1) () (3) YANGsaf [, 3] FFT bin STRAIGHT TANDEM Revisiting aperiodicity estimation based on instantaneous frequency and group delay Kawahara Hideki

More information

7) 8) 9),10) 11) 18) 11),16) 18) 19) 20) Vocaloid 6) Vocaloid 1 VocaListener1 2 VocaListener1 3 VocaListener VocaListener1 VocaListener1 Voca

7) 8) 9),10) 11) 18) 11),16) 18) 19) 20) Vocaloid 6) Vocaloid 1 VocaListener1 2 VocaListener1 3 VocaListener VocaListener1 VocaListener1 Voca VocaListener2: 1 1 VocaListener2 VocaListener VocaListener2 VocaListener2 VocaListener VocaListener2 VocaListener2: A Singing Synthesis System Mimicking Voice Timbre Changes in Addition to Pitch and Dynamics

More information

11 22 33 12 23 1 2 3, 1 2, U2 3 U 1 U b 1 (o t ) b 2 (o t ) b 3 (o t ), 3 b (o t ) MULTI-SPEAKER SPEECH DATABASE Training Speech Analysis Mel-Cepstrum, logf0 /context1/ /context2/... Context Dependent

More information

IPSJ SIG Technical Report Vol.2014-MUS-104 No /8/27 F0 1,a) 1,b) 1,c) 2,d) (F0) F0 F0 Graphical User Interface (GUI) F0 1. [1] CD MIDI [2] [3,

IPSJ SIG Technical Report Vol.2014-MUS-104 No /8/27 F0 1,a) 1,b) 1,c) 2,d) (F0) F0 F0 Graphical User Interface (GUI) F0 1. [1] CD MIDI [2] [3, F,a),b),c) 2,d) (F) F F Graphical User Interface (GUI) F. [] CD MIDI [2] [3, 4] [5] 2 a) ikemiya@kuis.kyoto-u.ac.jp b) itoyama@kuis.kyoto-u.ac.jp c) yoshii@kuis.kyoto-u.ac.jp d) okuno@aoni.waseda.jp TANDEM-STRAIGHT

More information

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels).

Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels). Fig. 1 The scheme of glottal area as a function of time Fig. 3 Flow diagram of image processing. Black rectangle in the photo indicates the processing area (128 x 32 pixels). Fig, 4 Parametric representation

More information

The Japanese Journal of Psychology 2004, Vol. 75, No. 5, 397-406 Effects of fundamental frequency and speech rate on impression formation Teruhisa Uchida and Naoko Nakaune (Research Division, The National

More information

IPSJ SIG Technical Report Vol.2017-MUS-115 No /6/17 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListene

IPSJ SIG Technical Report Vol.2017-MUS-115 No /6/17 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListene 1,a) 1 1 WORLD F0 Vocaloid F0 ipad 1. Vocaloid [1] UTAU *1 Vocaloid Vocaloid F0 VocaListener [2], [3] Vocaloid *2 VocaListener Vocaloid 1 University of Yamanashi a) g16tk018@yamanashi.ac.jp *1 http://utau2008.web.fc2.com/

More information

Isogai, T., Building a dynamic correlation network for fat-tailed financial asset returns, Applied Network Science (7):-24, 206,

Isogai, T., Building a dynamic correlation network for fat-tailed financial asset returns, Applied Network Science (7):-24, 206, H28. (TMU) 206 8 29 / 34 2 3 4 5 6 Isogai, T., Building a dynamic correlation network for fat-tailed financial asset returns, Applied Network Science (7):-24, 206, http://link.springer.com/article/0.007/s409-06-0008-x

More information

IPSJ SIG Technical Report Vol.2014-MUS-103 No /5/25 GUI 1,a) 1,b) 1,c) 1,d).., GUI.,FIR, 3 3.,.GUI. GUI,. A GUI for manipulating growl-like tas

IPSJ SIG Technical Report Vol.2014-MUS-103 No /5/25 GUI 1,a) 1,b) 1,c) 1,d).., GUI.,FIR, 3 3.,.GUI. GUI,. A GUI for manipulating growl-like tas GUI 1,a) 1,b) 1,c) 1,d).., GUI.,FIR, 3 3.,.GUI. GUI,. A GUI for manipulating growl-like taste in singing voice Mizobuchi Shohei 1,a) Nisimura Ryuichi 1,b) Irino Toshio 1,c) Kawahara Hideki 1,d) Abstract:

More information

H(ω) = ( G H (ω)g(ω) ) 1 G H (ω) (6) 2 H 11 (ω) H 1N (ω) H(ω)= (2) H M1 (ω) H MN (ω) [ X(ω)= X 1 (ω) X 2 (ω) X N (ω) ] T (3)

H(ω) = ( G H (ω)g(ω) ) 1 G H (ω) (6) 2 H 11 (ω) H 1N (ω) H(ω)= (2) H M1 (ω) H MN (ω) [ X(ω)= X 1 (ω) X 2 (ω) X N (ω) ] T (3) 72 12 2016 pp. 777 782 777 * 43.60.Pt; 43.38.Md; 43.60.Sx 1. 1 2 [1 8] Flexible acoustic interface based on 3D sound reproduction. Yosuke Tatekura (Shizuoka University, Hamamatsu, 432 8561) 2. 2.1 3 M

More information

2013 M

2013 M 2013 M0110453 2013 : M0110453 20 1 1 1.1............................ 1 1.2.............................. 4 2 5 2.1................................. 6 2.2................................. 8 2.3.................................

More information

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L

Input image Initialize variables Loop for period of oscillation Update height map Make shade image Change property of image Output image Change time L 1,a) 1,b) 1/f β Generation Method of Animation from Pictures with Natural Flicker Abstract: Some methods to create animation automatically from one picture have been proposed. There is a method that gives

More information

10_08.dvi

10_08.dvi 476 67 10 2011 pp. 476 481 * 43.72.+q 1. MOS Mean Opinion Score ITU-T P.835 [1] [2] [3] Subjective and objective quality evaluation of noisereduced speech. Takeshi Yamada, Shoji Makino and Nobuhiko Kitawaki

More information

28 Horizontal angle correction using straight line detection in an equirectangular image

28 Horizontal angle correction using straight line detection in an equirectangular image 28 Horizontal angle correction using straight line detection in an equirectangular image 1170283 2017 3 1 2 i Abstract Horizontal angle correction using straight line detection in an equirectangular image

More information

1 - - - - - - - - - - 2 - - - - Google voice search - - Google translation - - Siri, - - 3 - - - - - 4 5 6 7 - - - - - 29-1 29-81 ( ) - 81 CD - 56.7 (CD =700 ) - - Sub-word 8 - sub-word - - /a/ /k/ /i/

More information

200708_LesHouches_02.ppt

200708_LesHouches_02.ppt Numerical Methods for Geodynamo Simulation Akira Kageyama Earth Simulator Center, JAMSTEC, Japan Part 2 Geodynamo Simulations in a Sphere or a Spherical Shell Outline 1. Various numerical methods used

More information

1. HNS [1] HNS HNS HNS [2] HNS [3] [4] [5] HNS 16ch SNR [6] 1 16ch 1 3 SNR [4] [5] 2. 2 HNS API HNS CS27-HNS [1] (SOA) [7] API Web 2

1. HNS [1] HNS HNS HNS [2] HNS [3] [4] [5] HNS 16ch SNR [6] 1 16ch 1 3 SNR [4] [5] 2. 2 HNS API HNS CS27-HNS [1] (SOA) [7] API Web 2 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. 657 8531 1 1 E-mail: {soda,matsubara}@ws.cs.kobe-u.ac.jp, {masa-n,shinsuke,shin,yosimoto}@cs.kobe-u.ac.jp,

More information

IPSJ SIG Technical Report Vol.2012-MUS-94 No.3 Vol.2012-SLP-90 No /2/ DTM 200 GUIN-Resonator: A system synthesizing voice with the styl

IPSJ SIG Technical Report Vol.2012-MUS-94 No.3 Vol.2012-SLP-90 No /2/ DTM 200 GUIN-Resonator: A system synthesizing voice with the styl 1 1 2 1 DTM 200 GUIN-Resonator: A system synthesizing voice with the style of Amami folk songs Daisuke Suguru, 1 Takashi Baba, 1 Masanori Morise 2 and Haruhiro Katayose 1 The recent spread of Karaoke and

More information

IPSJ SIG Technical Report Vol.2012-MUS-95 No /6/2 1,a) 2,b) 1,c) 1,d) TANDEM-STRAIGHT 70 Hz 20 db Manipulation of temporal fine structures on ex

IPSJ SIG Technical Report Vol.2012-MUS-95 No /6/2 1,a) 2,b) 1,c) 1,d) TANDEM-STRAIGHT 70 Hz 20 db Manipulation of temporal fine structures on ex 1,a) 2,b) 1,c) 1,d) TANDEM-STRAIGHT 70 Hz 20 db Manipulation of temporal fine structures on excitation source and spectral envelope of singing voices and their effects on perceived impression Kawahara

More information

2007-Kanai-paper.dvi

2007-Kanai-paper.dvi 19 Estimation of Sound Source Zone using The Arrival Time Interval 1080351 2008 3 7 S/N 2 2 2 i Abstract Estimation of Sound Source Zone using The Arrival Time Interval Koichiro Kanai The microphone array

More information

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth

Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth Studies of Foot Form for Footwear Design (Part 9) : Characteristics of the Foot Form of Young and Elder Women Based on their Sizes of Ball Joint Girth and Foot Breadth Akiko Yamamoto Fukuoka Women's University,

More information

IPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1.

IPSJ SIG Technical Report Vol.2015-MUS-107 No /5/23 HARK-Binaural Raspberry Pi 2 1,a) ( ) HARK 2 HARK-Binaural A/D Raspberry Pi 2 1. HARK-Binaural Raspberry Pi 2 1,a) 1 1 1 2 3 () HARK 2 HARK-Binaural A/D Raspberry Pi 2 1. [1,2] [2 5] () HARK (Honda Research Institute Japan audition for robots with Kyoto University) *1 GUI ( 1) Python

More information

pp d 2 * Hz Hz 3 10 db Wind-induced noise, Noise reduction, Microphone array, Beamforming 1

pp d 2 * Hz Hz 3 10 db Wind-induced noise, Noise reduction, Microphone array, Beamforming 1 72 12 2016 pp. 739 748 739 43.60.+d 2 * 1 2 2 3 2 125 Hz 0.3 0.8 2 125 Hz 3 10 db Wind-induced noise, Noise reduction, Microphone array, Beamforming 1. 1.1 PSS [1] [2 4] 2 Wind-induced noise reduction

More information

<4D F736F F D B B83578B6594BB2D834A836F815B82D082C88C60202E646F63>

<4D F736F F D B B83578B6594BB2D834A836F815B82D082C88C60202E646F63> 通信方式第 2 版 サンプルページ この本の定価 判型などは, 以下の URL からご覧いただけます. http://www.morikita.co.jp/books/mid/072662 このサンプルページの内容は, 第 2 版発行当時のものです. i 2 2 2 2012 5 ii,.,,,,,,.,.,,,,,.,,.,,..,,,,.,,.,.,,.,,.. 1990 5 iii 1 1

More information

Vol. 48 No. 3 Mar Evaluation of Music-noise Assimilation Playback for Portable Audio Players Akifumi Inoue, Shohei Bise, Satoshi Ichimura and

Vol. 48 No. 3 Mar Evaluation of Music-noise Assimilation Playback for Portable Audio Players Akifumi Inoue, Shohei Bise, Satoshi Ichimura and Vol. 48 No. 3 Mar. 2007 1 Evaluation of Music-noise Assimilation Playback for Portable Audio Players Akifumi Inoue, Shohei Bise, Satoshi Ichimura and Yutaka Matsushita Though the population of portable

More information

14 2 5

14 2 5 14 2 5 i ii Surface Reconstruction from Point Cloud of Human Body in Arbitrary Postures Isao MORO Abstract We propose a method for surface reconstruction from point cloud of human body in arbitrary postures.

More information

TADM-STRAIGHT [7], [8] 3 (1) (2) (3) [9] 0.9% [10] [11] 2. [12] [13] glottal formant [14], [15] 3 [16] [11] (dcgcfb) [10] X 284 ( ) P

TADM-STRAIGHT [7], [8] 3 (1) (2) (3) [9] 0.9% [10] [11] 2. [12] [13] glottal formant [14], [15] 3 [16] [11] (dcgcfb) [10] X 284 ( ) P Vol.2013-MUS-99 o.47 1,a) 1,b) 1,c) 1,d) 6 56 284 Voice tells your body information Kobayashi Mayuko 1,a) isimura Ryuichi 1,b) Irino Toshio 1,c) Kawahara Hideki 1,d) Abstract: When we hear a voice, we

More information

(MIRU2008) HOG Histograms of Oriented Gradients (HOG)

(MIRU2008) HOG Histograms of Oriented Gradients (HOG) (MIRU2008) 2008 7 HOG - - E-mail: katsu0920@me.cs.scitec.kobe-u.ac.jp, {takigu,ariki}@kobe-u.ac.jp Histograms of Oriented Gradients (HOG) HOG Shape Contexts HOG 5.5 Histograms of Oriented Gradients D Human

More information

LMC6022 Low Power CMOS Dual Operational Amplifier (jp)

LMC6022 Low Power CMOS Dual Operational Amplifier (jp) Low Power CMOS Dual Operational Amplifier Literature Number: JAJS754 CMOS CMOS (100k 5k ) 0.5mW CMOS CMOS LMC6024 100k 5k 120dB 2.5 V/ 40fA Low Power CMOS Dual Operational Amplifier 19910530 33020 23900

More information

SICE東北支部研究集会資料(2012年)

SICE東北支部研究集会資料(2012年) 77 (..3) 77- A study on disturbance compensation control of a wheeled inverted pendulum robot during arm manipulation using Extended State Observer Luis Canete Takuma Sato, Kenta Nagano,Luis Canete,Takayuki

More information

23_02.dvi

23_02.dvi Vol. 2 No. 2 10 21 (Mar. 2009) 1 1 1 Effect of Overconfidencial Investor to Stock Market Behaviour Ryota Inaishi, 1 Fei Zhai 1 and Eisuke Kita 1 Recently, the behavioral finance theory has been interested

More information

第 55 回自動制御連合講演会 2012 年 11 月 17 日,18 日京都大学 1K403 ( ) Interpolation for the Gas Source Detection using the Parameter Estimation in a Sensor Network S. T

第 55 回自動制御連合講演会 2012 年 11 月 17 日,18 日京都大学 1K403 ( ) Interpolation for the Gas Source Detection using the Parameter Estimation in a Sensor Network S. T 第 55 回自動制御連合講演会 212 年 11 月 日, 日京都大学 1K43 () Interpolation for the Gas Source Detection using the Parameter Estimation in a Sensor Network S. Tokumoto, T. Namerikawa (Keio Univ. ) Abstract The purpose of

More information

013858,繊維学会誌ファイバー1月/報文-02-古金谷

013858,繊維学会誌ファイバー1月/報文-02-古金谷 Development of Non-Contact Measuring Method for Final Twist Number of Double Ply Staple Yarn Keizo Koganeya 1, Youichi Yukishita 1, Hirotaka Fujisaki 1, Yasunori Jintoku 2, Hironori Okuno 2, and Motoharu

More information

IPSJ SIG Technical Report Vol.2015-MUS-106 No.25 Vol.2015-EC-35 No /3/3 1,a) 1,b) 1,c) 1,d),,, Improving voice attractiveness by speech paramet

IPSJ SIG Technical Report Vol.2015-MUS-106 No.25 Vol.2015-EC-35 No /3/3 1,a) 1,b) 1,c) 1,d),,, Improving voice attractiveness by speech paramet ,a),b),c),d),,, Improving voice attractiveness by speech parameter modification for interactive voice training applications Yoshimoto Shoki,a) Ryuichi Nisimura,b) Toshio Irino,c) Hideki Kawahara,d) Abstract:

More information

1 1 tf-idf tf-idf i

1 1 tf-idf tf-idf i 14 A Method of Article Retrieval Utilizing Characteristics in Newspaper Articles 1055104 2003 1 31 1 1 tf-idf tf-idf i Abstract A Method of Article Retrieval Utilizing Characteristics in Newspaper Articles

More information

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came

IPSJ SIG Technical Report Vol.2012-CG-148 No /8/29 3DCG 1,a) On rigid body animation taking into account the 3D computer graphics came 3DCG 1,a) 2 2 2 2 3 On rigid body animation taking into account the 3D computer graphics camera viewpoint Abstract: In using computer graphics for making games or motion pictures, physics simulation is

More information

IPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter

IPSJ SIG Technical Report Vol.2014-CG-155 No /6/28 1,a) 1,2,3 1 3,4 CG An Interpolation Method of Different Flow Fields using Polar Inter ,a),2,3 3,4 CG 2 2 2 An Interpolation Method of Different Flow Fields using Polar Interpolation Syuhei Sato,a) Yoshinori Dobashi,2,3 Tsuyoshi Yamamoto Tomoyuki Nishita 3,4 Abstract: Recently, realistic

More information

EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju

EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Ju EQUIVALENT TRANSFORMATION TECHNIQUE FOR ISLANDING DETECTION METHODS OF SYNCHRONOUS GENERATOR -REACTIVE POWER PERTURBATION METHODS USING AVR OR SVC- Jun Motohashi, Member, Takashi Ichinose, Member (Tokyo

More information

磁性物理学 - 遷移金属化合物磁性のスピンゆらぎ理論

磁性物理学 - 遷移金属化合物磁性のスピンゆらぎ理論 email: takahash@sci.u-hyogo.ac.jp May 14, 2009 Outline 1. 2. 3. 4. 5. 6. 2 / 262 Today s Lecture: Mode-mode Coupling Theory 100 / 262 Part I Effects of Non-linear Mode-Mode Coupling Effects of Non-linear

More information

a) Extraction of Similarities and Differences in Human Behavior Using Singular Value Decomposition Kenichi MISHIMA, Sayaka KANATA, Hiroaki NAKANISHI a

a) Extraction of Similarities and Differences in Human Behavior Using Singular Value Decomposition Kenichi MISHIMA, Sayaka KANATA, Hiroaki NAKANISHI a a) Extraction of Similarities and Differences in Human Behavior Using Singular Value Decomposition Kenichi MISHIMA, Sayaka KANATA, Hiroaki NAKANISHI a), Tetsuo SAWARAGI, and Yukio HORIGUCHI 1. Johansson

More information

IPSJ SIG Technical Report 1, Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1

IPSJ SIG Technical Report 1, Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1 1, 2 1 1 1 Instrument Separation in Reverberant Environments Using Crystal Microphone Arrays Nobutaka ITO, 1, 2 Yu KITANO, 1 Nobutaka ONO 1 and Shigeki SAGAYAMA 1 This paper deals with instrument separation

More information

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int

B HNS 7)8) HNS ( ( ) 7)8) (SOA) HNS HNS 4) HNS ( ) ( ) 1 TV power, channel, volume power true( ON) false( OFF) boolean channel volume int SOA 1 1 1 1 (HNS) HNS SOA SOA 3 3 A Service-Oriented Platform for Feature Interaction Detection and Resolution in Home Network System Yuhei Yoshimura, 1 Takuya Inada Hiroshi Igaki 1, 1 and Masahide Nakamura

More information

0A_SeibutsuJyoho-RF.ppt

0A_SeibutsuJyoho-RF.ppt A ON-Center OFF-Center DeAngelis, Ohzawa, Freeman 1995 Nobel Prize 1981: Physiology and Medicine D.H. Hubel and T.N. Wiesel T.N. Wiesel D.H. Hubel V1/V2: (spikes) Display? Amplifiers and Filters V1 - simple

More information

Fig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system

Fig. 2 Signal plane divided into cell of DWT Fig. 1 Schematic diagram for the monitoring system Study of Health Monitoring of Vehicle Structure by Using Feature Extraction based on Discrete Wavelet Transform Akihisa TABATA *4, Yoshio AOKI, Kazutaka ANDO and Masataka KATO Department of Precision Machinery

More information

IPSJ-SLP

IPSJ-SLP F0 MFCC 1 2 3 1 1 1 1 MFCCF0 1 86.7% 90.2% A System for Automatic Discrimination between Singing and Speaking Voices on the Basis of Peak Interval of Spectral Change, F0, and MFCC Shimpei Aso, 1 Takeshi

More information

untitled

untitled JAIS 1 2 1 2 In this paper, we focus on the pauses that partly characterize the utterances of simultaneous interpreters, and attempt to analyze the results of experiments conducted using human subjects

More information

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS 2 3 4 5 2. 2.1 3 1) GPS Global Positioning System

258 5) GPS 1 GPS 6) GPS DP 7) 8) 10) GPS GPS 2 3 4 5 2. 2.1 3 1) GPS Global Positioning System Vol. 52 No. 1 257 268 (Jan. 2011) 1 2, 1 1 measurement. In this paper, a dynamic road map making system is proposed. The proposition system uses probe-cars which has an in-vehicle camera and a GPS receiver.

More information

浜松医科大学紀要

浜松医科大学紀要 On the Statistical Bias Found in the Horse Racing Data (1) Akio NODA Mathematics Abstract: The purpose of the present paper is to report what type of statistical bias the author has found in the horse

More information

鉄鋼協会プレゼン

鉄鋼協会プレゼン NN :~:, 8 Nov., Adaptive H Control for Linear Slider with Friction Compensation positioning mechanism moving table stand manipulator Point to Point Control [G] Continuous Path Control ground Fig. Positoining

More information

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2

1 Fig. 1 Extraction of motion,.,,, 4,,, 3., 1, 2. 2.,. CHLAC,. 2.1,. (256 ).,., CHLAC. CHLAC, HLAC. 2.3 (HLAC ) r,.,. HLAC. N. 2 HLAC Fig. 2 CHLAC 1 2 3 3,. (CHLAC), 1).,.,, CHLAC,.,. Suspicious Behavior Detection based on CHLAC Method Hideaki Imanishi, 1 Toyohiro Hayashi, 2 Shuichi Enokida 3 and Toshiaki Ejima 3 We have proposed a method for

More information

IPSJ SIG Technical Report Vol.2013-GN-87 No /3/ Research of a surround-sound field adjustmen system based on loudspeakers arrangement Ak

IPSJ SIG Technical Report Vol.2013-GN-87 No /3/ Research of a surround-sound field adjustmen system based on loudspeakers arrangement Ak 1 1 3 Research of a surround-sound field adjustmen system based on loudspeakers arrangement Akiyama Daichi 1 Kanai Hideaki 1 Abstract: In this paper, we propose a presentation method that does not depend

More information

, (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,, i

, (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,, i 25 Estimation scheme of indoor positioning using difference of times which chirp signals arrive 114348 214 3 6 , (GPS: Global Positioning Systemg),.,, (LBS: Local Based Services).. GPS,.,. RFID LAN,.,.,.,,,.,..,.,.,,,

More information

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing

IPSJ SIG Technical Report Vol.2009-BIO-17 No /5/26 DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing DNA 1 1 DNA DNA DNA DNA Correcting read errors on DNA sequences determined by Pyrosequencing Youhei Namiki 1 and Yutaka Akiyama 1 Pyrosequencing, one of the DNA sequencing technologies, allows us to determine

More information

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information

Vol.54 No (July 2013) [9] [10] [11] [12], [13] 1 Fig. 1 Flowchart of the proposed system. c 2013 Information Vol.54 No.7 1937 1950 (July 2013) 1,a) 2012 11 1, 2013 4 5 1 Similar Sounds Sentences Generator Based on Morphological Analysis Manner and Low Class Words Masaaki Kanakubo 1,a) Received: November 1, 2012,

More information

Key Words: probabilisic scenario earthquake, active fault data, Great Hanshin earthquake, low frequency-high impact earthquake motion, seismic hazard map 3) Cornell, C. A.: Engineering Seismic

More information

,,.,.,,.,.,.,.,,.,..,,,, i

,,.,.,,.,.,.,.,,.,..,,,, i 22 A person recognition using color information 1110372 2011 2 13 ,,.,.,,.,.,.,.,,.,..,,,, i Abstract A person recognition using color information Tatsumo HOJI Recently, for the purpose of collection of

More information

UWB a) Accuracy of Relative Distance Measurement with Ultra Wideband System Yuichiro SHIMIZU a) and Yukitoshi SANADA (Ultra Wideband; UWB) UWB GHz DLL

UWB a) Accuracy of Relative Distance Measurement with Ultra Wideband System Yuichiro SHIMIZU a) and Yukitoshi SANADA (Ultra Wideband; UWB) UWB GHz DLL UWB a) Accuracy of Relative Distance Measurement with Ultra Wideband System Yuichiro SHIMIZU a) and Yukitoshi SANADA (Ultra Wideband; UWB) UWB GHz DLL UWB (DLL) UWB DLL 1. UWB FCC (Federal Communications

More information

kubostat2015e p.2 how to specify Poisson regression model, a GLM GLM how to specify model, a GLM GLM logistic probability distribution Poisson distrib

kubostat2015e p.2 how to specify Poisson regression model, a GLM GLM how to specify model, a GLM GLM logistic probability distribution Poisson distrib kubostat2015e p.1 I 2015 (e) GLM kubo@ees.hokudai.ac.jp http://goo.gl/76c4i 2015 07 22 2015 07 21 16:26 kubostat2015e (http://goo.gl/76c4i) 2015 (e) 2015 07 22 1 / 42 1 N k 2 binomial distribution logit

More information

Hz

Hz ( ) 2006 1 3 3 3 4 10 Hz 1 1 1.1.................................... 1 1.2.................................... 1 2 2 2.1.................................... 2 2.2.................................... 3

More information

JFE.dvi

JFE.dvi ,, Department of Civil Engineering, Chuo University Kasuga 1-13-27, Bunkyo-ku, Tokyo 112 8551, JAPAN E-mail : atsu1005@kc.chuo-u.ac.jp E-mail : kawa@civil.chuo-u.ac.jp SATO KOGYO CO., LTD. 12-20, Nihonbashi-Honcho

More information

"CAS を利用した Single Sign On 環境の構築"

CAS を利用した Single Sign On 環境の構築 CAS 2 Single Sign On 1,3, 2,3, 2, 2,3 1 2 3 May 31, 2007 ITRC p. 1/29 Plan of Talk Brief survey of Single Sign On using CAS Brief survey of Authorization Environment using CAS 2 Summary May 31, 2007 ITRC

More information

Clustering in Time and Periodicity of Strong Earthquakes in Tokyo Masami OKADA Kobe Marine Observatory (Received on March 30, 1977) The clustering in time and periodicity of earthquake occurrence are investigated

More information

75 unit: mm Fig. Structure of model three-phase stacked transformer cores (a) Alternate-lap joint (b) Step-lap joint 3 4)

75 unit: mm Fig. Structure of model three-phase stacked transformer cores (a) Alternate-lap joint (b) Step-lap joint 3 4) 3 * 35 (3), 7 Analysis of Local Magnetic Properties and Acoustic Noise in Three-Phase Stacked Transformer Core Model Masayoshi Ishida Kenichi Sadahiro Seiji Okabe 3.7 T 5 Hz..4 3 Synopsis: Methods of local

More information

2 ( ) i

2 ( ) i 25 Study on Rating System in Multi-player Games with Imperfect Information 1165069 2014 2 28 2 ( ) i ii Abstract Study on Rating System in Multi-player Games with Imperfect Information Shigehiko MORITA

More information

XFEL/SPring-8

XFEL/SPring-8 DEVELOPMENT STATUS OF RF SYSTEM OF INJECTOR SECTION FOR XFEL/SPRING-8 Takao Asaka 1,A), Takahiro Inagaki B), Hiroyasu Ego A), Toshiaki Kobayashi A), Kazuaki Togawa B), Shinsuke Suzuki A), Yuji Otake B),

More information

kubostat2017c p (c) Poisson regression, a generalized linear model (GLM) : :

kubostat2017c p (c) Poisson regression, a generalized linear model (GLM) : : kubostat2017c p.1 2017 (c), a generalized linear model (GLM) : kubo@ees.hokudai.ac.jp http://goo.gl/76c4i 2017 11 14 : 2017 11 07 15:43 kubostat2017c (http://goo.gl/76c4i) 2017 (c) 2017 11 14 1 / 47 agenda

More information

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N

Vol. 42 No MUC-6 6) 90% 2) MUC-6 MET-1 7),8) 7 90% 1 MUC IREX-NE 9) 10),11) 1) MUCMET 12) IREX-NE 13) ARPA 1987 MUC 1992 TREC IREX-N Vol. 42 No. 6 June 2001 IREX-NE F 83.86 A Japanese Named Entity Extraction System Based on Building a Large-scale and High-quality Dictionary and Pattern-matching Rules Yoshikazu Takemoto, Toshikazu Fukushima

More information

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and

IPSJ SIG Technical Report Vol.2012-MUS-96 No /8/10 MIDI Modeling Performance Indeterminacies for Polyphonic Midi Score Following and MIDI 1 2 3 2 1 Modeling Performance Indeterminacies for Polyphonic Midi Score Following and Its Application to Automatic Accompaniment Nakamura Eita 1 Yamamoto Ryuichi 2 Saito Yasuyuki 3 Sako Shinji 2

More information

Study on Application of the cos a Method to Neutron Stress Measurement Toshihiko SASAKI*3 and Yukio HIROSE Department of Materials Science and Enginee

Study on Application of the cos a Method to Neutron Stress Measurement Toshihiko SASAKI*3 and Yukio HIROSE Department of Materials Science and Enginee Study on Application of the cos a Method to Neutron Stress Measurement Toshihiko SASAKI*3 and Yukio HIROSE Department of Materials Science and Engineering, Kanazawa University, Kakuma-machi, Kanazawa-shi,

More information

Introduction Purpose This training course describes the configuration and session features of the High-performance Embedded Workshop (HEW), a key tool

Introduction Purpose This training course describes the configuration and session features of the High-performance Embedded Workshop (HEW), a key tool Introduction Purpose This training course describes the configuration and session features of the High-performance Embedded Workshop (HEW), a key tool for developing software for embedded systems that

More information

スライド 1

スライド 1 CMOS : swk(at)ic.is.tohoku.ac.jp [ 2003] [Wong1999] 2 : CCD CMOS 3 : CCD Q Q V 4 : CMOS V C 5 6 CMOS light input photon shot noise α quantum efficiency dark current dark current shot noise dt time integration

More information

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing

[2] OCR [3], [4] [5] [6] [4], [7] [8], [9] 1 [10] Fig. 1 Current arrangement and size of ruby. 2 Fig. 2 Typography combined with printing 1,a) 1,b) 1,c) 2012 11 8 2012 12 18, 2013 1 27 WEB Ruby Removal Filters Using Genetic Programming for Early-modern Japanese Printed Books Taeka Awazu 1,a) Masami Takata 1,b) Kazuki Joe 1,c) Received: November

More information

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe

IPSJ SIG Technical Report Vol.2009-DPS-141 No.20 Vol.2009-GN-73 No.20 Vol.2009-EIP-46 No /11/27 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Spe 1. MIERUKEN 1 2 MIERUKEN MIERUKEN MIERUKEN: Speech Visualization System Based on Augmented Reality Yuichiro Nagano 1 and Takashi Yoshino 2 As the spread of the Augmented Reality(AR) technology and service,

More information

Bull. of Nippon Sport Sci. Univ. 47 (1) Devising musical expression in teaching methods for elementary music An attempt at shared teaching

Bull. of Nippon Sport Sci. Univ. 47 (1) Devising musical expression in teaching methods for elementary music An attempt at shared teaching Bull. of Nippon Sport Sci. Univ. 47 (1) 45 70 2017 Devising musical expression in teaching methods for elementary music An attempt at shared teaching materials for singing and arrangements for piano accompaniment

More information

main.dvi

main.dvi FDTD S A Study on FDTD Analysis based on S-Parameter 18 2 7 04GD168 FDTD FDTD S S FDTD S S S S FDTD FDTD i 1 1 1.1 FDTD.................................... 1 1.2 FDTD..................... 3 2 S 5 2.1 FDTD

More information

(check matrices and minimum distances) H : a check matrix of C the minimum distance d = (the minimum # of column vectors of H which are linearly depen

(check matrices and minimum distances) H : a check matrix of C the minimum distance d = (the minimum # of column vectors of H which are linearly depen Hamming (Hamming codes) c 1 # of the lines in F q c through the origin n = qc 1 q 1 Choose a direction vector h i for each line. No two vectors are colinear. A linearly dependent system of h i s consists

More information

GSP_SITA2017_web.key

GSP_SITA2017_web.key ytnk@cc.tuat.ac.jp 25 DFT spectrum 2 15 1 5 1 2 3 Frequency index 4 5 25 15 1 DFT spectrum 2 5 1 2 3 Frequency index 4 5 .8 GFT spectrum.6 1.4.2 5 1 15 Graph frequency (eigenvalue) 1 GFT

More information

Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego

Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate catego Computational Semantics 1 category specificity Warrington (1975); Warrington & Shallice (1979, 1984) 2 basic level superiority 3 super-ordinate category preservation 1 / 13 analogy by vector space Figure

More information

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro

& Vol.5 No (Oct. 2015) TV 1,2,a) , Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Ro TV 1,2,a) 1 2 2015 1 26, 2015 5 21 Augmented TV TV AR Augmented Reality 3DCG TV Estimation of TV Screen Position and Rotation Using Mobile Device Hiroyuki Kawakita 1,2,a) Toshio Nakagawa 1 Makoto Sato

More information

impulse_response.dvi

impulse_response.dvi 5 Time Time Level Level Frequency Frequency Fig. 5.1: [1] 2004. [2] P. A. Nelson, S. J. Elliott, Active Noise Control, Academic Press, 1992. [3] M. R. Schroeder, Integrated-impulse method measuring sound

More information

メタルバンドソー

メタルバンドソー Metal Band Saw Blades Tornado Series selection TiCN HSS Co FAX FMX PM VL Selection Chart Selection Chart Solids Selection Teeth 3 note 1) Structurals, Tubing H section steels Light gauge steels Tube 4

More information

<95DB8C9288E397C389C88A E696E6462>

<95DB8C9288E397C389C88A E696E6462> 2011 Vol.60 No.2 p.138 147 Performance of the Japanese long-term care benefit: An International comparison based on OECD health data Mie MORIKAWA[1] Takako TSUTSUI[2] [1]National Institute of Public Health,

More information

ADC121S Bit, ksps, Diff Input, Micro Pwr Sampling ADC (jp)

ADC121S Bit, ksps, Diff Input, Micro Pwr Sampling ADC (jp) ADC121S625 ADC121S625 12-Bit, 50 ksps to 200 ksps, Differential Input, Micro Power Sampling A/D Converter Literature Number: JAJSAB8 ADC121S625 12 50kSPS 200kSPS A/D ADC121S625 50kSPS 200kSPS 12 A/D 500mV

More information

SEISMIC HAZARD ESTIMATION BASED ON ACTIVE FAULT DATA AND HISTORICAL EARTHQUAKE DATA By Hiroyuki KAMEDA and Toshihiko OKUMURA A method is presented for using historical earthquake data and active fault

More information

Fig. 3 Coordinate system and notation Fig. 1 The hydrodynamic force and wave measured system Fig. 2 Apparatus of model testing

Fig. 3 Coordinate system and notation Fig. 1 The hydrodynamic force and wave measured system Fig. 2 Apparatus of model testing The Hydrodynamic Force Acting on the Ship in a Following Sea (1 St Report) Summary by Yutaka Terao, Member Broaching phenomena are most likely to occur in a following sea to relative small and fast craft

More information

通信容量制約を考慮したフィードバック制御 - 電子情報通信学会 情報理論研究会(IT) 若手研究者のための講演会

通信容量制約を考慮したフィードバック制御 -  電子情報通信学会 情報理論研究会(IT)  若手研究者のための講演会 IT 1 2 1 2 27 11 24 15:20 16:05 ( ) 27 11 24 1 / 49 1 1940 Witsenhausen 2 3 ( ) 27 11 24 2 / 49 1940 2 gun director Warren Weaver, NDRC (National Defence Research Committee) Final report D-2 project #2,

More information

IPSJ SIG Technical Report Vol.2014-ARC-213 No.24 Vol.2014-HPC-147 No /12/10 GPU 1,a) 1,b) 1,c) 1,d) GPU GPU Structure Of Array Array Of

IPSJ SIG Technical Report Vol.2014-ARC-213 No.24 Vol.2014-HPC-147 No /12/10 GPU 1,a) 1,b) 1,c) 1,d) GPU GPU Structure Of Array Array Of GPU 1,a) 1,b) 1,c) 1,d) GPU 1 GPU Structure Of Array Array Of Structure 1. MPS(Moving Particle Semi-Implicit) [1] SPH(Smoothed Particle Hydrodynamics) [] DEM(Distinct Element Method)[] [] 1 Tokyo Institute

More information

kiyo5_1-masuzawa.indd

kiyo5_1-masuzawa.indd .pp. A Study on Wind Forecast using Self-Organizing Map FUJIMATSU Seiichiro, SUMI Yasuaki, UETA Takuya, KOBAYASHI Asuka, TSUKUTANI Takao, FUKUI Yutaka SOM SOM Elman SOM SOM Elman SOM Abstract : Now a small

More information

数値計算:フーリエ変換

数値計算:フーリエ変換 ( ) 1 / 72 1 8 2 3 4 ( ) 2 / 72 ( ) 3 / 72 ( ) 4 / 72 ( ) 5 / 72 sample.m Fs = 1000; T = 1/Fs; L = 1000; t = (0:L-1)*T; % Sampling frequency % Sample time % Length of signal % Time vector y=1+0.7*sin(2*pi*50*t)+sin(2*pi*120*t)+2*randn(size(t));

More information

OPA134/2134/4134('98.03)

OPA134/2134/4134('98.03) OPA OPA OPA OPA OPA OPA OPA OPA OPA TM µ Ω ± ± ± ± + OPA OPA OPA Offset Trim Offset Trim Out A V+ Out A Out D In +In V+ Output In A +In A A B Out B In B In A +In A A D In D +In D V NC V +In B V+ V +In

More information

Abstract This paper concerns with a method of dynamic image cognition. Our image cognition method has two distinguished features. One is that the imag

Abstract This paper concerns with a method of dynamic image cognition. Our image cognition method has two distinguished features. One is that the imag 2004 RGB A STUDY OF RGB COLOR INFORMATION AND ITS APPLICATION 03R3237 Abstract This paper concerns with a method of dynamic image cognition. Our image cognition method has two distinguished features. One

More information

HP cafe HP of A A B of C C Map on N th Floor coupon A cafe coupon B Poster A Poster A Poster B Poster B Case 1 Show HP of each company on a user scree

HP cafe HP of A A B of C C Map on N th Floor coupon A cafe coupon B Poster A Poster A Poster B Poster B Case 1 Show HP of each company on a user scree LAN 1 2 3 2 LAN WiFiTag WiFiTag LAN LAN 100% WiFi Tag An Improved Determination Method with Multiple Access Points for Relative Position Estimation Using Wireless LAN Abstract: We have proposed a WiFiTag

More information

2.R R R R Pan-Tompkins(PT) [8] R 2 SQRS[9] PT Q R WQRS[10] Quad Level Vector(QLV)[11] QRS R Continuous Wavelet Transform(CWT)[12] Mexican hat 4

2.R R R R Pan-Tompkins(PT) [8] R 2 SQRS[9] PT Q R WQRS[10] Quad Level Vector(QLV)[11] QRS R Continuous Wavelet Transform(CWT)[12] Mexican hat 4 G-002 R Database and R-Wave Detecting System for Utilizing ECG Data Takeshi Nagatomo Ikuko Shimizu Takeshi Ikeda Akio Sashima Koichi Kurumatani R R MIT-BIH R 90% 1. R R [1] 2 24 16 Tokyo University of

More information

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE.

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. E-mail: {ytamura,takai,tkato,tm}@vision.kuee.kyoto-u.ac.jp Abstract Current Wave Pattern Analysis for Anomaly

More information

Tornado Series selection SW TiCN HSS Co FAX VL PM

Tornado Series selection SW TiCN HSS Co FAX VL PM Metal Band Saw Blades Tornado Series selection SW TiCN HSS Co FAX VL PM Selection Chart Selection Chart Solids 3 Selection Teeth 4 note 1) Structurals, Tubing H section steels Light gauge steels Tube Products

More information

udc-2.dvi

udc-2.dvi 13 0.5 2 0.5 2 1 15 2001 16 2009 12 18 14 No.39, 2010 8 2009b 2009a Web Web Q&A 2006 2007a20082009 2007b200720082009 20072008 2009 2009 15 1 2 2 2.1 18 21 1 4 2 3 1(a) 1(b) 1(c) 1(d) 1) 18 16 17 21 10

More information

AUTOMATIC MEASUREMENTS OF STREAM FLOW USING FLUVIAL ACOUSTIC TOMOGRAPHY SYSTEM Kiyosi KAWANISI, Arata, KANEKO Noriaki GOHDA and Shinya

AUTOMATIC MEASUREMENTS OF STREAM FLOW USING FLUVIAL ACOUSTIC TOMOGRAPHY SYSTEM Kiyosi KAWANISI, Arata, KANEKO Noriaki GOHDA and Shinya 2010 9 AUTOMATIC MEASUREMENTS OF STREAM FLOW USING FLUVIAL ACOUSTIC TOMOGRAPHY SYSTEM 1 2 3 4 Kiyosi KAWANISI, Arata, KANEKO Noriaki GOHDA and Shinya NIGO 1 739-8527 1-4-1 2 739-8527 1-4-1 3 723-0047 12-2

More information

kubostat7f p GLM! logistic regression as usual? N? GLM GLM doesn t work! GLM!! probabilit distribution binomial distribution : : β + β x i link functi

kubostat7f p GLM! logistic regression as usual? N? GLM GLM doesn t work! GLM!! probabilit distribution binomial distribution : : β + β x i link functi kubostat7f p statistaical models appeared in the class 7 (f) kubo@eeshokudaiacjp https://googl/z9cjy 7 : 7 : The development of linear models Hierarchical Baesian Model Be more flexible Generalized Linear

More information

11) 13) 11),12) 13) Y c Z c Image plane Y m iy O m Z m Marker coordinate system T, d X m f O c X c Camera coordinate system 1 Coordinates and problem

11) 13) 11),12) 13) Y c Z c Image plane Y m iy O m Z m Marker coordinate system T, d X m f O c X c Camera coordinate system 1 Coordinates and problem 1 1 1 Posture Esimation by Using 2-D Fourier Transform Yuya Ono, 1 Yoshio Iwai 1 and Hiroshi Ishiguro 1 Recently, research fields of augmented reality and robot navigation are actively investigated. Estimating

More information