SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 2494-2498

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech

(5) Nakamura, Kazuhiro a Hashimoto, Kei a Oura, Keiichiro a Nankaku, Yoshihiko a Tokuda, Keiichi a

a NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

HMM based speech synthesis; Integrative model; Mel cepstral analysis

Indexed keywords

FEATURE EXTRACTION; SAMPLING; SPEECH SYNTHESIS;

CEPSTRAL ANALYSIS; CEPSTRAL COEFFICIENTS; HIGH FREQUENCY COMPONENTS; HIGH SAMPLING RATES; HMM-BASED SPEECH SYNTHESIS; INTEGRATIVE MODELING; OBJECTIVE FUNCTIONS; STATISTICAL APPROACH;

SPEECH COMMUNICATION;

EID: 84910069658 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (19)

References (28)

1
- 0029725605
- Speech synthesis from HMMs using dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi and, S. Imai, "Speech synthesis from HMMs using dynamic features, " Proceedings of ICASSP, pp. 389-392, 1996.
- (1996) Proceedings of ICASSP , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

2
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " Proceedings of Eurospeech, pp. 2347-2350, 1999.
- (1999) Proceedings of Eurospeech , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " Proceedings of ICASSP, pp. 1315-1318, 2000.
- (2000) Proceedings of ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

4
- 73649117102
- Joint acoustic and language modeling for speech recognition
- J. T. Chien and C. H. Chueh, "Joint acoustic and language modeling for speech recognition, " Speech Communication, vol. 52, Issue 3, pp. 223-235, 2010.
- (2010) Speech Communication , vol.52 , Issue.3 , pp. 223-235
- Chien, J.T.¹ Chueh, C.H.²

5
- 79959824887
- Improving speech synthesis of machine translation output
- A. Parlikar, A. Black, and S. Vogel, "Improving speech synthesis of machine translation output, " Proceedings of Interspeech, pp. 194-197, 2010.
- (2010) Proceedings of Interspeech , pp. 194-197
- Parlikar, A.¹ Black, A.² Vogel, S.³

6
- 84861092214
- Impacts of machine translation and speech synthesis on speechto- speech translation
- K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda, "Impacts of machine translation and speech synthesis on speechto- speech translation, " Speech Communication, vol. 54, Issue 7, pp. 854-866, 2012.
- (2012) Speech Communication , vol.54 , Issue.7 , pp. 854-866
- Hashimoto, K.¹ Yamagishi, J.² Byrne, W.³ King, S.⁴ Tokuda, K.⁵

7
- 0036663562
- Efficient integrated response generation from multiple target using weighted finite state transducers
- I. Bulyko and M. Ostendorf, "Efficient integrated response generation from multiple target using weighted finite state transducers, " Computer Speech and Language, vol. 16, pp. 533-550, 2002.
- (2002) Computer Speech and Language , vol.16 , pp. 533-550
- Bulyko, I.¹ Ostendorf, M.²

8
- 70450158623
- Reranking realizations by predicted synthesis quality
- C. Nakatsu and M. White, "Reranking realizations by predicted synthesis quality, " Proceedings of ACL, pp. 1113-1120, 2006.
- (2006) Proceedings of ACL , pp. 1113-1120
- Nakatsu, C.¹ White, M.²

9
- 70450163425
- Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems
- C. Boidin, V. Rieser, L. Plas, O. Lemon, and J. Chevelu, "Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems, " Proceedings of Interspeech, pp. 2487-2490, 2009.
- (2009) Proceedings of Interspeech , pp. 2487-2490
- Boidin, C.¹ Rieser, V.² Plas, L.³ Lemon, O.⁴ Chevelu, J.⁵

10
- 84890493635
- Integration of acoustic modeling and mel-cepstral analysis for HMMbased speech synthesis
- K. Nakamura, K. Hashimoto, Y. Nankaku, and K. Tokuda, "Integration of acoustic modeling and mel-cepstral analysis for HMMbased speech synthesis, " Proceedings of ICASSP, pp. 7883-7887, 2013.
- (2013) Proceedings of ICASSP , pp. 7883-7887
- Nakamura, K.¹ Hashimoto, K.² Nankaku, Y.³ Tokuda, K.⁴

11
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech, " Proceedings of ICASSP, vol. 1, pp. 137-140, 1992.
- (1992) Proceedings of ICASSP , vol.1 , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

12
- 85131821539
- Mel-generated cepstral analysis - A unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generated cepstral analysis - A unified approach to speech spectral estimation, " Proceedings of ICSLP, pp. 1043-1045, 1994.
- (1994) Proceedings of ICSLP , pp. 1043-1045
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

13
- 0003323711
- Unbiased estimator of log spectrum and its application to speech signal processing
- S. Imai and C. Furuichi, "Unbiased estimator of log spectrum and its application to speech signal processing, " Proceedings of EURASIP, pp. 203-206, 1988.
- (1988) Proceedings of EURASIP , pp. 203-206
- Imai, S.¹ Furuichi, C.²

14
- 0004108066
- New York: Springer- Verlag
- K. Dzhaparidze, "Parameter estimation and hypothesis testing in spectral analysis of stationary time series, " New York: Springer- Verlag, 1986.
- (1986) Parameter Estimation and Hypothesis Testing in Spectral Analysis of Stationary Time Series
- Dzhaparidze, K.¹

15
- 0009553788
- A statistical method for estimation of speech spectral density and formant frequencies
- (Japanese Edition), Jan. Translation: R.W. Schafer and J.D. Markel, eds. Speech Analysis, 295-302, IEEE Press, New York, 1979
- F. Itakura and S. Saito, "A statistical method for estimation of speech spectral density and formant frequencies, " IECE Transactions on Fundamentals (Japanese Edition), vol.J53-A, no.1, pp35- 42, Jan. 1970. Translation: R.W. Schafer and J.D. Markel, eds., Speech Analysis, pp.295-302, IEEE Press, New York, 1979.
- (1970) IECE Transactions on Fundamentals , vol.J53-A , Issue.1 , pp. 35-42
- Itakura, F.¹ Saito, S.²

16
- 0000306505
- Mel log spectral approximation filter for speech synthesis
- (Japanese Edition), Feb
- S. Imai, K. Sumita, and C. Furuichi, "Mel log spectral approximation filter for speech synthesis, " IECE Translations on Fundamentals (Japanese Edition), vol. J66-A, pp. 122-129, Feb. 1983.
- (1983) IECE Translations on Fundamentals , vol.J66-A , pp. 122-129
- Imai, S.¹ Sumita, K.² Furuichi, C.³

17
- 0002629270
- Maximumlikelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximumlikelihood from incomplete data via the EM algorithm, " J. Royal Statist. Soc., Ser. B, 39, pp. 1-38, 1977.
- (1977) J. Royal Statist. Soc., Ser. B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

18
- 84972512635
- Memoir on the probability of the causes of events
- P. S. Laplace, "Memoir on the probability of the causes of events, " Statistical Science, pp. 364-378, 1986.
- (1986) Statistical Science , pp. 364-378
- Laplace, P.S.¹

19
- 0004109478
- Rprop - Description and implementation details
- M. Riedmiller, "Rprop - Description and implementation details, " Technical Report, University of Karlsruhe, 1994.
- (1994) Technical Report, University of Karlsruhe
- Riedmiller, M.¹

20
- 0032029288
- Deterministic annealing EM algorithm
- Mar
- N. Ueda, R. Nakano, "Deterministic annealing EM algorithm, " Neural Networks, vol.11, pp.271-282, Mar. 1998.
- (1998) Neural Networks , vol.11 , pp. 271-282
- Ueda, N.¹ Nakano, R.²

21
- 0033692729
- Narrowband to wideband conversion of speech using GMM based transformation
- K.-H. Park and H. S. Kim, "Narrowband to wideband conversion of speech using GMM based transformation, " Proceedings of ICASSP, vol. 3, pp. 1843-1846, 2000.
- (2000) Proceedings of ICASSP , vol.3 , pp. 1843-1846
- Park, K.-H.¹ Kim, H.S.²

22
- 78149261566
- Bandwidth extension of cellular phone speech based on maximum likelihood estimation with GMM
- W. Fujitsuru, H. Sekimoto, T. Toda, H. Saruwatari, and K. Shikano, "Bandwidth extension of cellular phone speech based on maximum likelihood estimation with GMM, " Proceedings of NCSP, pp. 283-286, 2008.
- (2008) Proceedings of NCSP , pp. 283-286
- Fujitsuru, W.¹ Sekimoto, H.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

23
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kuramatsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kawabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, pp. 357-363, 1990.
- (1990) Speech Communication , vol.9 , pp. 357-363
- Kuramatsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kawabara, H.⁵ Shikano, K.⁶

24
- 84878381939
- "Speech Signal Processing Toolkit (SPTK), " http://sptk.sourceforge.net/.
- Speech Signal Processing Toolkit (SPTK)

25
- 0032678076
- Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling, " Proceedings of ICASSP, pp. 229-232, 1999.
- (1999) Proceedings of ICASSP , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

26
- 0025419316
- Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
- K. F. Lee, "Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, no. 4, pp. 599-609, 1990.
- (1990) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.38 , Issue.4 , pp. 599-609
- Lee, K.F.¹

27
- 0002144369
- Tree-based state tying for high accuracy acoustic modelling
- S. Young, J. J. Odell, and P.Woodland, "Tree-based state tying for high accuracy acoustic modelling, " Proceedings of ARPA Workshop on Human Language Technology, pp. 307-312, 1994.
- (1994) Proceedings of ARPA Workshop on Human Language Technology , pp. 307-312
- Young, S.¹ Odell, J.J.² Woodland, P.³

28
- 85135145174
- Acoustic modeling based on the MDL criterion for speech recognition
- K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition, " Proceedings of Eurospeech, pp. 99-102, 1997.
- (1997) Proceedings of Eurospeech , pp. 99-102
- Shinoda, K.¹ Watanabe, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.