메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2494-2498

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech

Author keywords

HMM based speech synthesis; Integrative model; Mel cepstral analysis

Indexed keywords

FEATURE EXTRACTION; SAMPLING; SPEECH SYNTHESIS;

EID: 84910069658     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (19)

References (28)
  • 4
    • 73649117102 scopus 로고    scopus 로고
    • Joint acoustic and language modeling for speech recognition
    • J. T. Chien and C. H. Chueh, "Joint acoustic and language modeling for speech recognition, " Speech Communication, vol. 52, Issue 3, pp. 223-235, 2010.
    • (2010) Speech Communication , vol.52 , Issue.3 , pp. 223-235
    • Chien, J.T.1    Chueh, C.H.2
  • 5
    • 79959824887 scopus 로고    scopus 로고
    • Improving speech synthesis of machine translation output
    • A. Parlikar, A. Black, and S. Vogel, "Improving speech synthesis of machine translation output, " Proceedings of Interspeech, pp. 194-197, 2010.
    • (2010) Proceedings of Interspeech , pp. 194-197
    • Parlikar, A.1    Black, A.2    Vogel, S.3
  • 6
    • 84861092214 scopus 로고    scopus 로고
    • Impacts of machine translation and speech synthesis on speechto- speech translation
    • K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda, "Impacts of machine translation and speech synthesis on speechto- speech translation, " Speech Communication, vol. 54, Issue 7, pp. 854-866, 2012.
    • (2012) Speech Communication , vol.54 , Issue.7 , pp. 854-866
    • Hashimoto, K.1    Yamagishi, J.2    Byrne, W.3    King, S.4    Tokuda, K.5
  • 7
    • 0036663562 scopus 로고    scopus 로고
    • Efficient integrated response generation from multiple target using weighted finite state transducers
    • I. Bulyko and M. Ostendorf, "Efficient integrated response generation from multiple target using weighted finite state transducers, " Computer Speech and Language, vol. 16, pp. 533-550, 2002.
    • (2002) Computer Speech and Language , vol.16 , pp. 533-550
    • Bulyko, I.1    Ostendorf, M.2
  • 8
    • 70450158623 scopus 로고    scopus 로고
    • Reranking realizations by predicted synthesis quality
    • C. Nakatsu and M. White, "Reranking realizations by predicted synthesis quality, " Proceedings of ACL, pp. 1113-1120, 2006.
    • (2006) Proceedings of ACL , pp. 1113-1120
    • Nakatsu, C.1    White, M.2
  • 9
    • 70450163425 scopus 로고    scopus 로고
    • Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems
    • C. Boidin, V. Rieser, L. Plas, O. Lemon, and J. Chevelu, "Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems, " Proceedings of Interspeech, pp. 2487-2490, 2009.
    • (2009) Proceedings of Interspeech , pp. 2487-2490
    • Boidin, C.1    Rieser, V.2    Plas, L.3    Lemon, O.4    Chevelu, J.5
  • 10
    • 84890493635 scopus 로고    scopus 로고
    • Integration of acoustic modeling and mel-cepstral analysis for HMMbased speech synthesis
    • K. Nakamura, K. Hashimoto, Y. Nankaku, and K. Tokuda, "Integration of acoustic modeling and mel-cepstral analysis for HMMbased speech synthesis, " Proceedings of ICASSP, pp. 7883-7887, 2013.
    • (2013) Proceedings of ICASSP , pp. 7883-7887
    • Nakamura, K.1    Hashimoto, K.2    Nankaku, Y.3    Tokuda, K.4
  • 11
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech, " Proceedings of ICASSP, vol. 1, pp. 137-140, 1992.
    • (1992) Proceedings of ICASSP , vol.1 , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 12
    • 85131821539 scopus 로고
    • Mel-generated cepstral analysis - A unified approach to speech spectral estimation
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generated cepstral analysis - A unified approach to speech spectral estimation, " Proceedings of ICSLP, pp. 1043-1045, 1994.
    • (1994) Proceedings of ICSLP , pp. 1043-1045
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4
  • 13
    • 0003323711 scopus 로고
    • Unbiased estimator of log spectrum and its application to speech signal processing
    • S. Imai and C. Furuichi, "Unbiased estimator of log spectrum and its application to speech signal processing, " Proceedings of EURASIP, pp. 203-206, 1988.
    • (1988) Proceedings of EURASIP , pp. 203-206
    • Imai, S.1    Furuichi, C.2
  • 15
    • 0009553788 scopus 로고
    • A statistical method for estimation of speech spectral density and formant frequencies
    • (Japanese Edition), Jan. Translation: R.W. Schafer and J.D. Markel, eds. Speech Analysis, 295-302, IEEE Press, New York, 1979
    • F. Itakura and S. Saito, "A statistical method for estimation of speech spectral density and formant frequencies, " IECE Transactions on Fundamentals (Japanese Edition), vol.J53-A, no.1, pp35- 42, Jan. 1970. Translation: R.W. Schafer and J.D. Markel, eds., Speech Analysis, pp.295-302, IEEE Press, New York, 1979.
    • (1970) IECE Transactions on Fundamentals , vol.J53-A , Issue.1 , pp. 35-42
    • Itakura, F.1    Saito, S.2
  • 16
    • 0000306505 scopus 로고
    • Mel log spectral approximation filter for speech synthesis
    • (Japanese Edition), Feb
    • S. Imai, K. Sumita, and C. Furuichi, "Mel log spectral approximation filter for speech synthesis, " IECE Translations on Fundamentals (Japanese Edition), vol. J66-A, pp. 122-129, Feb. 1983.
    • (1983) IECE Translations on Fundamentals , vol.J66-A , pp. 122-129
    • Imai, S.1    Sumita, K.2    Furuichi, C.3
  • 18
    • 84972512635 scopus 로고
    • Memoir on the probability of the causes of events
    • P. S. Laplace, "Memoir on the probability of the causes of events, " Statistical Science, pp. 364-378, 1986.
    • (1986) Statistical Science , pp. 364-378
    • Laplace, P.S.1
  • 20
    • 0032029288 scopus 로고    scopus 로고
    • Deterministic annealing EM algorithm
    • Mar
    • N. Ueda, R. Nakano, "Deterministic annealing EM algorithm, " Neural Networks, vol.11, pp.271-282, Mar. 1998.
    • (1998) Neural Networks , vol.11 , pp. 271-282
    • Ueda, N.1    Nakano, R.2
  • 21
    • 0033692729 scopus 로고    scopus 로고
    • Narrowband to wideband conversion of speech using GMM based transformation
    • K.-H. Park and H. S. Kim, "Narrowband to wideband conversion of speech using GMM based transformation, " Proceedings of ICASSP, vol. 3, pp. 1843-1846, 2000.
    • (2000) Proceedings of ICASSP , vol.3 , pp. 1843-1846
    • Park, K.-H.1    Kim, H.S.2
  • 22
    • 78149261566 scopus 로고    scopus 로고
    • Bandwidth extension of cellular phone speech based on maximum likelihood estimation with GMM
    • W. Fujitsuru, H. Sekimoto, T. Toda, H. Saruwatari, and K. Shikano, "Bandwidth extension of cellular phone speech based on maximum likelihood estimation with GMM, " Proceedings of NCSP, pp. 283-286, 2008.
    • (2008) Proceedings of NCSP , pp. 283-286
    • Fujitsuru, W.1    Sekimoto, H.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 25
    • 0032678076 scopus 로고    scopus 로고
    • Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling, " Proceedings of ICASSP, pp. 229-232, 1999.
    • (1999) Proceedings of ICASSP , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 26
    • 0025419316 scopus 로고
    • Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
    • K. F. Lee, "Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, no. 4, pp. 599-609, 1990.
    • (1990) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.38 , Issue.4 , pp. 599-609
    • Lee, K.F.1
  • 28
    • 85135145174 scopus 로고    scopus 로고
    • Acoustic modeling based on the MDL criterion for speech recognition
    • K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition, " Proceedings of Eurospeech, pp. 99-102, 1997.
    • (1997) Proceedings of Eurospeech , pp. 99-102
    • Shinoda, K.1    Watanabe, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.