



Volume E91-D, Issue 11, 2008, Pages 2693-2700

A fully consistent hidden semi-Markov model-based speech recognition system

Author keywords

Hidden Markov model; Hidden semi Markov model; Speech recognition; Weighted finite state transducer

Indexed keywords

APPROXIMATION ALGORITHMS; CONTINUOUS SPEECH RECOGNITION; DEEP NEURAL NETWORKS; HIDDEN MARKOV MODELS; MARKOV PROCESSES; PROBABILITY DISTRIBUTIONS; SPEECH; SPEECH PROCESSING; TRANSDUCERS; TRANSLATION (LANGUAGES)

EID: 68749108220     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1093/ietisy/e91-d.11.2693     Document Type: Article
Times cited: 11

References (16)
  • 2
    • 0022685753
    • Continuously variable duration hidden Markov models for automatic speech recognition
    • S.E. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol.1, pp.29-45, 1986.
    • (1986) Comput. Speech Lang , vol.1 , pp. 29-45
    • Levinson, S.E.1
  • 3
    • 44449177634
    • A hidden semi-Markov model-based speech synthesis system
    • H. Zen, K. Tokuda, T. Masuko, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. & Syst., vol.E90-D, no.5, pp.825-834, May 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 825-834
    • Zen, H.1    Tokuda, K.2    Masuko, T.3    Kitamura, T.4
  • 5
    • 33846442604
    • Investigation of state duration model based gamma distribution for HMM-based speech synthesis
    • Y. Ishimatsu, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Investigation of state duration model based gamma distribution for HMM-based speech synthesis," IEICE Technical Report, vol.101, no.352, pp.57-62, 2001.
    • (2001) IEICE Technical Report , vol.101 , Issue.352 , pp. 57-62
    • Ishimatsu, Y.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 6
    • 0023214109
    • Experimental evaluation of duration modeling techniques for automatic speech recognition
    • M.J. Russell and A.E. Cook, "Experimental evaluation of duration modeling techniques for automatic speech recognition," Proc. ICASSP1987, vol.1, pp.2376-2379, 1987.
    • (1987) Proc. ICASSP1987 , vol.1 , pp. 2376-2379
    • Russell, M.J.1    Cook, A.E.2
  • 7
    • 85009126465
    • Context dependent phoneme duration modeling with tree-based state tying
    • M. Wan. Koo, S.J. Park, and D.Y. Son, "Context dependent phoneme duration modeling with tree-based state tying," Proc. INTERSPEECH2004, vol.1, pp.721-724, 2004.
    • (2004) Proc. INTERSPEECH2004 , vol.1 , pp. 721-724
    • Koo, M.W.1    Park, S.J.2    Son, D.Y.3
  • 8
    • 85009090132
    • Modeling word duration
    • V.R.R. Gadde, "Modeling word duration," Proc. ICSLP2000, vol.1, pp.601-604, 2000.
    • (2000) Proc. ICSLP2000 , vol.1 , pp. 601-604
    • Gadde, V.R.R.1
  • 9
    • 85009139544
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," Proc. EUROSPEECH, vol.5, pp.2347-2350, 1999.
    • (1999) Proc. EUROSPEECH , vol.5 , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 11
    • 0025419316
    • Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
    • K.F. Lee, "Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition," IEEE Trans. Acoust. Speech Signal Process., vol.38, no.4, pp.599-609, 1990.
    • (1990) IEEE Trans. Acoust. Speech Signal Process , vol.38 , Issue.4 , pp. 599-609
    • Lee, K.F.1
  • 12
    • 0002824030
    • Weighted finite-state transducers in speech recognition
    • M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Proc. ASR2000, pp.97-106, 2000.
    • (2000) Proc. ASR2000 , pp. 97-106
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 13
    • 0141591502
    • Generalized optimization algorithm for speech recognition transducers
    • C. Allauzen and M. Mohri, "Generalized optimization algorithm for speech recognition transducers," Proc. ICASSP2003, vol.1, pp.352-355, 2003.
    • (2003) Proc. ICASSP2003 , vol.1 , pp. 352-355
    • Allauzen, C.1    Mohri, M.2
  • 14
    • 85135145174
    • Acoustic modeling based on the MDL criterion for speech recognition
    • K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition," Proc. EUROSPEECH, vol.1, pp.99-102, 1997.
    • (1997) Proc. EUROSPEECH , vol.1 , pp. 99-102
    • Shinoda, K.1    Watanabe, T.2
  • 15
    • 0037278070
    • An efficient forward-backward algorithm for an explicit-duration hidden Markov model
    • S.Z. Yu and H. Kobayashi, "An efficient forward-backward algorithm for an explicit-duration hidden Markov model," IEEE Signal Process. Lett., vol.10, no.1, pp.11-14, 2003.
    • (2003) IEEE Signal Process. Lett , vol.10 , Issue.1 , pp. 11-14
    • Yu, S.Z.1    Kobayashi, H.2
  • 16
    • 0013232586
    • Statistical methods for comparing pattern recognition algorithms and comments on evaluating speech recognition performance
    • S. Nakagawa and H. Takagi, "Statistical methods for comparing pattern recognition algorithms and comments on evaluating speech recognition performance," J. Acoust. Soc. Jpn., vol.50, no.10, pp.849-854, 1994.
    • (1994) J. Acoust. Soc. Jpn , vol.50 , Issue.10 , pp. 849-854
    • Nakagawa, S.1    Takagi, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.