SCOPUS 정보 검색 플랫폼

IEICE Transactions on Information and Systems

Volumn E91-D, Issue 11, 2008, Pages 2693-2700

A fully consistent hidden semi-markov model-based speech recognition system

(5) Oura, Keiichiro a Zen, Heiga a Nankaku, Yoshihiko a Lee, Akinobu a Tokuda, Keiichi a

a NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

Hidden Markov model; Hidden semi Markov model; Speech recognition; Weighted finite state transducer

Indexed keywords

APPROXIMATION ALGORITHMS; CONTINUOUS SPEECH RECOGNITION; DEEP NEURAL NETWORKS; HIDDEN MARKOV MODELS; MARKOV PROCESSES; PROBABILITY DISTRIBUTIONS; SPEECH; SPEECH PROCESSING; TRANSDUCERS; TRANSLATION (LANGUAGES);

CONTEXT-DEPENDENT DURATION MODELING; GENERALIZED FORWARD-BACKWARD; HIDDEN SEMI-MARKOV MODELING; HIDDEN SEMI-MARKOV MODELS; SPEAKER DEPENDENTS; SPEECH RECOGNITION SYSTEMS; TEMPORAL STRUCTURES; WEIGHTED FINITE-STATE TRANSDUCERS;

SPEECH RECOGNITION;

EID: 68749108220 PISSN: 09168532 EISSN: 17451361 Source Type: Journal
DOI: 10.1093/ietisy/e91-d.11.2693 Document Type: Article

Times cited : (11)

References (16)

1
- 0030245363
- From HMMs to segment models
- M. Ostendorf, V. Digalakis, and O. Kimball, "From HMMs to segment models," IEEE Trans. Speech Audio Process., vol.4, no.5, pp.360-378, 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.5 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.² Kimball, O.³

2
- 0022685753
- Continuously variable duration hidden Markov models for automatic speech recognition
- S.E. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol.1, pp.29-45, 1986.
- (1986) Comput. Speech Lang , vol.1 , pp. 29-45
- Levinson, S.E.¹

3
- 44449177634
- A hidden semi-Markov model-based speech synthesis system
- May
- H. Zen, K. Tokuda, T. Masuko, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. & Syst., vol.E90-D, no.5, pp.825-834, May 2007.
- (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kitamura, T.⁴

4
- 0002585974
- Variable duration models for speech
- J. Ferguson, "Variable duration models for speech," Proc. Symposium on the Application of Hidden Markov Models to Text and Speech, pp.143-179, 1980.
- (1980) Proc. Symposium on the Application of Hidden Markov Models to Text and Speech , pp. 143-179
- Ferguson, J.¹

5
- 33846442604
- Investigation of state duration model based gamma distribution for HMM-based speech synthesis,
- no.352, pp
- Y. Ishimatsu, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Investigation of state duration model based gamma distribution for HMM-based speech synthesis," IEICE Technical Report, vol.101, no.352, pp.57-62, 2001.
- (2001) IEICE Technical Report , vol.101 , pp. 57-62
- Ishimatsu, Y.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

6
- 0023214109
- Experimental evaluation of duration modeling techniques for automatic speech recognition
- M.J. Russell and A.E. Cook, "Experimental evaluation of duration modeling techniques for automatic speech recognition," Proc. ICASSP1987, vol.1, pp.2376-2379, 1987.
- (1987) Proc. ICASSP1987 , vol.1 , pp. 2376-2379
- Russell, M.J.¹ Cook, A.E.²

7
- 85009126465
- Context dependent phoneme duration modeling with tree-based state tying
- M. Wan. Koo, S.J. Park, and D.Y. Son, "Context dependent phoneme duration modeling with tree-based state tying," Proc. INTERSPEECH2004, vol.1, pp.721-724, 2004.
- (2004) Proc. INTERSPEECH2004 , vol.1 , pp. 721-724
- Wan, M.¹ Koo² Park, S.J.³ Son, D.Y.⁴

8
- 85009090132
- Modeling word duration
- V.R.R. Gadde, "Modeling word duration," Proc. ICSLP2000, vol.1, pp.601-604, 2000.
- (2000) Proc. ICSLP2000 , vol.1 , pp. 601-604
- Gadde, V.R.R.¹

9
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," Proc. EUROSPEECH, vol.5, pp.2347-2350, 1999.
- (1999) Proc. EUROSPEECH , vol.5 , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

10
- 0003805597
- Ph.D. Thesis, Cambridge University
- J. Odell, The use of context in large vocabulary speech recognition, Ph.D. Thesis, Cambridge University, 1995.
- (1995) The use of context in large vocabulary speech recognition
- Odell, J.¹

11
- 0025419316
- Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition
- K.F. Lee, "Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition," IEEE Trans. Acoust. Speech Signal Process., vol.38, no.4, pp.599-609, 1990.
- (1990) IEEE Trans. Acoust. Speech Signal Process , vol.38 , Issue.4 , pp. 599-609
- Lee, K.F.¹

12
- 0002824030
- Weighted finite-state transducers in speech recognition
- M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Proc. ASR2000, pp.97-106, 2000.
- (2000) Proc. ASR2000 , pp. 97-106
- Mohri, M.¹ Pereira, F.² Riley, M.³

13
- 0141591502
- Generalized optimization algorithm for speech recognition transducers
- C. Allauzen and M. Mohri, "Generalized optimization algorithm for speech recognition transducers," Proc. ICASSP2003, vol.1, pp.352-355, 2003.
- (2003) Proc. ICASSP2003 , vol.1 , pp. 352-355
- Allauzen, C.¹ Mohri, M.²

14
- 85135145174
- Acoustic modeling based on the MDL criterion for speech recognition
- K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition," Proc. EUROSPEECH, vol.1, pp.99-102, 1997.
- (1997) Proc. EUROSPEECH , vol.1 , pp. 99-102
- Shinoda, K.¹ Watanabe, T.²

15
- 0037278070
- An efficient forward-backward algorithm for an explicit-duration hidden Markov model
- S.Z. Yu and H. Kobayashi, "An efficient forward-backward algorithm for an explicit-duration hidden Markov model," IEEE Signal Process. Lett., vol.10, no.1, pp.11-14, 2003.
- (2003) IEEE Signal Process. Lett , vol.10 , Issue.1 , pp. 11-14
- Yu, S.Z.¹ Kobayashi, H.²

16
- 0013232586
- Statistical methods for comparing pattern recognition algorithms and comments on evaluating speech recognition performance
- S. Nakagawa and H. Takagi, "Statistical methods for comparing pattern recognition algorithms and comments on evaluating speech recognition performance," J. Acoust. Soc. Jpn., vol.50, no.10, pp.849-854, 1994.
- (1994) J. Acoust. Soc. Jpn , vol.50 , Issue.10 , pp. 849-854
- Nakagawa, S.¹ Takagi, H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.