SCOPUS 정보 검색 플랫폼

Computer Speech and Language

Volumn 17, Issue 2-3, 2003, Pages 233-262

Parameter reduction schemes for loosely coupled HMMs

(2) Nock, H J a Ostendorf, M a

a UNIVERSITY OF WASHINGTON (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES; MATHEMATICAL MODELS; STATE SPACE METHODS; TOPOLOGY;

HIDDEN MARKOV MODELS; MULTIBAND MODELS; PARAMETER REDUCTION;

SPEECH RECOGNITION;

EID: 0038697760 PISSN: 08852308 EISSN: None Source Type: Journal
DOI: 10.1016/S0885-2308(03)00009-3 Document Type: Article

Times cited : (8)

References (56)

1
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- Baum, L.E., Petrie, T., Soules, G., Weiss, N., 1970. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics 41 (1), 164-171.
- (1970) Annals of Mathematical Statistics , vol.41 , Issue.1 , pp. 164-171
- Baum, L.E.¹ Petrie, T.² Soules, G.³ Weiss, N.⁴

2
- 0037841255
- Multi-stream speech recognition
- Technical Report IDIAP-RR 96-07, IDIAP
- Bourlard, H., Dupont, S., Ris, C., 1996. Multi-stream speech recognition. Technical Report IDIAP-RR 96-07, IDIAP.
- (1996)
- Bourlard, H.¹ Dupont, S.² Ris, C.³

3
- 0030685285
- Coupled hidden Markov models for complex action recognition
- Brand, M., Oliver, N., Pentland, A., 1997. Coupled hidden Markov models for complex action recognition. In: Proceedings of IEEE CVPR, pp. 994-999.
- (1997) Proceedings of IEEE CVPR , pp. 994-999
- Brand, M.¹ Oliver, N.² Pentland, A.³

4
- 0031624621
- Pronunciation modelling using a hand-labelled corpus for conversational speech recognition
- Byrne, W., Finke, M., Khudanpur, S., McDonough, J., Nock, H., Riley, M., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Pronunciation modelling using a hand-labelled corpus for conversational speech recognition. In: Proceedings of ICASSP, pp. 313-316.
- (1998) Proceedings of ICASSP , pp. 313-316
- Byrne, W.¹ Finke, M.² Khudanpur, S.³ McDonough, J.⁴ Nock, H.⁵ Riley, M.⁶ Saraclar, M.⁷ Wooters, C.⁸ Zavaliagkos, G.⁹

5
- 0003640523
- The ISOLET spoken letter database
- Technical Report CSE 90-004, OGI
- Cole, R., Muthusamy, Y., Fanty, M., 1990. The ISOLET spoken letter database. Technical Report CSE 90-004, OGI.
- (1990)
- Cole, R.¹ Muthusamy, Y.² Fanty, M.³

6
- 84875738293
- A new approach for multi-band speech recognition based on probabilistic graphical models
- Daoudi, K., Fohr, D., Antoine, C., 2000. A new approach for multi-band speech recognition based on probabilistic graphical models. In: Proceedings of ICSLP, pp. I:329-332.
- (2000) Proceedings of ICSLP , pp. 329-332
- Daoudi, K.¹ Fohr, D.² Antoine, C.³

7
- 85002022542
- Maximum likelihood from incomplete data via the EM algorithm
- Dempster, A.P., Laird, N.M., Rubin, D.B., 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39 (1), 1-38.
- (1977) Journal of the Royal Statistical Society , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

8
- 0026458724
- Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech units
- Deng, L., Erler, K., 1992. Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: comparison with segmental speech units. Journal of the Acoustical Society of America 92 (92), 3058-3067.
- (1992) Journal of the Acoustical Society of America , vol.92 , Issue.92 , pp. 3058-3067
- Deng, L.¹ Erler, K.²

9
- 80053229524
- Modeling and efficient decoding of large vocabulary conversational speech
- Finke, M., Fritsch, J., Koll, D., Waibel, A., 1999. Modeling and efficient decoding of large vocabulary conversational speech. In: Proceedings of Eurospeech, pp. 467-470.
- (1999) Proceedings of Eurospeech , pp. 467-470
- Finke, M.¹ Fritsch, J.² Koll, D.³ Waibel, A.⁴

10
- 85027454087
- Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
- Finke, M., Waibel, A., 1997. Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In: Proceedings of Eurospeech, pp. 2379-2382.
- (1997) Proceedings of Eurospeech , pp. 2379-2382
- Finke, M.¹ Waibel, A.²

11
- 0003671941
- Model-based techniques for noise robust speech recognition
- PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
- Gales, M.J.F., 1995. Model-based techniques for noise robust speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
- (1995)
- Gales, M.J.F.¹

12
- 0031268341
- Factorial hidden Markov models
- Ghahramani, Z., Jordan, M.I., 1997. Factorial hidden Markov models. Machine Learning 29, 245-273.
- (1997) Machine Learning , vol.29 , pp. 245-273
- Ghahramani, Z.¹ Jordan, M.I.²

13
- 0024909979
- Some statistical issues in the comparison of speech recognition algorithms
- Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proceedings of ICASSP, pp. 532-535.
- (1989) Proceedings of ICASSP , pp. 532-535
- Gillick, L.¹ Cox, S.J.²

14
- 0142005771
- Dynamic HMM selection for continuous speech recognition
- Hain, T., Woodland, P.C., 1999. Dynamic HMM selection for continuous speech recognition. In: Proceedings of Eurospeech, pp. 532-535.
- (1999) Proceedings of Eurospeech , pp. 532-535
- Hain, T.¹ Woodland, P.C.²

15
- 85009113591
- Modelling sub-phone insertions and deletions in continuous speech recognition
- Hain, T., Woodland, P.C., 2000. Modelling sub-phone insertions and deletions in continuous speech recognition. In: Proceedings of ICSLP, pp. IV:172-175.
- (2000) Proceedings of ICSLP , pp. 172-175
- Hain, T.¹ Woodland, P.C.²

16
- 0012236195
- The CU-HTK march 2000 Hub5e transcription system
- Hain, T., Woodland, P.C., Evermann, G., Povey, D., 2000. The CU-HTK march 2000 Hub5e transcription system. In: Proceedings of Speech Transcription Workshop.
- (2000) Proceedings of Speech Transcription Workshop
- Hain, T.¹ Woodland, P.C.² Evermann, G.³ Povey, D.⁴

17
- 0030365517
- Towards ASR on partially corrupted speech
- Hermansky, H., Tibrewala, S., Pavel, M., 1996. Towards ASR on partially corrupted speech. In: Proceedings of ICSLP. pp. 462-465.
- (1996) Proceedings of ICSLP , pp. 462-465
- Hermansky, H.¹ Tibrewala, S.² Pavel, M.³

18
- 0005998728
- Word recognition from tiered phonological models
- Huckvale, M.A., 1994. Word recognition from tiered phonological models. In: Proceedings of Institute of Acoustics Conference on Speech and Hearing, vol. 16, pp. 163-170.
- (1994) Proceedings of Institute of Acoustics Conference on Speech and Hearing , vol.16 , pp. 163-170
- Huckvale, M.A.¹

19
- 85032275702
- Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition
- Humphries, J.J., Woodland, P.C., 1997. Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition. In: Proceedings of Eurospeech, pp. 2367-2370.
- (1997) Proceedings of Eurospeech , pp. 2367-2370
- Humphries, J.J.¹ Woodland, P.C.²

20
- 0003857779
- Discriminative training of hidden Markov models
- PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
- Kapadia, S., 1998. Discriminative training of hidden Markov models. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
- (1998)
- Kapadia, S.¹

21
- 4243728511
- Word-level phonetic variation in large speech corpora
- In: Pompino-Marschal, B. (Ed.), ZAS Working Papers in Linguistics
- Keating, P., 1997. Word-level phonetic variation in large speech corpora. In: Pompino-Marschal, B. (Ed.), ZAS Working Papers in Linguistics. Available from
- (1997)
- Keating, P.¹

22
- 79952968027
- Speech recognition via phonetically featured syllables
- King, S., Stephenson, T., Isard, S., Taylor, P., Strachan, A., 1998. Speech recognition via phonetically featured syllables. In: Proceedings of ICSLP, vol. 3, pp. 1031-1034.
- (1998) Proceedings of ICSLP , vol.3 , pp. 1031-1034
- King, S.¹ Stephenson, T.² Isard, S.³ Taylor, P.⁴ Strachan, A.⁵

23
- 0034297586
- Detection of phonological features in continuous speech using neural networks
- King, S., Taylor, P., 2000. Detection of phonological features in continuous speech using neural networks. Computer Speech and Language 14 (4), 333-353.
- (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 333-353
- King, S.¹ Taylor, P.²

24
- 19544365323
- COMLEX pronouncing lexicon (renamed in 1997 release as CALLHOME American English lexicon)
- Available from Linguistic Data Consortium
- Kingsbury, P., Strassel, S., McLemore, C., 1997. COMLEX pronouncing lexicon (renamed in 1997 release as CALLHOME American English lexicon). Available from Linguistic Data Consortium
- (1997)
- Kingsbury, P.¹ Strassel, S.² McLemore, C.³

25
- 0003424928
- Robust speech recognition using articulatory information
- PhD Thesis, University of Bielefeld, Germany
- Kirchhoff, K., 1999. Robust speech recognition using articulatory information. PhD Thesis, University of Bielefeld, Germany.
- (1999)
- Kirchhoff, K.¹

26
- 0030351374
- On designing pronunciation lexicons for large vocabulary, continuous speech recognition
- Lamel, L., Adda, G., 1996. On designing pronunciation lexicons for large vocabulary, continuous speech recognition. In: Proceedings of ICSLP, pp. 6-9.
- (1996) Proceedings of ICSLP , pp. 6-9
- Lamel, L.¹ Adda, G.²

27
- 0004047518
- Claredon, Oxford
- Lauritzen, S.L., 1996. Graphical Models. Claredon, Oxford.
- (1996) Graphical Models
- Lauritzen, S.L.¹

28
- 0021226391
- A database for speaker-independent digit recognition
- Leonard, R. G., 1984. A database for speaker-independent digit recognition. In: Proceedings of ICASSP, pp. 42.11-14.
- (1984) Proceedings of ICASSP , pp. 11-14
- Leonard, R.G.¹

29
- 0018918171
- An algorithm for vector quantizer design
- Linde, Y., Buzo, A., Gray, R.M., 1980. An algorithm for vector quantizer design. IEEE Transactions Communications 28, 84-95.
- (1980) IEEE Transactions Communications , vol.28 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

30
- 0038178805
- Factorial hidden Markov models for speech recognition: Preliminary experiments
- Technical Report 97/7, Cambridge Research Laboratory
- Logan, B., Moreno, P.J., 1997. Factorial hidden Markov models for speech recognition: preliminary experiments. Technical Report 97/7, Cambridge Research Laboratory.
- (1997)
- Logan, B.¹ Moreno, P.J.²

31
- 85009078254
- Asynchrony with trained transition probabilities improves performance in multi-band speech recognition
- Mak, B., Tam, Y.-C., 2000. Asynchrony with trained transition probabilities improves performance in multi-band speech recognition. In: Proceedings of ICSLP, pp. IV:149-152.
- (2000) Proceedings of ICSLP , pp. 149-152
- Mak, B.¹ Tam, Y.-C.²

32
- 0006133268
- Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition
- McMahon, P., McCourt, P., Vaseghi, S., 1998. Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition. In: Proceedings of ICSLP, pp. 1055-1058.
- (1998) Proceedings of ICSLP , pp. 1055-1058
- McMahon, P.¹ McCourt, P.² Vaseghi, S.³

33
- 0004119130
- A multi-band approach to automatic speech recognition
- PhD Thesis, ICSI, UC Berkeley, CA, USA
- Mirghafori, N. 1999. A Multi-band approach to automatic speech recognition. PhD Thesis, ICSI, UC Berkeley, CA, USA.
- (1999)
- Mirghafori, N.¹

34
- 0004052871
- Audio-visual speech recognition
- Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop
- Neti, C., Potamianos, G., Luettin, J., Matthews, I., Herve Gtin, Vergyri, D., Sison, J., Mashari, J., Zhou, J., 2000. Audio-visual speech recognition. Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop.
- (2000)
- Neti, C.¹ Potamianos, G.² Luettin, J.³ Matthews, I.⁴ Herve, G.⁵ Vergyri, D.⁶ Sison, J.⁷ Mashari, J.⁸ Zhou, J.⁹

35
- 85002102457
- NIST. Score package
- NIST. Score package. Available from.

36
- 0003951389
- Techniques for modelling phonological processes in automatic speech recognition
- PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK, August
- Nock, H.J., 2001. Techniques for modelling phonological processes in automatic speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK, August.
- (2001)
- Nock, H.J.¹

37
- 84957551318
- Loosely-coupled HMMs for ASR
- Nock, H.J., Young, S.J., 2000. Loosely-coupled HMMs for ASR. In: Proceedings of ICSLP, pp. III:143-146.
- (2000) Proceedings of ICSLP , pp. 143-146
- Nock, H.J.¹ Young, S.J.²

38
- 0036081023
- Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models
- Nock, H.J., Young, S.J., 2002. Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models. Cognitive Science 26 (3), 283-301.
- (2002) Cognitive Science , vol.26 , Issue.3 , pp. 283-301
- Nock, H.J.¹ Young, S.J.²

39
- 0003805597
- The use of context in large vocabulary speech recognition
- PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
- Odell, J., 1995. The use of context in large vocabulary speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
- (1995)
- Odell, J.¹

40
- 0347307067
- Incorporating linguistic theories of pronunciation variation into speech recognition models
- London, UK
- Ostendorf, M., 2000. Incorporating linguistic theories of pronunciation variation into speech recognition models. In: Philosophical Transactions of Royal Society, vol. 358, London, UK, pp. 1325-1338.
- (2000) Philosophical Transactions of Royal Society , vol.358 , pp. 1325-1338
- Ostendorf, M.¹

41
- 0030715097
- HMM topology design using maximum likelihood successive state splitting
- Ostendorf, M., Singer, H., 1997. HMM topology design using maximum likelihood successive state splitting. Computer Speech and Language 11, 17-41.
- (1997) Computer Speech and Language , vol.11 , pp. 17-41
- Ostendorf, M.¹ Singer, H.²

42
- 0004244302
- Prentice-Hall, New Jersey
- Rabiner, L., Juang, B.-H., 1993. Fundamentals of Speech Recognition. Prentice-Hall, New Jersey.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

43
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Rabiner, L.R., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE 77(2), 257-285.
- (1989) Proceedings of IEEE , vol.77 , Issue.2 , pp. 257-285
- Rabiner, L.R.¹

44
- 78049399146
- Mixture density networks, human articulatory data and acoustic-to-articulatory inversion of continuous speech
- Richmond, K., 2001. Mixture density networks, human articulatory data and acoustic-to-articulatory inversion of continuous speech. In: Proceedings of Institute of Acoustics WISP 2001, Stratford-upon-Avon, UK.
- (2001) Proceedings of Institute of Acoustics WISP 2001, Stratford-Upon-Avon, UK
- Richmond, K.¹

45
- 0026405248
- A statistical model for generating pronunciation networks
- Riley, M.D., 1991. A statistical model for generating pronunciation networks. In: Proceedings of ICASSP, pp. 737-740.
- (1991) Proceedings of ICASSP , pp. 737-740
- Riley, M.D.¹

46
- 0005921215
- Pronunciation modeling for conversational speech recognition
- PhD Thesis, The Johns Hopkins University, MD, USA
- Saraclar, M., 2000. Pronunciation modeling for conversational speech recognition. PhD Thesis, The Johns Hopkins University, MD, USA.
- (2000)
- Saraclar, M.¹

47
- 0000114416
- Pronunciation modeling by sharing Gaussian densities across phonetic models
- Saraclar, M., Nock, H., Khudanpur, S., 2000. Pronunciation modeling by sharing Gaussian densities across phonetic models. Computer Speech and Language 14 (2), 137-160.
- (2000) Computer Speech and Language , vol.14 , Issue.2 , pp. 137-160
- Saraclar, M.¹ Nock, H.² Khudanpur, S.³

48
- 0032596518
- Mixed memory Markov models
- Saul, L. K., Jordan, M.I., 1999. Mixed memory Markov models. Machine Learning 37, 75-87.
- (1999) Machine Learning , vol.37 , pp. 75-87
- Saul, L.K.¹ Jordan, M.I.²

49
- 0029747053
- Integrating audio and visual information to provide highly robust speech recognition
- Tomlinson, M.J., Russell, M.J., Brooke, N.M., 1996. Integrating audio and visual information to provide highly robust speech recognition. In: Proceedings of ICASSP, vol. II, pp. 821-824.
- (1996) Proceedings of ICASSP , vol.2 , pp. 821-824
- Tomlinson, M.J.¹ Russell, M.J.² Brooke, N.M.³

50
- 0030643684
- Modelling asynchrony in speech using elementary single-signal decomposition
- Tomlinson, M.J., Russell, M.J., Moore, R.K., Buckland, A.P., Fawley, M.A. 1997. Modelling asynchrony in speech using elementary single-signal decomposition. In: Proceedings of ICASSP, pp. 1247-1250.
- (1997) Proceedings of ICASSP , pp. 1247-1250
- Tomlinson, M.J.¹ Russell, M.J.² Moore, R.K.³ Buckland, A.P.⁴ Fawley, M.A.⁵

51
- 0025681008
- Hidden Markov model decomposition of speech and noise
- Varga, A.P., Moore, R.K., 1990. Hidden Markov model decomposition of speech and noise. In: Proceedings of ICASSP, pp. 845-848.
- (1990) Proceedings of ICASSP , pp. 845-848
- Varga, A.P.¹ Moore, R.K.²

52
- 85009113617
- Gestural overlap, place of articulation and speech rate: An x-ray investigation
- Vaxeclaire, B., Sock, R., Perrier, P., 2000. Gestural overlap, place of articulation and speech rate: an X-ray investigation. In: Proceedings of ICSLP, pp. II:166-II:169.
- (2000) Proceedings of ICSLP , pp. II:166-II:169
- Vaxelaire, B.¹ Sock, R.² Perrier, P.³

53
- 0038517644
- SRI switchboard progress and experiments
- Weintraub, M., Stolcke, A., Sankar, A., 1995. SRI Switchboard progress and experiments. In: Proceedings of DARPA LVCSR Workshop.
- (1995) Proceedings of DARPA LVCSR Workshop
- Weintraub, M.¹ Stolcke, A.² Sankar, A.³

54
- 4243661293
- Automatic learning of word pronunciation from data
- Technical Report, The Johns Hopkins University (Center for Language and Speech Procesing) Summer Research Workshop
- Weintraub, M., Wegmann, S., Kao, Y.-H., Khudanpur, S., Galles, C., Fosler, E., Saraclar, M., 1996. Automatic learning of word pronunciation from data. Technical Report, The Johns Hopkins University (Center for Language and Speech Procesing) Summer Research Workshop.
- (1996)
- Weintraub, M.¹ Wegmann, S.² Kao, Y.-H.³ Khudanpur, S.⁴ Galles, C.⁵ Fosler, E.⁶ Saraclar, M.⁷

55
- 0038517643
- Progress towards improved speech modelling using asynchronous sub-bands and formant frequencies
- Wilkinson, N., Russell, M.J., 2001. Progress towards improved speech modelling using asynchronous sub-bands and formant frequencies. In: Proceedings of Institute of Acoustics WISP, Stratford-upon-Avon, UK.
- (2001) Proceedings of Institute of Acoustics WISP, Stratford-Upon-Avon, UK
- Wilkinson, N.¹ Russell, M.J.²

56
- 0003822743
- ECRL
- Young, S., Jansen, J., Odell, J., Ollason, D., Woodland, P., 1995. The HTK Book (Version 2.0), ECRL.
- (1995) The HTK Book (Version 2.0)
- Young, S.¹ Jansen, J.² Odell, J.³ Ollason, D.⁴ Woodland, P.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.