메뉴 건너뛰기




Volumn 17, Issue 2-3, 2003, Pages 233-262

Parameter reduction schemes for loosely coupled HMMs

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES; MATHEMATICAL MODELS; STATE SPACE METHODS; TOPOLOGY;

EID: 0038697760     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0885-2308(03)00009-3     Document Type: Article
Times cited : (8)

References (56)
  • 1
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • Baum, L.E., Petrie, T., Soules, G., Weiss, N., 1970. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics 41 (1), 164-171.
    • (1970) Annals of Mathematical Statistics , vol.41 , Issue.1 , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 2
    • 0037841255 scopus 로고    scopus 로고
    • Multi-stream speech recognition
    • Technical Report IDIAP-RR 96-07, IDIAP
    • Bourlard, H., Dupont, S., Ris, C., 1996. Multi-stream speech recognition. Technical Report IDIAP-RR 96-07, IDIAP.
    • (1996)
    • Bourlard, H.1    Dupont, S.2    Ris, C.3
  • 3
    • 0030685285 scopus 로고    scopus 로고
    • Coupled hidden Markov models for complex action recognition
    • Brand, M., Oliver, N., Pentland, A., 1997. Coupled hidden Markov models for complex action recognition. In: Proceedings of IEEE CVPR, pp. 994-999.
    • (1997) Proceedings of IEEE CVPR , pp. 994-999
    • Brand, M.1    Oliver, N.2    Pentland, A.3
  • 5
    • 0003640523 scopus 로고
    • The ISOLET spoken letter database
    • Technical Report CSE 90-004, OGI
    • Cole, R., Muthusamy, Y., Fanty, M., 1990. The ISOLET spoken letter database. Technical Report CSE 90-004, OGI.
    • (1990)
    • Cole, R.1    Muthusamy, Y.2    Fanty, M.3
  • 6
    • 84875738293 scopus 로고    scopus 로고
    • A new approach for multi-band speech recognition based on probabilistic graphical models
    • Daoudi, K., Fohr, D., Antoine, C., 2000. A new approach for multi-band speech recognition based on probabilistic graphical models. In: Proceedings of ICSLP, pp. I:329-332.
    • (2000) Proceedings of ICSLP , pp. 329-332
    • Daoudi, K.1    Fohr, D.2    Antoine, C.3
  • 8
    • 0026458724 scopus 로고
    • Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech units
    • Deng, L., Erler, K., 1992. Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: comparison with segmental speech units. Journal of the Acoustical Society of America 92 (92), 3058-3067.
    • (1992) Journal of the Acoustical Society of America , vol.92 , Issue.92 , pp. 3058-3067
    • Deng, L.1    Erler, K.2
  • 9
    • 80053229524 scopus 로고    scopus 로고
    • Modeling and efficient decoding of large vocabulary conversational speech
    • Finke, M., Fritsch, J., Koll, D., Waibel, A., 1999. Modeling and efficient decoding of large vocabulary conversational speech. In: Proceedings of Eurospeech, pp. 467-470.
    • (1999) Proceedings of Eurospeech , pp. 467-470
    • Finke, M.1    Fritsch, J.2    Koll, D.3    Waibel, A.4
  • 10
    • 85027454087 scopus 로고    scopus 로고
    • Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
    • Finke, M., Waibel, A., 1997. Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition. In: Proceedings of Eurospeech, pp. 2379-2382.
    • (1997) Proceedings of Eurospeech , pp. 2379-2382
    • Finke, M.1    Waibel, A.2
  • 11
    • 0003671941 scopus 로고
    • Model-based techniques for noise robust speech recognition
    • PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
    • Gales, M.J.F., 1995. Model-based techniques for noise robust speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
    • (1995)
    • Gales, M.J.F.1
  • 12
  • 13
    • 0024909979 scopus 로고
    • Some statistical issues in the comparison of speech recognition algorithms
    • Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proceedings of ICASSP, pp. 532-535.
    • (1989) Proceedings of ICASSP , pp. 532-535
    • Gillick, L.1    Cox, S.J.2
  • 14
    • 0142005771 scopus 로고    scopus 로고
    • Dynamic HMM selection for continuous speech recognition
    • Hain, T., Woodland, P.C., 1999. Dynamic HMM selection for continuous speech recognition. In: Proceedings of Eurospeech, pp. 532-535.
    • (1999) Proceedings of Eurospeech , pp. 532-535
    • Hain, T.1    Woodland, P.C.2
  • 15
    • 85009113591 scopus 로고    scopus 로고
    • Modelling sub-phone insertions and deletions in continuous speech recognition
    • Hain, T., Woodland, P.C., 2000. Modelling sub-phone insertions and deletions in continuous speech recognition. In: Proceedings of ICSLP, pp. IV:172-175.
    • (2000) Proceedings of ICSLP , pp. 172-175
    • Hain, T.1    Woodland, P.C.2
  • 19
    • 85032275702 scopus 로고    scopus 로고
    • Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition
    • Humphries, J.J., Woodland, P.C., 1997. Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition. In: Proceedings of Eurospeech, pp. 2367-2370.
    • (1997) Proceedings of Eurospeech , pp. 2367-2370
    • Humphries, J.J.1    Woodland, P.C.2
  • 20
    • 0003857779 scopus 로고    scopus 로고
    • Discriminative training of hidden Markov models
    • PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
    • Kapadia, S., 1998. Discriminative training of hidden Markov models. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
    • (1998)
    • Kapadia, S.1
  • 21
    • 4243728511 scopus 로고    scopus 로고
    • Word-level phonetic variation in large speech corpora
    • In: Pompino-Marschal, B. (Ed.), ZAS Working Papers in Linguistics
    • Keating, P., 1997. Word-level phonetic variation in large speech corpora. In: Pompino-Marschal, B. (Ed.), ZAS Working Papers in Linguistics. Available from
    • (1997)
    • Keating, P.1
  • 23
    • 0034297586 scopus 로고    scopus 로고
    • Detection of phonological features in continuous speech using neural networks
    • King, S., Taylor, P., 2000. Detection of phonological features in continuous speech using neural networks. Computer Speech and Language 14 (4), 333-353.
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 333-353
    • King, S.1    Taylor, P.2
  • 24
    • 19544365323 scopus 로고    scopus 로고
    • COMLEX pronouncing lexicon (renamed in 1997 release as CALLHOME American English lexicon)
    • Available from Linguistic Data Consortium
    • Kingsbury, P., Strassel, S., McLemore, C., 1997. COMLEX pronouncing lexicon (renamed in 1997 release as CALLHOME American English lexicon). Available from Linguistic Data Consortium
    • (1997)
    • Kingsbury, P.1    Strassel, S.2    McLemore, C.3
  • 25
    • 0003424928 scopus 로고    scopus 로고
    • Robust speech recognition using articulatory information
    • PhD Thesis, University of Bielefeld, Germany
    • Kirchhoff, K., 1999. Robust speech recognition using articulatory information. PhD Thesis, University of Bielefeld, Germany.
    • (1999)
    • Kirchhoff, K.1
  • 26
    • 0030351374 scopus 로고    scopus 로고
    • On designing pronunciation lexicons for large vocabulary, continuous speech recognition
    • Lamel, L., Adda, G., 1996. On designing pronunciation lexicons for large vocabulary, continuous speech recognition. In: Proceedings of ICSLP, pp. 6-9.
    • (1996) Proceedings of ICSLP , pp. 6-9
    • Lamel, L.1    Adda, G.2
  • 28
    • 0021226391 scopus 로고
    • A database for speaker-independent digit recognition
    • Leonard, R. G., 1984. A database for speaker-independent digit recognition. In: Proceedings of ICASSP, pp. 42.11-14.
    • (1984) Proceedings of ICASSP , pp. 11-14
    • Leonard, R.G.1
  • 30
    • 0038178805 scopus 로고    scopus 로고
    • Factorial hidden Markov models for speech recognition: Preliminary experiments
    • Technical Report 97/7, Cambridge Research Laboratory
    • Logan, B., Moreno, P.J., 1997. Factorial hidden Markov models for speech recognition: preliminary experiments. Technical Report 97/7, Cambridge Research Laboratory.
    • (1997)
    • Logan, B.1    Moreno, P.J.2
  • 31
    • 85009078254 scopus 로고    scopus 로고
    • Asynchrony with trained transition probabilities improves performance in multi-band speech recognition
    • Mak, B., Tam, Y.-C., 2000. Asynchrony with trained transition probabilities improves performance in multi-band speech recognition. In: Proceedings of ICSLP, pp. IV:149-152.
    • (2000) Proceedings of ICSLP , pp. 149-152
    • Mak, B.1    Tam, Y.-C.2
  • 32
    • 0006133268 scopus 로고    scopus 로고
    • Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition
    • McMahon, P., McCourt, P., Vaseghi, S., 1998. Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition. In: Proceedings of ICSLP, pp. 1055-1058.
    • (1998) Proceedings of ICSLP , pp. 1055-1058
    • McMahon, P.1    McCourt, P.2    Vaseghi, S.3
  • 33
    • 0004119130 scopus 로고    scopus 로고
    • A multi-band approach to automatic speech recognition
    • PhD Thesis, ICSI, UC Berkeley, CA, USA
    • Mirghafori, N. 1999. A Multi-band approach to automatic speech recognition. PhD Thesis, ICSI, UC Berkeley, CA, USA.
    • (1999)
    • Mirghafori, N.1
  • 34
    • 0004052871 scopus 로고    scopus 로고
    • Audio-visual speech recognition
    • Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop
    • Neti, C., Potamianos, G., Luettin, J., Matthews, I., Herve Gtin, Vergyri, D., Sison, J., Mashari, J., Zhou, J., 2000. Audio-visual speech recognition. Technical Report, The Johns Hopkins University (Center for Language and Speech Processing) Summer Research Workshop.
    • (2000)
    • Neti, C.1    Potamianos, G.2    Luettin, J.3    Matthews, I.4    Herve, G.5    Vergyri, D.6    Sison, J.7    Mashari, J.8    Zhou, J.9
  • 35
    • 85002102457 scopus 로고    scopus 로고
    • NIST. Score package
    • NIST. Score package. Available from.
  • 36
    • 0003951389 scopus 로고    scopus 로고
    • Techniques for modelling phonological processes in automatic speech recognition
    • PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK, August
    • Nock, H.J., 2001. Techniques for modelling phonological processes in automatic speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK, August.
    • (2001)
    • Nock, H.J.1
  • 37
  • 38
    • 0036081023 scopus 로고    scopus 로고
    • Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models
    • Nock, H.J., Young, S.J., 2002. Modelling asynchrony in automatic speech recognition using loosely coupled hidden Markov models. Cognitive Science 26 (3), 283-301.
    • (2002) Cognitive Science , vol.26 , Issue.3 , pp. 283-301
    • Nock, H.J.1    Young, S.J.2
  • 39
    • 0003805597 scopus 로고
    • The use of context in large vocabulary speech recognition
    • PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK
    • Odell, J., 1995. The use of context in large vocabulary speech recognition. PhD Thesis, Cambridge University Engineering Dept., Cambridge, UK.
    • (1995)
    • Odell, J.1
  • 40
    • 0347307067 scopus 로고    scopus 로고
    • Incorporating linguistic theories of pronunciation variation into speech recognition models
    • London, UK
    • Ostendorf, M., 2000. Incorporating linguistic theories of pronunciation variation into speech recognition models. In: Philosophical Transactions of Royal Society, vol. 358, London, UK, pp. 1325-1338.
    • (2000) Philosophical Transactions of Royal Society , vol.358 , pp. 1325-1338
    • Ostendorf, M.1
  • 41
    • 0030715097 scopus 로고    scopus 로고
    • HMM topology design using maximum likelihood successive state splitting
    • Ostendorf, M., Singer, H., 1997. HMM topology design using maximum likelihood successive state splitting. Computer Speech and Language 11, 17-41.
    • (1997) Computer Speech and Language , vol.11 , pp. 17-41
    • Ostendorf, M.1    Singer, H.2
  • 43
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L.R., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE 77(2), 257-285.
    • (1989) Proceedings of IEEE , vol.77 , Issue.2 , pp. 257-285
    • Rabiner, L.R.1
  • 44
  • 45
    • 0026405248 scopus 로고
    • A statistical model for generating pronunciation networks
    • Riley, M.D., 1991. A statistical model for generating pronunciation networks. In: Proceedings of ICASSP, pp. 737-740.
    • (1991) Proceedings of ICASSP , pp. 737-740
    • Riley, M.D.1
  • 46
    • 0005921215 scopus 로고    scopus 로고
    • Pronunciation modeling for conversational speech recognition
    • PhD Thesis, The Johns Hopkins University, MD, USA
    • Saraclar, M., 2000. Pronunciation modeling for conversational speech recognition. PhD Thesis, The Johns Hopkins University, MD, USA.
    • (2000)
    • Saraclar, M.1
  • 47
    • 0000114416 scopus 로고    scopus 로고
    • Pronunciation modeling by sharing Gaussian densities across phonetic models
    • Saraclar, M., Nock, H., Khudanpur, S., 2000. Pronunciation modeling by sharing Gaussian densities across phonetic models. Computer Speech and Language 14 (2), 137-160.
    • (2000) Computer Speech and Language , vol.14 , Issue.2 , pp. 137-160
    • Saraclar, M.1    Nock, H.2    Khudanpur, S.3
  • 48
    • 0032596518 scopus 로고    scopus 로고
    • Mixed memory Markov models
    • Saul, L. K., Jordan, M.I., 1999. Mixed memory Markov models. Machine Learning 37, 75-87.
    • (1999) Machine Learning , vol.37 , pp. 75-87
    • Saul, L.K.1    Jordan, M.I.2
  • 49
    • 0029747053 scopus 로고    scopus 로고
    • Integrating audio and visual information to provide highly robust speech recognition
    • Tomlinson, M.J., Russell, M.J., Brooke, N.M., 1996. Integrating audio and visual information to provide highly robust speech recognition. In: Proceedings of ICASSP, vol. II, pp. 821-824.
    • (1996) Proceedings of ICASSP , vol.2 , pp. 821-824
    • Tomlinson, M.J.1    Russell, M.J.2    Brooke, N.M.3
  • 51
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • Varga, A.P., Moore, R.K., 1990. Hidden Markov model decomposition of speech and noise. In: Proceedings of ICASSP, pp. 845-848.
    • (1990) Proceedings of ICASSP , pp. 845-848
    • Varga, A.P.1    Moore, R.K.2
  • 52
    • 85009113617 scopus 로고    scopus 로고
    • Gestural overlap, place of articulation and speech rate: An x-ray investigation
    • Vaxeclaire, B., Sock, R., Perrier, P., 2000. Gestural overlap, place of articulation and speech rate: an X-ray investigation. In: Proceedings of ICSLP, pp. II:166-II:169.
    • (2000) Proceedings of ICSLP , pp. II:166-II:169
    • Vaxelaire, B.1    Sock, R.2    Perrier, P.3
  • 54
    • 4243661293 scopus 로고    scopus 로고
    • Automatic learning of word pronunciation from data
    • Technical Report, The Johns Hopkins University (Center for Language and Speech Procesing) Summer Research Workshop
    • Weintraub, M., Wegmann, S., Kao, Y.-H., Khudanpur, S., Galles, C., Fosler, E., Saraclar, M., 1996. Automatic learning of word pronunciation from data. Technical Report, The Johns Hopkins University (Center for Language and Speech Procesing) Summer Research Workshop.
    • (1996)
    • Weintraub, M.1    Wegmann, S.2    Kao, Y.-H.3    Khudanpur, S.4    Galles, C.5    Fosler, E.6    Saraclar, M.7


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.